






1942 


Annals of Mathematics 

(FOUNDED BY ORMOND STONE)


EDITED BY 

SOLOMON LEFSCHETZ JOHN VON NEUMANN 

FREDERIC BOHNENBLUST

WITH THE COOPERATION OF THE 

DEPARTMENT OF MATHEMATICS OF PRINCETON UNIVERSITY 

AND 

THE SCHOOL OF MATHEMATICS OF THE INSTITUTE 
FOR ADVANCED STUDY 

AND 

A. A. Albert          T. H. Hildebrandt²    M. H. Stone

Emil Artin            Nathan Jacobson       Anna Pell-Wheeler

Garrett Birkhoff      Saunders Mac Lane¹    G. T. Whyburn²

M. R. Hestenes¹       N. E. Steenrod        Oscar Zariski²

(¹ Representing the Mathematical Association of America)

(² Representing the American Mathematical Society)


SECOND SERIES, VOL. 43 


PUBLISHED QUARTERLY 

AT 

MOUNT ROYAL AND GUILFORD AVENUES 

BALTIMORE, MD. 

BY THE 

PRINCETON UNIVERSITY PRESS 

PRINCETON, N. J. 

Copyright, 1942, by Princeton University Press 




W. J. TRJITZINSKY


A considerable part of the developments in this work refers to equations
containing a parameter $\lambda$, entering linearly in $C$. The results for
$\lambda$ non-real and $F$ self-adjoint are partially analogous to those found
in (C). The theory for $\lambda$ real presents additional complications and has
some analogy with a work of W. J. Trjitzinsky⁴ in the field of singular
integral equations.

In sections 2, 3 we carry out the transformation into an integral equation
(Theorem 3.1). In section 4 the kernel of this equation is investigated in some
detail (Theorem 4.1) and the equation is iterated a finite number of times, which
yields an integral equation satisfying certain regularity properties. This
equation plays an essential role in the subsequent developments. In section 5 we
introduce characteristic functions and values (cf. (5.6), (5.9)) for certain
approximating homogeneous boundary value problems ((5.7), (5.10)); we establish
for these functions orthogonality properties (Theorem 5.1).

The developments in sections 6, 7, 8, 9, 10 are for $F$ self-adjoint.

In section 6 we first establish existence of solutions for the non-homogeneous
problem (5.1) when $\Im\lambda \neq 0$ (Theorem 6.1). In the complex
$\lambda$-plane we define sets T (Definition 6.1) and particular sets T, which
we term $\theta$ (see text after (6.14)); sets T contain all the points off the
axis of reals and may contain certain points on the axis of reals, depending on
sufficient ‘rarefication’ of the set of points represented by the totality of
characteristic values formed for an infinite sequence of approximating
homogeneous boundary value problems. We establish existence of solutions,
$\subset L_2$, of (5.1) when $\lambda$ is in T (Theorem 6.2). In section 7
there is developed a corresponding spectral theory (Theorem 7.1). In sections 8
and 9 this theory is applied; we introduce the notion of ‘closure’ of $F$ with
respect to a spectrum $\theta$ (Definition 8.1) and give a generalized Fourier
expansion of functions, $\subset L_2$ (in $D$), in terms of $\theta$
(Theorem 8.1). Further, there is given a representation for solutions, referred
to in Theorem 6.2, with the aid of $\theta$ (Theorem 8.2). In section 9 various
representations, convergent in the mean square, are given. In section 10 we
introduce certain operators $A_\lambda(\cdots)$, with the aid of which certain
particular solutions of (5.1) may be expressed for $\lambda$ in $\theta$
(Theorem 10.1). With the aid of these operators we investigate conditions under
which $F$ is of class I (Theorem 10.2, Definition 6.2) and, finally, questions
of uniqueness are studied for $\lambda$ non-real (Theorem 10.3) and for
$\lambda$ in $\theta$ (Theorem 10.4).

In section 5 a certain “regularity” hypothesis is introduced for the domain $D$;
this is needed only in the developments subsequent to (5.7). Some lightening
of the conditions imposed on the coefficients in (1.1) is possible without any
essential change in the conclusions; for simplicity this lightening of the
hypothesis will be avoided.

The case when F is of class I corresponds (in the field of partial differential 
equations), as pointed out by Carleman, to the case of “Grenzpunkt” in the well 
known theory of H. Weyl in the field of ordinary differential equations. 


4 W. J. Trjitzinsky, Some problems in the theory of singular integral equations [Annals of 
Math. (1940), 684-619]. 



SINGULAR ELLIPTIC PARTIAL DIFFERENTIAL EQUATIONS 




Every solution $u$ of (3.37), such that the $D_2 u$ are continuous in $D$, is a
solution of (1.1) in the ordinary sense; (3.37) thus generalizes the operator
$F$; with $F$ assigned this generalized meaning, the partial differential
equation and the integral equation are equivalent.

2. Preliminaries 

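The adjoint of $F(u)$ is

(2.1)  $G(v) \equiv \sum_{i,k} \frac{\partial^2}{\partial x_i\,\partial x_k}(A_{i,k}v) - \sum_i \frac{\partial}{\partial x_i}(B_i v) + Cv.$

The characteristic form for (1.1) is

(2.2)  $A(\gamma_1, \dots, \gamma_m) = \sum_{i,k} A_{i,k}\gamma_i\gamma_k = A(\gamma_1, \dots, \gamma_m;\ x_1, \dots, x_m).$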

We shall make use of the symbol

$\frac{d}{d\nu}$

denoting operation of derivation along the transversal direction (cf. (H; 87)).
This operation is to be understood as follows. We form

(2.3)  $A = A(\pi_1, \dots, \pi_m),$

where the $\pi_i$ are direction cosines of the normal to a surface, say

$H \equiv H(x_1, \dots, x_m) = 0;$

that is

(2.4)  $\pi_i = \Bigl[\Bigl(\frac{\partial H}{\partial x_1}\Bigr)^2 + \dots + \Bigl(\frac{\partial H}{\partial x_m}\Bigr)^2\Bigr]^{-1/2} \frac{\partial H}{\partial x_i},$

the supposition being, of course, that the $D_1 H$ exist in the domain under
consideration. The transversal direction is defined by the ordinary differential
system (generally non linear):

(2.5)  $\frac{dx_1}{\varpi_1} = \frac{dx_2}{\varpi_2} = \dots = \frac{dx_m}{\varpi_m},$

where

(2.5a)  $\varpi_k = \varpi_k(x_1, \dots, x_m) = \sum_{i=1}^{m} A_{i,k}\pi_i \qquad (k = 1, \dots, m).$⁵

In the case when

$\sum_{i,k} A_{i,k}\frac{\partial^2}{\partial x_i\,\partial x_k} = \Delta$  (Laplacian)


5 See (H; 88) for a detailed discussion of the transversal direction.





the “transversal” direction will signify “normal” direction. In view of (2.5), 
(2.5a) we are brought to the definition 


(2.6)  $\frac{du}{d\nu} = \sum_k \frac{\partial u}{\partial x_k}\frac{dx_k}{d\nu} = \sum_{k=1}^{m} \frac12 \frac{\partial A}{\partial \pi_k}\frac{\partial u}{\partial x_k}$  (cf. (2.3)).

We have on hand the following fundamental formula (see (H)):

(2.7)  $\int^{(m)} [vF(u) - uG(v)]\,dT = -\int^{(m-1)} \Bigl[v\frac{du}{d\nu} - u\frac{dv}{d\nu} + Luv\Bigr]\,dS$

[$F$ from (1.1); $G$ from (2.1)]. Here the first integral displayed is over the
space $T$ bounded by the surface $H = 0$; the integral of the second member is
over the surface $S$ constituting the frontier of $T$; moreover,

(2.7a)  $L = \sum_i \pi_i \Bigl(B_i - \sum_k \frac{\partial A_{i,k}}{\partial x_k}\Bigr)$

and

$dT = dx_1 \cdots dx_m, \qquad dS = \Bigl[\Bigl(\frac{\partial H}{\partial x_1}\Bigr)^2 + \dots + \Bigl(\frac{\partial H}{\partial x_m}\Bigr)^2\Bigr]^{1/2} \frac{dT}{dH}$  (cf. (2.4)).

Of importance for us will be a function of $(x)$ and $(z)$,

$\Gamma(x, z) = \Gamma(x_1, \dots, x_m;\ z_1, \dots, z_m),$

satisfying the partial differential equation

(2.8)  $A\Bigl(\frac{\partial \Gamma}{\partial x_1}, \dots, \frac{\partial \Gamma}{\partial x_m};\ x_1, \dots, x_m\Bigr) = 4\Gamma,$

namely, the function for which

$\Gamma^{1/2}(x; z)$

denotes the “geodesic distance”⁶ between the points $(x)$ and $(z)$. This
distance is formed with respect to the quadratic form

(2.9)  $H = H(\gamma_1, \dots, \gamma_m) = \sum_{i,k} H_{i,k}(x)\gamma_i\gamma_k,$

“associated” with the quadratic form $A(\gamma_1, \dots, \gamma_m)$ (2.2). In
this connection it is to be noted that, if

$(A_{i,j}) \qquad (i, j = 1, \dots, m)$

denotes a matrix with $A_{i,j}$ in the $i$-th row and $j$-th column, then

$(H_{i,j}) = (A_{i,j})^{-1};$

that is, the matrix $(H_{i,j})$ is the inverse of the matrix $(A_{i,j})$.

With the aid of the developments given in (H; pp. 120-124) we shall study
$\Gamma(x; z)$ in greater detail.
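
As a quick plausibility check of (2.8) and of the role of the inverse matrix, one may look at the special case, assumed here purely for illustration, in which the coefficients of the characteristic form are constant and symmetric. Writing $H$ for the inverse matrix, the quadratic form in $(x) - (z)$ built on $H$ satisfies (2.8), as the following sketch shows; with the identity matrix it reduces to the squared Euclidean distance.

```latex
% Illustrative assumption (not from the text): A_{i,k} constant, symmetric,
% and (H_{i,k}) = (A_{i,k})^{-1}.
\begin{align*}
\Gamma(x,z) &= \sum_{k,l} H_{k,l}\,(x_k - z_k)(x_l - z_l),
\qquad
\frac{\partial \Gamma}{\partial x_i} = 2\sum_{k} H_{i,k}\,(x_k - z_k),\\
\sum_{i,j} A_{i,j}\,\frac{\partial\Gamma}{\partial x_i}\,\frac{\partial\Gamma}{\partial x_j}
 &= 4\sum_{k,l}\Bigl(\sum_{i,j} H_{k,i}\,A_{i,j}\,H_{j,l}\Bigr)(x_k - z_k)(x_l - z_l)\\
 &= 4\sum_{k,l} H_{k,l}\,(x_k - z_k)(x_l - z_l) \;=\; 4\,\Gamma(x,z).
\end{align*}
```

The middle step uses only that the product of the inverse, the matrix and the inverse again returns the inverse.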


6 This is asserted under suitable regularity conditions, in the sequel satisfied.





Consider the differential problem

(2.10)  $\frac{dx_i}{ds} = \frac12\frac{\partial A}{\partial p_i}, \qquad \frac{dp_i}{ds} = -\frac12\frac{\partial A}{\partial x_i} \qquad (i = 1, \dots, m),$

where

(2.11)  $A = \sum_{i,k} A_{i,k}(x)\,p_i p_k,$

while the initial conditions are

(2.11a)  $x_i = z_i$ (for $s = 0$),  $p_i = p_{0,i}$ (for $s = 0$);

we write

$A_0(p_0) = \sum_{i,k} A_{i,k}(z)\,p_{0,i}\,p_{0,k} = c.$

As indicated in (H; pp. 120, 121) the solution of this problem may be written
in the form

(2.12)  $x_i = \varphi_i(q_1, \dots, q_m;\ z_1, \dots, z_m) \quad (q_i = s\,p_{0,i}),$
        $P_i = \psi_i(q_1, \dots, q_m;\ z_1, \dots, z_m) \quad (P_i = s\,p_i);$

from these relations one obtains

%i = <Pi( Pit * * * t Pm y X 1 , ••• 2l m ), = ^i( Pi , * * * t P m t %l y * * * #m). 

It is observed that (2.10) determines geodesics, with respect to the form $H$,
through the point $(z)$. Moreover, for solutions of (2.10) one has
$A(p, x) = \text{constant}$. On substituting $s = 0$ and using the initial
conditions this constant is found to be $c$.
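
The constancy of the characteristic form along solutions of (2.10) can be verified in one line. The sketch below only assumes that the solution is differentiable in $s$, so that the chain rule applies.

```latex
% Chain rule along a solution (x(s), p(s)) of (2.10).
\begin{align*}
\frac{d}{ds}A(p,x)
 &= \sum_i \frac{\partial A}{\partial x_i}\frac{dx_i}{ds}
  + \sum_i \frac{\partial A}{\partial p_i}\frac{dp_i}{ds}
  = \frac12\sum_i \frac{\partial A}{\partial x_i}\frac{\partial A}{\partial p_i}
  - \frac12\sum_i \frac{\partial A}{\partial p_i}\frac{\partial A}{\partial x_i} = 0,\\
A(p,x)\Big|_{s=0} &= \sum_{i,k} A_{i,k}(z)\,p_{0,i}\,p_{0,k} = c .
\end{align*}
```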

In consequence of known theorems for ordinary differential equations⁷ we
infer the following. If the

(2.13)  $A_{i,k}(x),\ D_\nu A_{i,k}(x) \qquad (\nu = 1, \dots, 4)$

are continuous for $(x)$ in $D$, which we henceforth assume, then the elements
$x_i$, $P_i$ of the solution (2.12) are continuous and have continuous partial
derivatives of the first and second orders with respect to the $q_j$ and the
$z_j$; furthermore, the third order derivatives will exist. More generally, if
the $D_{N+1}A_{i,k}(x)$ are continuous, the partial derivatives with respect to
the $q_j$ and $z_j$, of order $N - 1$, of the $x_i$ and $P_i$ will be continuous
and the derivatives of order $N$ will exist. The above statements refer to local
properties, for $(z)$ representing a point in $D$. According to (H)

(2.14)  $\Gamma(x, z) = A(q_1, \dots, q_m;\ z_1, \dots, z_m) = \sum_{i,k} A_{i,k}(z)\,q_i q_k.$


7 Cotton, Sur les équations différentielles dépendant de paramètres arbitraires
[Bull. Soc. math., t. 37 (1909), 204-214]. Also see t. 38 (1918), p. 4.





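Here the $q_j$ are to be replaced by the expressions, in terms of the $z_j$ and
$x_j$, obtainable from the first $m$ relations (2.12).

We have

(2.15)  $x_i = x_i(s) = x_i(0) + x_i^{(1)}(0)\,\frac{s}{1!} + \bigl(x_i^{(2)}(0) + \epsilon_i(s)\bigr)\frac{s^2}{2!},$

where

$\lim_{s \to 0} \epsilon_i(s) = 0 \qquad (i = 1, \dots, m).$

By (2.11a) $x_i(0) = z_i$; on the other hand, in view of (2.10) and (2.11)

$x_i^{(1)}(0) = \frac12\frac{\partial A}{\partial p_i}\Big|_{s=0} = \sum_k A_{i,k}(x)\,p_k\Big|_{s=0} = \sum_k A_{i,k}(z)\,p_{0,k},$

so that, by definition of $q_k$,

(2.16)  $s\,x_i^{(1)}(0) = \sum_k A_{i,k}(z)\,q_k.$

By (2.10)

(2.17)  $x_i^{(2)}(s) = \frac{d}{ds}\sum_\nu A_{i,\nu}(x)\,p_\nu = \sum_\nu A_{i,\nu}(x)\frac{dp_\nu}{ds} + \sum_\nu p_\nu \frac{dA_{i,\nu}(x)}{ds}.$

Now the first sum in the last member of (2.17) is

$g_i(s) = -\frac12\sum_\nu A_{i,\nu}(x)\frac{\partial A}{\partial x_\nu} = -\frac12\sum_{\nu,\alpha,\beta} A_{i,\nu}(x)\frac{\partial A_{\alpha,\beta}(x)}{\partial x_\nu}\,p_\alpha p_\beta;$

moreover, since $q_j = s\,p_{0,j}$,

(2.17a)  $s^2 g_i(0) = -\frac12\sum_{\nu,\alpha,\beta} A_{i,\nu}(z)\frac{\partial A_{\alpha,\beta}(z)}{\partial z_\nu}\,q_\alpha q_\beta.$

We have

$\frac{dA_{i,\nu}(x)}{ds} = \sum_\gamma \frac{\partial A_{i,\nu}(x)}{\partial x_\gamma}\frac{dx_\gamma}{ds},$

whence by (2.16)

$s\Bigl[\frac{dA_{i,\nu}(x)}{ds}\Bigr]_{s=0} = \sum_{\gamma=1}^{m} \frac{\partial A_{i,\nu}(z)}{\partial z_\gamma}\,s\,x_\gamma^{(1)}(0) = \sum_{\gamma,k} \frac{\partial A_{i,\nu}(z)}{\partial z_\gamma}A_{\gamma,k}(z)\,q_k$

and

(2.17b)  $s^2\Bigl[\sum_\nu p_\nu \frac{dA_{i,\nu}(x)}{ds}\Bigr]_{s=0} = \sum_{\nu,\gamma,k} q_\nu q_k\,A_{\gamma,k}(z)\frac{\partial A_{i,\nu}(z)}{\partial z_\gamma}.$

In consequence of (2.17a), (2.17b) from (2.17) one infers that

$\frac{s^2}{2}\,x_i^{(2)}(0) = \frac12\bigl[s^2 g_i(0) + \text{last member of (2.17b)}\bigr] = \sum_{\alpha,\beta} A_{i:\alpha,\beta}(z)\,q_\alpha q_\beta,$

(2.18)  $A_{i:\alpha,\beta}(z) = -\frac14\sum_\nu A_{i,\nu}(z)\frac{\partial A_{\alpha,\beta}(z)}{\partial z_\nu} + \frac12\sum_\gamma A_{\gamma,\beta}(z)\frac{\partial A_{i,\alpha}(z)}{\partial z_\gamma}.$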





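Finally, one may write (2.15) in the form

(2.19)  $x_i - z_i = \sum_k A_{i,k}(z)\,q_k + \sum_{\alpha,\beta}\bigl(A_{i:\alpha,\beta}(z) + \lambda_{i:\alpha,\beta}(z; s)\bigr)q_\alpha q_\beta,$

where the $A_{i:\alpha,\beta}(z)$ are from (2.18) and

$\lim \lambda_{i:\alpha,\beta}(z; s) = 0$  (as $(q) \to (0)$; $(z)$ in $D$);

furthermore, the $\lambda_{i:\alpha,\beta}$ are continuous in

$z_1, \dots, z_m,\ q_1, \dots, q_m \qquad (|q_j| \le q_0;\ q_0 > 0;\ (z) \text{ in } D).$

Inversion of (2.19) will give

(2.20)  $q_i = \sum_k H_{i,k}(z)(x_k - z_k) + \sum_{\alpha,\beta}\bigl(H_{i:\alpha,\beta}(z) + \bar\lambda_{i:\alpha,\beta}\bigr)(x_\alpha - z_\alpha)(x_\beta - z_\beta),$

where

(2.20a)  $H_{i:\sigma,\tau}(z) = -\sum_{\alpha,\beta,k} A_{k:\alpha,\beta}(z)\,H_{i,k}(z)\,H_{\alpha,\sigma}(z)\,H_{\beta,\tau}(z)$  (cf. (2.18))

and the $\bar\lambda$ are continuous in the $x_j$, $z_j$, while

(2.20b)  $\lim \bar\lambda_{i:\alpha,\beta} = \lim \bar\lambda_{i:\alpha,\beta}(z; x) = 0$  (as $(x) \to (z)$; $(z)$ in $D$).

We note that the $H_{i:\sigma,\tau}$ are polynomials in the

$A_{i,j}(z),\ D_1 A_{i,j}(z),\ H_{i,j}(z).$

Accordingly, the functions of (2.20a) are continuous in $D$.

Substitution of (2.20) into (2.14) will yield

$\Gamma(x, z) = \sum_{k,k_1} H_{k,k_1}(z)(x_k - z_k)(x_{k_1} - z_{k_1})$
$\qquad + 2\sum_{\alpha,\beta,k_1} H_{k_1:\alpha,\beta}(x, z)(x_\alpha - z_\alpha)(x_\beta - z_\beta)(x_{k_1} - z_{k_1}) + \Gamma_4,$

where

$H_{k_1:\alpha,\beta}(x, z) = H_{k_1:\alpha,\beta}(z) + \bar\lambda_{k_1:\alpha,\beta}$  (cf. (2.20a), (2.20b))

and

$\Gamma_4 = \sum_{\alpha,\beta,\alpha_1,\beta_1} (x_\alpha - z_\alpha)(x_\beta - z_\beta)(x_{\alpha_1} - z_{\alpha_1})(x_{\beta_1} - z_{\beta_1}) \sum_{\tau,\nu} A_{\tau,\nu}(z)\,H_{\tau:\alpha,\beta}(x, z)\,H_{\nu:\alpha_1,\beta_1}(x, z).$

Finally, we obtain

(2.21)  $\Gamma(x, z) = \sum_{k,k_1} H_{k,k_1}(z)(x_k - z_k)(x_{k_1} - z_{k_1}) + \Gamma_3(x, z),$

where

(2.21a)  $\Gamma_3(x, z) = \sum_{\alpha,\beta,k_1}\bigl(2H_{k_1:\alpha,\beta}(z) + \lambda^*_{k_1:\alpha,\beta}(x, z)\bigr)(x_\alpha - z_\alpha)(x_\beta - z_\beta)(x_{k_1} - z_{k_1})$

(cf. (2.20a)); here

(2.21b)  $\lambda^*_{k_1:\alpha,\beta}(x, z) \to 0$  (as $(x) \to (z)$; $(z)$ in $D$);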





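the functions, involved in (2.21b), are continuous in $(x)$ for $(x)$ in a
neighborhood of $(z)$.

When

$\sum_{i,k} A_{i,k}\frac{\partial^2}{\partial x_i\,\partial x_k} = \Delta$  (Laplacian),

we have

$\Gamma(x, z) = (x_1 - z_1)^2 + \dots + (x_m - z_m)^2.$

3. The transformation

In view of the definition (2.6) one has

(3.1)  $\frac{d}{d\nu}\Gamma(x, z) = \sum_{k} \frac12\frac{\partial A}{\partial \pi_k}\frac{\partial \Gamma}{\partial x_k}$  (cf. (2.4), (2.3)),

the derivative in the transversal direction being with respect to a surface
$H = 0$. With $\rho\,(> 0)$ suitably small we consider the surface

(3.2)  $H_\rho(x) \equiv \Gamma(x, z) - \rho^2 = 0$  ($(z)$ fixed in $D$).

Correspondingly

(3.3)  $\pi_i = \frac{\partial \Gamma}{\partial x_i}\,\frac{1}{\sigma(x, z)}, \qquad \sigma^2(x, z) = \Bigl(\frac{\partial \Gamma}{\partial x_1}\Bigr)^2 + \dots + \Bigl(\frac{\partial \Gamma}{\partial x_m}\Bigr)^2.$

By (2.3) and (3.1)

$\frac{d}{d\nu}\Gamma(x, z) = \sum_{\alpha,k} A_{\alpha,k}(x)\,\pi_\alpha \frac{\partial \Gamma}{\partial x_k}.$

Thus, in view of (3.3),

$\frac{d}{d\nu}\Gamma(x, z) = \sum_{\alpha,k} A_{\alpha,k}(x)\frac{\partial \Gamma}{\partial x_\alpha}\frac{\partial \Gamma}{\partial x_k}\,\frac{1}{\sigma(x, z)}$

and, by virtue of (2.8),

$\frac{d}{d\nu}\Gamma(x, z) = \frac{4\,\Gamma(x, z)}{\sigma(x, z)}.$

Whence, when $\alpha$ is a constant,

(3.4)  $\frac{d}{d\nu}\Gamma^\alpha(x, z) = \alpha\,\Gamma^{\alpha-1}(x, z)\frac{d}{d\nu}\Gamma(x, z) = \frac{4\alpha}{\sigma(x, z)}\,\Gamma^\alpha(x, z)$  (cf. (3.3)).

With a view to applying the fundamental formula (2.7) we wish to find a
function

(3.5)  $v(x, z) = T(\Gamma(x, z))$

such that

(3.5a)  $\frac{dv}{d\nu} = 0$  ($(z)$ fixed in $D$)

for $(x)$ on the surface $H_\rho(x) = 0$ (3.2).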





On taking account of (3.4) it is observed that a function (3.5) satisfying (3.5a)
may be taken of the form

(3.6)  $v(x, z) = \Gamma^{\frac12(2-m)}(x, z) + \frac{1}{\rho^{2m-4}}\,\Gamma^{\frac12(m-2)}(x, z) - \frac{2}{\rho^{m-2}}.$

Inasmuch as $m > 2$, it is noted that $v$ becomes infinite essentially as
$\Gamma^{\frac12(2-m)}(x, z)$ when $(x) \to (z)$.
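
The choice of the two correction terms in (3.6) can be checked directly. The sketch below assumes the reading of (3.6) that agrees with the derivatives quoted later in (4.4b), namely a profile $T$ with coefficients $\rho^{-(2m-4)}$ and $2\rho^{-(m-2)}$, where $\rho^2$ is the value of the squared geodesic distance on the surface (3.2); it shows that both $T$ and its first derivative vanish on that surface.

```latex
% Assumed reading of (3.6):
%   T(\Gamma) = \Gamma^{(2-m)/2} + \rho^{-(2m-4)} \Gamma^{(m-2)/2} - 2 \rho^{-(m-2)}.
\begin{align*}
T(\rho^2) &= \rho^{\,2-m} + \rho^{-(2m-4)}\rho^{\,m-2} - 2\rho^{\,2-m} = 0,\\
T^{(1)}(\Gamma) &= \frac{m-2}{2}\Bigl[\rho^{-(2m-4)}\,\Gamma^{\frac12(m-4)} - \Gamma^{-\frac12 m}\Bigr],
\qquad
T^{(1)}(\rho^2) = \frac{m-2}{2}\bigl[\rho^{-m} - \rho^{-m}\bigr] = 0 .
\end{align*}
```

Since the transversal derivative of $v$ is $T^{(1)}$ times the transversal derivative of the squared geodesic distance, the second line gives (3.5a); the first line is what allows $v$ to be continued by zero outside the surface.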

With

(3.7)  $r^2 = (x_1 - z_1)^2 + \dots + (x_m - z_m)^2$  ($(z)$ fixed in $D$)

the differential element of volume, $dT$, may be given in the form

(3.7a)  $dT = r^{m-1}\,dr\,d\Omega,$

where $d\Omega$ is the element of surface of a hypersphere of radius unity in
the $m$-space of $(x)$. We recall that the total surface of this hypersphere is
$S_m = 2\pi^{\frac12 m}/g\bigl(\tfrac{m}{2}\bigr)$, where $g(u)$ is the
Gamma-function. The volume is $\frac{1}{m}S_m$. From (3.7a) it is inferred that
the element of surface of a hypersphere of radius $r$ is

(3.7b)  $d\omega = r^{m-1}\,d\Omega.$
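
For orientation, the surface and volume formulas just quoted can be evaluated in low dimensions. In the sketch below $g$ denotes the Gamma-function as in the text, and the values $g(1) = 1$, $g(3/2) = \tfrac12\sqrt{\pi}$ are the standard ones.

```latex
\begin{align*}
S_2 &= \frac{2\pi}{g(1)} = 2\pi, \qquad
S_3 = \frac{2\pi^{3/2}}{g(3/2)} = \frac{2\pi^{3/2}}{\tfrac12\sqrt{\pi}} = 4\pi,\\
\text{volume of the unit ball} &= \int^{(m-1)}\! d\Omega \int_0^1 r^{m-1}\,dr = \frac{S_m}{m}
\qquad \bigl(m = 3:\ \tfrac{4\pi}{3}\bigr).
\end{align*}
```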





The element $dS$ of the surface (3.2) satisfies the relation

(3.8)  $dS \cos\tau = d\omega,$

where $\tau$ is the angle formed by the line extending from $(z)$ and having
direction cosines $(x_i - z_i)/r$ and by a suitable direction of the normal to
the surface (3.2) at $(x)$; the direction cosines of this normal are the $\pi_i$
of (3.3) (with a suitable sign for $\sigma(x, z)$); thus,

$\cos\tau = \frac{1}{r\,\sigma(x, z)}\sum_i \frac{\partial \Gamma(x, z)}{\partial x_i}(x_i - z_i)$

and, by (3.8) and (3.7b),

(3.9)  $dS = \frac{\sigma(x, z)\,r^m\,d\Omega}{\sum_i (x_i - z_i)\,\dfrac{\partial \Gamma(x, z)}{\partial x_i}}$  (cf. (3.3)).

Let $K_\delta$ denote a sphere with center at $(z)$ and radius $\delta$
$(0 < \delta \le \delta_0)$, where $\delta_0$ is taken sufficiently small so
that the surface $S_\delta$, of $K_\delta$, lies interior the domain
$\Gamma_\rho$ bounded by the surface (3.2).⁸ We have a domain

(3.10)  $D(\rho, \delta),$

whose frontier consists of the spherical surface $S_\delta$ and of the surface
$H_\rho(x) = 0$.


8 $\Gamma_\rho$ contains $(z)$ in the interior and is topologically a sphere.





Suppose $u = u(x)$ is a function, continuous with the $D_2 u$, satisfying in $D$
the differential equation (1.1). We substitute in the fundamental formula (2.7)
$u$ equal to the aforesaid function and $v = v(x, z)$, from (3.6), and we extend
the integrations over the domain $D(\rho, \delta)$ (cf. (3.10)) and over the
frontier of $D(\rho, \delta)$; the variable of integration is to be $(x)$. Thus,

(3.10a)  $\int_{D(\rho,\delta)}^{(m)} [vf - uG(v)]\,dT = Q(\rho, \delta),$

where

$Q(\rho, \delta) = -\int^{(m-1)} \Bigl[v\frac{du}{d\nu} - u\frac{dv}{d\nu} + Luv\Bigr]\,dS.$

On taking account of (3.5a) one obtains

(3.11)  $Q(\rho, \delta) = -\int_{S_\delta}^{(m-1)} \Bigl[v\frac{du}{d\nu} - u\frac{dv}{d\nu} + Luv\Bigr]\,dS.$

With the notation (3.7) we have

$r = \delta$  (for $(x)$ on $S_\delta$);

moreover, by virtue of the statement referring to (3.7b), $dS$ in (3.11) is of
the form

(3.11a)  $dS = d\omega\,(r = \delta) = \delta^{m-1}\,d\Omega.$

In (3.11) $L$ is defined by (2.7a), where the $\pi_i$ are the direction cosines
of the normal to the surface $S_\delta$, extended in the direction interior
$D(\rho, \delta)$; that is, the normal in question is directed outwards from the
sphere $K_\delta$. We have

$\pi_i = \frac{x_i - z_i}{\delta}$

and

(3.11b)  $L = \sum_i B_i\pi_i - \sum_{i,k} \frac{\partial A_{i,k}(x)}{\partial x_k}\,\pi_i.$

By definition (2.6)

(3.12)  $\frac{du}{d\nu} = \sum_{k=1}^{m} \frac12\frac{\partial A}{\partial \pi_k}\frac{\partial u}{\partial x_k} = \sum_{k,i} A_{i,k}(x)\,\pi_i\frac{\partial u}{\partial x_k}$  ($(x)$ in $S_\delta$).

Now

(3.13)  $\frac{d\Gamma}{d\nu} = \sum_{k,i} A_{i,k}(x)\,\pi_i\frac{\partial \Gamma}{\partial x_k}$

and, in consequence of (3.6),

(3.14)  $\frac{dv}{d\nu} = \Bigl[\frac{2-m}{2}\,\Gamma^{-\frac12 m}(x, z) + \frac{m-2}{2}\,\frac{1}{\rho^{2m-4}}\,\Gamma^{\frac12(m-4)}(x, z)\Bigr]\frac{d\Gamma}{d\nu}$  (on $S_\delta$).





Before proceeding further we shall study the function

(3.15)  $w(x, z) = \frac{r}{\sqrt{\Gamma(x, z)}} \qquad (r^2 = (x_1 - z_1)^2 + \dots + (x_m - z_m)^2).$

By (2.21)

(3.16)  $w^2(x, z) = [H(z, \pi) + g_3(x, z)]^{-1},$

where

(3.16a)  $H(z, \pi) = \sum_{k,k_1} H_{k,k_1}(z)\,\pi_k\pi_{k_1}, \qquad \pi_k = \frac{x_k - z_k}{r},$

and

(3.16b)  $g_3(x, z) = \frac{1}{r^2}\,\Gamma_3(x, z)$  (cf. (2.21a)).

For $r \le \delta_0$

(3.16c)  $|2H_{k_1:\alpha,\beta}(z) + \lambda^*_{k_1:\alpha,\beta}(x, z)| \le \lambda_0(z),$

where $\lambda_0(z)$ is independent of $(x)$. Accordingly, by (3.16b)

(3.16d)  $|g_3(x, z)| \le r\sum_{\alpha,\beta,k_1} \lambda_0(z)\,|\pi_\alpha\pi_\beta\pi_{k_1}| \le r\lambda(z) \qquad (0 \le r \le \delta_0)$

($\lambda(z)$ independent of $(x)$). Now $H(z, \pi)$ is a positive definite
quadratic form; thus, inasmuch as $\pi_1^2 + \dots + \pi_m^2 = 1$, one has

(3.17)  $H(z, \pi) \ge h(z) = \sum_{k,k_1} H_{k,k_1}(z)\,\pi'_k\pi'_{k_1} > 0$

(for some values $\pi'_k$, with $\pi_1'^2 + \dots + \pi_m'^2 = 1$), where $h(z)$
is independent of $(\pi)$. Hence, in view of (3.17) and (3.16d)

(3.17a)  $H(z, \pi) + g_3(x, z) \ge h(z) - r\lambda(z) \ge h(z) - \delta_0\lambda(z) = h_1(z) > 0;$





by (3.16), (3.16d), (3.17) and (3.17a) we have

$\Bigl|w^2(x, z) - \frac{1}{H(z, \pi)}\Bigr| = \frac{|g_3(x, z)|}{|H(z, \pi)|\;|H(z, \pi) + g_3(x, z)|} \le \frac{r\lambda(z)}{h(z)\,h_1(z)}$

for $0 \le r \le \delta_0$. Accordingly, the function of (3.15) satisfies the relation


(3.19) 

w\x, z) = rrT—\ + 

H (z, ir) 

(cf. (3.16a)) 

where 



(3.19a) 


(r £ So); 


here p(z ) is independent of (x). In particular 
(3.19b)  $\lim_{r \to 0} w(x, z) = H^{-\frac12}(z, \pi),$

provided $(x) \to (z)$ along a fixed direction whose direction cosines are
$\pi_1, \dots, \pi_m$. In (3.19b) the limit is attained uniformly with respect
to the $\pi_j$; that is, with respect to direction of approach. Of course, the
limit depends on the direction of approach, but there is uniformity in the sense
implied by (3.19), (3.19a).


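The function of (3.11) will be expressed in the form

(3.20)  $Q(\rho, \delta) = Q_1(\rho, \delta) + Q_2(\rho, \delta),$

where, on taking account of (3.11a),

(3.20a)  $Q_1(\rho, \delta) = -\int^{(m-1)} \Bigl[\frac{du}{d\nu} + Lu\Bigr] v\,\delta^{m-1}\,d\Omega$  (cf. (3.11b), (3.12)),

and

(3.20b)  $Q_2(\rho, \delta) = \int^{(m-1)} u\,\frac{dv}{d\nu}\,\delta^{m-1}\,d\Omega$  (cf. (3.14), (3.13)).

If we denote by $l$ the upper bound in the spherical domain $K_{\delta_0}$ of

$\sum_i |B_i(x)| + \sum_{i,k} \Bigl|\frac{\partial A_{i,k}(x)}{\partial x_k}\Bigr|,$

in consequence of (3.11b) it is deduced that

$|L| \le l$  (on $S_\delta$; for $0 < \delta \le \delta_0$).

On the other hand, by (3.12), for $(x)$ on $S_\delta$ one has

(3.21)  $\Bigl|\frac{du}{d\nu}\Bigr| \le \sum_{i,k} |A_{i,k}(x)|\,\Bigl|\frac{\partial u}{\partial x_k}\Bigr| \le l_1 \qquad (0 < \delta \le \delta_0),$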




where $l_1$ is independent of $\delta$. In this connection one may define $l_1$
as the upper bound of the second member in (3.21) for $(x)$ in the spherical
domain $K_{\delta_0}$.⁹ Similarly,

$|u| \le l_2$  (on $S_\delta$; $l_2$ independent of $\delta\,(\le \delta_0)$).

In consequence of (3.6)

(3.22)  $v\,\delta^{m-1} = \delta\,\{w^{m-2}(x, z) + f(x, z)\,\delta^{m-2}\}$  ($(x)$ on $S_\delta$),

where

$f(x, z) = \frac{1}{\rho^{2m-4}}\,\Gamma^{\frac12(m-2)}(x, z) - \frac{2}{\rho^{m-2}}.$

Clearly

$|f(x, z)| \le f_0(z) < \infty \qquad (0 \le r \le \delta_0),$

$f_0(z)$ being independent of $(x)$. Thus, by (3.22) and (3.18)

(3.23)  $|v\,\delta^{m-1}| \le \delta\,\{h_1^{-\frac12(m-2)}(z) + f_0(z)\,\delta^{m-2}\} \le \delta f_1(z)$  ($(x)$ on $S_\delta$; $0 < \delta \le \delta_0$)

with $f_1(z)$ $(< \infty)$ independent of $(x)$ and $\delta$.

On taking account of (3.21) and (3.23), from (3.20a) it is inferred that

$|Q_1(\rho, \delta)| \le \delta \int^{(m-1)} (l_1 + l\,l_2)\,f_1(z)\,d\Omega = \delta\,(l_1 + l\,l_2)\,f_1(z)\,S_m;$

here $S_m$ is the area of the surface of a hypersphere (in the space of $(x)$)
of radius unity. Consequently

(3.24)  $\lim_{\delta \to 0} Q_1(\rho, \delta) = 0.$

Turning our attention to the integral (3.20b) we investigate the integrand
involved. In view of (2.21) and of certain other considerations of section 2
one has

$\frac{\partial \Gamma}{\partial x_k} = \sum_{k_1} \bigl(2H_{k,k_1}(x) + \omega_{k,k_1}(z, x)\bigr)(x_{k_1} - z_{k_1}),$

where the $\omega$ are continuous in $(z)$ and $(x)$, while

(3.25)  $\omega_{k,k_1}(z, x) \to 0$  (as $(x) \to (z)$)

uniformly with respect to direction of approach. Whence by virtue of (3.13)

$\frac{d\Gamma}{d\nu} = \sum_{k,i} A_{i,k}(x)\,\pi_i \sum_{k_1} \bigl(2H_{k,k_1}(x) + \omega_{k,k_1}(z, x)\bigr)(x_{k_1} - z_{k_1})$
$\qquad = \sum_{k,i,k_1} A_{i,k}(x)\,\pi_i\,\omega_{k,k_1}(z, x)(x_{k_1} - z_{k_1}) + \gamma,$

9 It is implied that the $\partial u/\partial x_k$ are continuous in $D$.





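where

$\gamma = 2\sum_{i,k_1} \pi_i\,(x_{k_1} - z_{k_1}) \sum_k A_{i,k}(x)\,H_{k,k_1}(x) = 2\sum_{i=1}^{m} \pi_i\,(x_i - z_i) = 2\delta.$

Hence, the transversal derivative being with respect to the spherical surface $S_\delta$,

$\frac{1}{\delta}\frac{d\Gamma}{d\nu} = 2 + \omega'(z, x)$  ($(x)$ on $S_\delta$);

here $\omega'(z, x)$ is continuous for $(x)$ in $\Gamma_\rho$ and (in view of (3.25))

(3.26)  $|\omega'(z, x)| \le \sum_{k,i,k_1} |A_{i,k}(x)|\,|\omega_{k,k_1}(z, x)| \to 0$

(as $(x) \to (z)$) uniformly with respect to the direction along which $(x)$
approaches the fixed point $(z)$ in $D$. Whence, on taking account of (3.15), it
is inferred that

(3.27)  $\Gamma^{-\frac12}\,\frac{d\Gamma}{d\nu} = w(x, z)\,(2 + \omega'(z, x))$  ($(x)$ on $S_\delta$).

By virtue of (3.14)

$\frac{dv}{d\nu}\,\delta^{m-1} = \Bigl[\frac{2-m}{2}\,\Gamma^{-\frac12 m} + \frac{m-2}{2}\,\frac{1}{\rho^{2m-4}}\,\Gamma^{\frac12(m-4)}\Bigr]\delta^{m-1}\,\frac{d\Gamma}{d\nu}.$

Since $m \ge 3$ we have

(3.28)  $\frac{dv}{d\nu}\,\delta^{m-1} = \Bigl[\frac{2-m}{2}\,w^{m-1}(x, z) + \delta^{m-1}\omega_1(x, z)\Bigr]\Gamma^{-\frac12}\,\frac{d\Gamma}{d\nu},$

where

(3.29)  $|\omega_1(x, z)| = \Bigl|\frac{m-2}{2}\,\frac{1}{\rho^{2m-4}}\,\Gamma^{\frac12(m-3)}(x, z)\Bigr| \le \omega_1(z) < \infty,$

$\omega_1(z)$ being independent of $(x)$. Finally, substitution of (3.27) in
(3.28) will yield

(3.30)  $\frac{dv}{d\nu}\,\delta^{m-1} = (2 - m)\,w^m(x, z) + R(x, z),$

with

$R(x, z) = 2\delta^{m-1}\omega_1(x, z)\,w(x, z) + \omega'(z, x)\Bigl[\frac{2-m}{2}\,w^m(x, z) + \delta^{m-1}\omega_1(x, z)\,w(x, z)\Bigr].$

In view of (3.29), (3.18) and of (3.26) it is observed that $R(x, z)$ is
continuous in $(x)$, for $(x)$ in $\Gamma_\rho$; moreover,

$R(x, z) \to 0$  (as $(x) \to (z)$),

uniformly with respect to the direction along which $(x) \to (z)$.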



On taking account of continuity of $u$, in consequence of (3.30) we have

(3.31)  $u(x)\,\frac{dv}{d\nu}\,\delta^{m-1} = (u(z) + \bar u(z, x))\,[(2 - m)\,w^m(x, z) + R(x, z)],$

where $\bar u(z, x)$ is continuous in $(z)$, $(x)$ ($(z)$, $(x)$ in $D$) and
$\to 0$ uniformly (for $(z)$ fixed in $D$), as $(x) \to (z)$.¹⁰ Relation (3.31),
together with (3.19b) (satisfied uniformly in the previously indicated sense),
leads to the conclusion that for the integral (3.20b) one has

(3.32)  $\lim_{\delta \to 0} Q_2(\rho, \delta) = -(m - 2)\,u(z)\int^{(m-1)} H^{-\frac12 m}(z, \pi)\,d\Omega.$

Thus, by (3.20), (3.32) and (3.24), there is on hand the relation

(3.33)  $\lim_{\delta \to 0} Q(\rho, \delta) = -k(z)\,u(z),$

where

(3.33a)  $k(z) = (m - 2)\int^{(m-1)} (H(z, \pi))^{-\frac12 m}\,d\Omega,$

with

$H(z, \pi) = \sum_{i,j} H_{i,j}(z)\,\pi_i\pi_j.$

In (3.33a) $d\Omega$ is the differential element of surface of a hypersphere of
radius unity, $(\pi) = (\pi_1, \dots, \pi_m)$ is thought of as representing the
variable point (on this sphere) with respect to which the integration is
performed.

At this stage it will be convenient to obtain certain inequalities for $k(z)$.
By (3.33a) and (3.17)

(3.34)  $k(z) \le (m - 2)\,h^{-\frac12 m}(z)\int^{(m-1)} d\Omega = (m - 2)\,h^{-\frac12 m}(z)\,S_m.$

On the other hand,

(3.35)  $H(z, \pi) \le \bar H(z) < \infty$  (for $(\pi)$ on unit sphere)

for $(z)$ in $D$; here $\bar H(z)$ may be taken continuous (independent of
$(\pi)$). Accordingly

(3.36)  $k(z) \ge (m - 2)\int^{(m-1)} \bar H^{-\frac12 m}(z)\,d\Omega = (m - 2)\,\bar H^{-\frac12 m}(z)\,S_m > 0.$
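
A familiar special case may help fix the size of $k(z)$. The sketch below assumes, for illustration only, that the second order part of the operator is the Laplacian, so that the matrix of the characteristic form and its inverse are both the identity; the bounds (3.34) and (3.36) then hold with equality.

```latex
% Illustrative assumption: Laplacian case, H_{i,j}(z) = \delta_{ij}
% (Kronecker symbol), hence H(z,\pi) = \pi_1^2 + \dots + \pi_m^2 = 1 on the unit sphere.
\begin{align*}
k(z) &= (m-2)\int^{(m-1)} 1^{-\frac12 m}\, d\Omega = (m-2)\,S_m ,\\
m = 3:\quad k(z) &= S_3 = 4\pi .
\end{align*}
```

The value $4\pi$ for $m = 3$ is the usual constant attached to the Newtonian potential $1/r$.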

By virtue of (3.10a) and (3.33) it is concluded that

$-k(z)\,u(z) = \int_{\Gamma_\rho}^{(m)} [v(x, z)f(x) - u(x)\,G(v(x, z))]\,dT_x,$

10 Uniformly with respect to direction of approach. 







where $\Gamma_\rho$ is the domain, containing $(z)$ in the interior, bounded by
the surface $\Gamma(x, z) - \rho^2 = 0$ ($\rho\,(> 0)$ sufficiently small).
Accordingly, $u$ satisfies the integral equation

(3.37)  $u(z) - \int_{\Gamma_\rho}^{(m)} K(x, z)\,u(x)\,dT_x = f_\rho(z),$

with

(3.37a)  $f_\rho(z) = -\int_{\Gamma_\rho}^{(m)} \frac{v(x, z)}{k(z)}\,f(x)\,dT_x,$

(3.37b)  $K(x, z) = \frac{1}{k(z)}\,G(v(x, z)).$

We sum the above developments as follows.

Theorem 3.1. Every solution $u$ of (1.1) which, together with the $D_2 u$, is
continuous in the domain $D$, satisfies the integral equation (3.37). Here
$f_\rho(z)$ is given by (3.37a); the kernel $K(x, z)$ is defined by (3.37b). The
domain $\Gamma_\rho$ contains $(z)$ and is bounded by the surface
$\Gamma(x, z) - \rho^2 = 0$ [$\rho\,(> 0)$, sufficiently small; $(z)$ in $D$].
The function $v(x, z)$ involved in (3.37a) and (3.37b) is of the form

$v(x, z) = \Gamma^{\frac12(2-m)}(x, z) + \frac{1}{\rho^{2m-4}}\,\Gamma^{\frac12(m-2)}(x, z) - \frac{2}{\rho^{m-2}} = v(z, x).$

The differential operator $G$ is the adjoint of $F$ and is, accordingly, defined
by (2.1) (the derivatives are with respect to the $x_j$). Finally, $k(z)$ is the
function of (3.33a) and satisfies the inequalities (3.34) and (3.36) (for $(z)$
in $D$).

We define $v(x, z)$ and, hence, $K(x, z)$ as zero for $(x)$ (in $D$) exterior
$\Gamma_\rho$ (i.e. for $(x)$ such that $\Gamma(x, z) \ge \rho^2$, where
$\rho\,(> 0)$ possibly depends on $(z)$). Then $K(x, z)$ will be defined for all
$(x)$ and $(z)$ in $D$. With this in view, the field of integration in (3.37),
(3.37a) may be replaced by $D$.


4. Investigation of the integral equation 

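In consequence of the developments of section 2 it is inferred that

(4.1)  $\frac{\partial \Gamma}{\partial x_k} = \sum_{k_1} \bigl(2H_{k,k_1}(x) + o_{k,k_1}(z, x)\bigr)(x_{k_1} - z_{k_1}),$

(4.1a)  $\frac{\partial^2 \Gamma}{\partial x_k\,\partial x_{k_1}} = 2H_{k,k_1}(x) + o'_{k,k_1}(z, x),$

where the functions

$o_{k,k_1}(z, x),\quad o'_{k,k_1}(z, x)$

are continuous in $(x)$ for $(x)$ in $\Gamma_\rho$, while

$o_{k,k_1}(z, x),\ o'_{k,k_1}(z, x) \to 0$  (as $(x) \to (z)$).

Now

$\sum_{i,k} \frac{\partial^2}{\partial x_i\,\partial x_k}(A_{i,k}v) = \sum_{i,k} A_{i,k}\frac{\partial^2 v}{\partial x_i\,\partial x_k} + 2\sum_{i,k} \frac{\partial A_{i,k}}{\partial x_i}\frac{\partial v}{\partial x_k} + v\sum_{i,k} \frac{\partial^2 A_{i,k}}{\partial x_i\,\partial x_k};$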




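hence by (2.1)

(4.2)  $G(v) = \sum_{i,k} A_{i,k}(x)\frac{\partial^2 v}{\partial x_i\,\partial x_k} + \sum_k B'_k(x)\frac{\partial v}{\partial x_k} + C'(x)\,v,$

where

(4.3)  $B'_k(x) = 2\sum_i \frac{\partial A_{i,k}}{\partial x_i} - B_k(x), \qquad C'(x) = \sum_{i,k} \frac{\partial^2 A_{i,k}}{\partial x_i\,\partial x_k} - \sum_i \frac{\partial B_i}{\partial x_i} + C.$

By (3.6)

$v = v(x, z) = T(\Gamma(x, z)), \qquad T(\Gamma) = \Gamma^{\frac12(2-m)} + \frac{1}{\rho^{2m-4}}\,\Gamma^{\frac12(m-2)} - \frac{2}{\rho^{m-2}}.$

Thus

$\frac{\partial v}{\partial x_k} = T^{(1)}(\Gamma)\frac{\partial \Gamma}{\partial x_k}, \qquad \frac{\partial^2 v}{\partial x_i\,\partial x_k} = T^{(2)}(\Gamma)\frac{\partial \Gamma}{\partial x_i}\frac{\partial \Gamma}{\partial x_k} + T^{(1)}(\Gamma)\frac{\partial^2 \Gamma}{\partial x_i\,\partial x_k}$

and, by virtue of (4.2),

$G(v) = T^{(1)}(\Gamma)\Bigl\{\sum_{i,k} A_{i,k}(x)\frac{\partial^2 \Gamma}{\partial x_i\,\partial x_k} + \sum_k B'_k(x)\frac{\partial \Gamma}{\partial x_k}\Bigr\} + C'(x)\,T(\Gamma) + T^{(2)}(\Gamma)\sum_{i,k} A_{i,k}(x)\frac{\partial \Gamma}{\partial x_i}\frac{\partial \Gamma}{\partial x_k}.$

On making use of (4.2) and of the differential equation (2.8), we accordingly
obtain

(4.4)  $G(v) = \tau(x, z)\,T^{(1)}(\Gamma) + C'(x)\,T(\Gamma) + 4\Gamma\,T^{(2)}(\Gamma),$

where

(4.4a)  $\tau(x, z) = \sum_{i,k} A_{i,k}(x)\frac{\partial^2 \Gamma}{\partial x_i\,\partial x_k} + \sum_k B'_k(x)\frac{\partial \Gamma}{\partial x_k} = G(\Gamma) - C'(x)\,\Gamma;$

here

(4.4b)  $T^{(1)}(\Gamma) = \frac{m-2}{2}\Bigl[\frac{1}{\rho^{2m-4}}\,\Gamma^{\frac12(m-4)} - \Gamma^{-\frac12 m}\Bigr],$
        $T^{(2)}(\Gamma) = \frac{m-2}{2}\Bigl[\frac{1}{\rho^{2m-4}}\,\frac{m-4}{2}\,\Gamma^{\frac12(m-6)} + \frac{m}{2}\,\Gamma^{-\frac12 m - 1}\Bigr].$

From (4.4a) by virtue of (4.1) and (4.1a) we deduce

(4.5)  $\tau(x, z) = 2m + \tau_1(x, z),$

$\tau_1(x, z) = \sum_{i,k} A_{i,k}(x)\,o'_{i,k}(z, x) + \sum_{k,k_1} B'_k(x)\bigl(2H_{k,k_1}(x) + o_{k,k_1}(z, x)\bigr)(x_{k_1} - z_{k_1}).$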




It is observed that $\tau(x, z)$ is independent of $C(x)$; moreover,
$\tau(x, z)$ is continuous in $(x)$ for $(x)$ in $\Gamma_\rho$. Also

(4.5a)  $\tau_1(x, z) \to 0$  (as $(x) \to (z)$).

It is possible to obtain a more detailed expression for $\tau_1(x, z)$, namely

(4.6)  $\tau_1(x, z) = \sum_j \bigl(\tau_{1,j}(z) + \delta_j(z, x)\bigr)(x_j - z_j);$

here the $\tau_{1,j}(z)$ are polynomials in the

$H_{i,k}(z),\ A_{i,k}(z),\ D_1 A_{i,k}(z),\ B_i(z);$

moreover, the $\delta_j(z, x)$ are continuous in $(x)$ for $(x)$ in $\Gamma_\rho$, while

(4.6a)  $\delta_j(z, x) \to 0$  (as $(x) \to (z)$).

The precise form of the n,y(z) in (4.6) is 

(4.6b) r u (z) = X?(z) + Z 2H.,*(z) — 

t.fc 02y 

where 

(4.6c) X?(z) = Z AiMNUz) + 2 Z 

i ,v i 

with 

(4.6d) iVj„(z) = Ti. ; - + 2t, + Tj,< + T‘.„ + , 

(4.6e) - - Z Hr.^ ^ + * Z 

r.-y CrZ* j F r OZ% 

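By (4.4), (4.5) and (4.4b)

$G(v) = (2m + \tau_1(x, z))\,\frac{m-2}{2}\Bigl[\frac{1}{\rho^{2m-4}}\,\Gamma^{\frac12(m-4)} - \Gamma^{-\frac12 m}\Bigr]$
$\qquad + C'(x)\Bigl[\Gamma^{\frac12(2-m)} + \frac{1}{\rho^{2m-4}}\,\Gamma^{\frac12(m-2)} - \frac{2}{\rho^{m-2}}\Bigr] + (m - 2)\Bigl[\frac{m-4}{\rho^{2m-4}}\,\Gamma^{\frac12(m-4)} + m\,\Gamma^{-\frac12 m}\Bigr].$

Thus

(4.7)  $\Gamma^{\frac12 m}\,G(v) = (m - 2)\bigl[-\tfrac12\tau_1(x, z) + \omega_1(x, z)\bigr] + C'(x)\,(1 + \omega_2(x, z))\,\Gamma(x, z),$

where

$\omega_1(x, z) = \bigl(2(m - 2) + \tfrac12\tau_1(x, z)\bigr)\frac{\Gamma^{m-2}}{\rho^{2m-4}},$

(4.7a)

$\omega_2(x, z) = \frac{\Gamma^{m-2}}{\rho^{2m-4}} - \frac{2}{\rho^{m-2}}\,\Gamma^{\frac12 m - 1}$  ($\tau_1(x, z)$ from (4.6)-(4.6a)).

Theorem 4.1. The kernel of the integral equation (3.37) is of the form

$K(x, z) = \frac{1}{k(z)}\,G(v(x, z)),$

where $k(z)$ is given by (3.33a) (cf. (3.34)-(3.36)) and $G(v)$ is of the form
(4.7), (4.7a); moreover, $C(x)$ enters in $C'(x)$ (cf. (4.3)) only.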

In view of the preceding developments and in particular in consequence of
(4.6b)-(4.6e) it is inferred that, when the $A_{i,j}$ are constants, while the
$B_i \equiv 0$, we have

$\tau_{1,j}(z) = 0 \qquad (j = 1, \dots, m)$

and

$\tau_1(x, z) = \sum_j \delta_j(z, x)(x_j - z_j)$  (cf. (4.6a));

a further examination will show that, in this case, $\tau_1(x, z)$ is of the
order of magnitude of $r^2$ $(= (x_1 - z_1)^2 + \dots + (x_m - z_m)^2)$.

We recall that by (3.15) and (3.18)

(4.8)  $\frac{r^2}{\Gamma(x, z)} \le \frac{1}{h_1(z)} \qquad (0 \le r \le \delta_0).$

Now, in consequence of (4.7) and (4.7a)

$|\Gamma^{\frac12 m}\,G(v)| \le r\,k_1(z, \delta_0) \qquad (r \le \delta_0),$

where $k_1(z, \delta_0)$ is independent of $(x)$. Hence from (4.8) it is deduced that

(4.9)  $|G(v)| \le k(z, \delta_0)\,r^{-m+1} \qquad (r \le \delta_0),$

where

$k(z, \delta_0) = k_1(z, \delta_0)\,h_1^{-\frac12 m}(z)$

is independent of $(x)$.

Choosing for the element of volume, $dT_x$, the expression of (3.7a), in view of
(4.9) it is concluded that the integral

$\int^{(m)} G^2(v)\,dT_x,$

extended over a neighborhood of $(x) = (z)$, will not necessarily exist. That is,
the integral

$\int^{(m)} K^2(x, z)\,dT_x$

may be divergent.
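
The reason the bound (4.9) does not secure square integrability can be seen from a radial count. The sketch below only uses the bound itself together with the volume element of (3.7a); since (4.9) is an upper estimate, it shows that convergence cannot be inferred, not that divergence must occur.

```latex
% Radial count with |G(v)| <= k r^{-m+1} and dT = r^{m-1} dr d\Omega.
\begin{align*}
\int_0^{\delta_0} r^{-m+1}\, r^{m-1}\,dr &= \delta_0 < \infty
 &&\text{(bound for } |G(v)| \text{ is integrable)},\\
\int_0^{\delta_0} r^{2(-m+1)}\, r^{m-1}\,dr &= \int_0^{\delta_0} r^{-m+1}\,dr = \infty \quad (m \ge 2)
 &&\text{(bound for } G^2(v) \text{ is not)}.
\end{align*}
```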

We shall consider now the effect of iterating the equation (3.37). It is
inferred that the result of a $\nu$-fold iteration is

(4.10)  $u(z) - \int_D^{(m)} K_\nu(x, z)\,u(x)\,dT_x = f_{\rho,\nu}(z) \qquad (\nu = 1, 2, \dots),$

where

(4.11)  $K_\nu(x, z) = \int_D^{(m)} K(x, x_1)\,K_{\nu-1}(x_1, z)\,dT_{x_1} \qquad (\nu = 1, 2, \dots;\ K_0(x, z) = K(x, z))$

and

(4.11a)  $f_{\rho,\nu}(z) = f_{\rho,\nu-1}(z) + \int_D^{(m)} K_{\nu-1}(x_1, z)\,f_\rho(x_1)\,dT_{x_1} \qquad (\nu = 1, 2, \dots;\ f_{\rho,0}(z) = f_\rho(z)).$
In view of a known theorem¹¹ it is concluded that, if $K(x, z)$ is of the form

(4.12)  $K(x, z) = \frac{H(x, z)}{r^a} \qquad (r^2 = (x_1 - z_1)^2 + \dots + (x_m - z_m)^2;\ 0 < a < m),$

where $H(x, z)$ is bounded (integrable), then $K_\nu(x, z)$ is bounded (for
$(x)$, $(z)$ in $\Gamma_\rho$) for $\nu$ such that $\nu + 1$ is the least
integer for which

$\nu + 1 > \frac{1}{1 - \dfrac{a}{m}};$

that is, for $\nu$ equal to the least integer for which

$\nu > \frac{a}{m - a}.$

Now, in the actual case under consideration it is observed that, in view of
(4.9), one has

(4.13)  $|K(x, z)| \le \frac{1}{k(z)}\,|G(v)| \le \frac{k(z, \delta_0)}{k(z)}\,r^{-m+1};$

thus, (4.12) holds with $a = m - 1$. Accordingly, the function $K_\nu(x, z)$ of
(4.11) is bounded (and can be shown to be integrable in $(x)$ over $D$) for
$\nu = m$ and for $(x)$ in $D$. The upper bound of $|K_\nu(x, z)|$ will in
general depend on $(z)$.
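
The count of iterations can be made explicit. The first line below is just the arithmetic of the criterion stated above for a kernel with singularity exponent $a = m - 1$. The second line is a heuristic, offered as an assumption rather than as the cited theorem: it uses the customary rule that composing kernels of orders $a$ and $b$ with $a + b > m$ produces a kernel of order $a + b - m$.

```latex
\begin{align*}
a = m - 1:\qquad & \frac{a}{m-a} = m - 1, \quad\text{so the least integer } \nu > m - 1 \text{ is } \nu = m
 \qquad (m = 3:\ \nu = 3),\\
\text{heuristic order of } K_\nu:\qquad & (\nu + 1)\,a - \nu\,m = m - 1 - \nu ,
 \quad\text{which is negative first at } \nu = m .
\end{align*}
```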

The integrals defining the $f_{\rho,\nu}(z)$ (4.11a) will certainly exist since
$f(x)$ is bounded in any closed subset of $D$ and is integrable over
$\Gamma_\rho$. This fact is a consequence of the following considerations. By
(3.6) and since

$\Gamma(x, z) \le \rho^2$  (for $(x)$ in $\Gamma_\rho$)

one has

$v(x, z) = \Gamma^{\frac12(2-m)}(x, z)\,v_0(x, z),$

where

$|v_0(x, z)| \le 4$  ($(x)$ in $\Gamma_\rho$).

Furthermore

$v(x, z) = w^{m-2}(x, z)\,v_0(x, z)\,r^{-m+2}$  (cf. (3.15))

and, by virtue of (3.18),

(4.14)  $|v(x, z)| \le h'(z)\,r^{-m+2} \qquad (h'(z) = 4\,h_1^{\frac12(-m+2)}(z))$


11 See, for instance, V. Volterra and J. Pérès, Théorie générale des fonctionnelles . . .
[Paris, 1936; p. 276].




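for $(x)$ in $\Gamma_\rho$. On taking

$dT_x = r^{m-1}\,dr\,d\Omega$

and on using (4.14), we obtain for the integrand of (3.37a) the inequality

$\Bigl|\frac{v(x, z)}{k(z)}\,f(x)\,dT_x\Bigr| \le \frac{h'(z)}{k(z)}\,|f(x)|\,r\,dr\,d\Omega.$

Whence the integral (3.37a) exists. In particular, if

(4.15)  $|f(x)| \le M$  (in $\Gamma_\rho$)

one has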

(4.16) | f„(z) | g ML 2 S m g /*(«) (L = diameter of r p ). 
In view of (3.36), (4.14) we may take 

(4.17) fib) = H im (z)K m+ \z) (cf. (3.35)). 

771 Zt 

Here $h_1(z) = h(z) - \delta_0\lambda(z)$, where $h(z)$ is the minimum of
$H(z, \pi)$ for $(\pi)$ on the unit hypersphere; $\lambda(z)$ (from (3.16d)) may
be taken as $\sigma\lambda_0(z)$ (suitable positive constant $\sigma$), with
$\lambda_0(z)$ from (3.16c). In this connection, $\rho\,(> 0)$ is to be taken
sufficiently small so that, for $(x)$ in $\Gamma_\rho$, one has
$r \le \delta_0 < h(z)/\lambda(z)$.

Let

$[z]$

denote generically a positive function of $(z)$ uniformly bounded in every
closed subset of $D$. On taking account of (4.11a) (with $\nu = 1$) and of the
fact that, in consequence of (4.13)

$|K(x_1, z)| \le [z]\,r_1^{-m+1}$

($r_1$ = distance between $(z)$ and $(x_1)$) we have (on taking
$dT_{x_1} = r_1^{m-1}\,dr_1\,d\Omega_1$)

$|K(x_1, z)\,f_\rho(x_1)\,dT_{x_1}| \le [z][x_1]\,dr_1\,d\Omega_1,$

it is concluded that $|f_{\rho,1}(z)| = [z]$. In view of (4.11a) (with
$\nu = 2$) and since

$|K_1(x_1, z)\,dT_{x_1}| \le [z][x_1]\,r_1^{-m_1}\,dT_{x_1} \qquad (-m_1 \ge -m + 1),$

so that

$|K_1(x_1, z)\,f_\rho(x_1)\,dT_{x_1}| \le [z][x_1]\,r_1^{-m_1+m-1}\,dr_1\,d\Omega_1,$

we infer that, inasmuch as $-m_1 + m - 1 \ge 0$, necessarily
$|f_{\rho,2}(z)| = [z]$. Thus, on using the fact that

$|K_\nu(x, z)| \le [z]\,r^{-m_\nu},$

where

$-m + 1 \le -m_1 \le -m_2 \le \cdots,$



from (4.11a) we deduce, step by step, that

$|f_{\rho,\nu}(z)| = [z]$

for $\nu = 1, \dots, m$. In the above we use the fact that for every $(z)$ in
$D$, there is a closed subset of $D$ (containing $(z)$) so that $K_\nu(x, z)$ is
zero for $(x)$ exterior this subset.

Theorem 4.2. Every solution u of the partial differential equation (1.1), 
which together with the Dyu is continuous in the domain D, satisfies the integral 
equation 

(4.18) $u(z) - \int_{\Gamma_{\rho'}}^{(m)} K_m(x, z)u(x)\,dT_x = f_{\rho',m}(z)$ ($\rho' > \rho$),

where $K_m(x, z)$ and $f_{\rho',m}$ are defined with the aid of the relations (4.11), (4.11a).
This equation has the advantage over the original equation (3.37) in the fact that
the kernel $K_m(x, z)$ satisfies the inequality

(4.18a) $|K_m(x, z)| \le k^*(z) < \infty$ ($k^*(z)$ independent of $(x)$)

for $(x)$ in $\Gamma_{\rho'}$, and that $K_m(x, z)$ is integrable, in $(x)$, over $\Gamma_{\rho'}$.

Note. By taking $\rho = \rho(z)$ ($> 0$) suitably small we arrange to have
$\rho' = \rho'(z) > \rho(z)$, above, so small that the domain $\Gamma_{\rho'}$ (i.e. the set of points $(x)$
such that $\Gamma(x, z) \le (\rho')^2$) lies in $D$. We have $K_m(x, z)$ zero for $(x)$ (in $D$)
exterior to $\Gamma_{\rho'}$; in (4.18) we may then replace the field of integration by $D$. The
occurrence of $\rho'$ takes place in consequence of the following considerations. By
(4.11) (with $\nu = 1$)

(4.19) $K_1(x, z) = \int_{D}^{(m)} K(x, x_1)K(x_1, z)\,dT_{x_1} = \int_{\Gamma_\rho}^{(m)} K(x, x_1)K(x_1, z)\,dT_{x_1}$

since $K(x_1, z) = 0$ for $(x_1)$ exterior to $\Gamma_\rho$. Now $K(x, x_1) = 0$ for $(x)$ (in $D$) such
that $\Gamma(x, x_1) \ge \rho(x_1) > 0$. There exists a number $\rho_1 = \rho_1(z) > \rho(z)$, independent
of $(x_1)$ ($(x_1)$ in $\Gamma_\rho$) so that $K(x, x_1) = 0$ for $(x)$ (in $D$) exterior to $\Gamma_{\rho_1}$, for all
$(x_1)$ in $\Gamma_\rho$. Hence $K_1(x, z) = 0$ for $(x)$ exterior to $\Gamma_{\rho_1}$; $\rho_1(z)$ can be made as small
as desired by suitable choice of $\rho = \rho(z)$. By (4.11) (with $\nu = 2$)

(4.19a) $K_2(x, z) = \int_{D}^{(m)} K(x, x_1)K_1(x_1, z)\,dT_{x_1} = \int_{\Gamma_{\rho_1}}^{(m)} K(x, x_1)K_1(x_1, z)\,dT_{x_1}$.

Repeating the above reasoning, with $\rho_1$ in place of $\rho$ we find that $K_2(x, z) = 0$
for $(x)$ exterior to $\Gamma_{\rho_2}$, where $\rho_2 = \rho_2(z)$ ($> \rho_1(z)$). Proceeding step by step in
such a manner, we obtain the stated property of $K_m(x, z)$, with $\rho' = \rho_m(z)$.

In the sequel the prime over $\rho$ will be deleted.

If the $D_n$ are domains, in $D$, such that $D_n \subset D_{n+1}$ and $D_n \to D$ (as $n \to +\infty$),
with the aid of the above we obtain the following result:

(4.20) $u(z) - \int_{D_{n'}}^{(m)} K_m(x, z)u(x)\,dT_x = f_{\rho,m}(z)$





for all $(z)$ in $D_n$; here $n'$ is some integer $> n$ such that $D_{n'}$ contains every point
$(x)$ for which $\Gamma(x, z) \le \rho^2(z)$, when $(z)$ is any point in $D_n$; with $\rho(z)$ suitably
small such $n'$ certainly exists and, in fact, may be taken as $n + 1$. Clearly
$K_m(x, z) = 0$ for $(z)$ in $D_n$ and $(x)$ in $D - D_{n'}$.

We have $k^*(z) \le b_n < \infty$ (for $(z)$ in $D_n$), where $k^*(z)$ is from (4.18a). Thus

(4.20a) $|K_m(x, z)| \le b_n < \infty$ (constant $b_n$)

for all $(x)$ in $D$ and for all $(z)$ in $D_n$.


5. Characteristic values and functions

In place of (1.1) we shall now consider the partial differential equation

(5.1) $F(u) + \lambda u = f$ ($F$ from (1.1); continuous $f \subset L_2$ in $D$),

where $\lambda$ is a parameter. This equation is obtained by replacing $C$ by $C + \lambda$.
Any equation of the form

$F(u) + \lambda q(x)u = f$

is reduced to the form (5.1), provided that, for $(x)$ in $D$, the functions

$A_{i,k}\sigma,\ B_i\sigma,\ C\sigma,\ f\sigma$ (with $\sigma = 1/q(x)$)

satisfy conditions as previously imposed on the functions

$A_{i,k},\ B_i,\ C,\ f$,

respectively.

The introduction of a parameter, as above, is in agreement with the situation
in the Schrödinger wave theory.

On taking account of (3.37b), $v(x, z)$ being of the form (3.6), and of (2.1), replacing
$C$ by $C + \lambda$ we obtain

(5.2) $K(x, z) = K^0(x, z) + \lambda v(x, z)$,

where $K^0(x, z)$ stands for the function which in section 4 has been designated as
$K(x, z)$.

The function $K(x, z)$, defined in (5.2), is the kernel of the integral equation
(3.37), corresponding to the partial differential equation (5.1).

Application of (4.11), for $\nu = 1, \cdots m$, will yield

(5.3) $K_m(x, z) = K_m^0(x, z) + \sum_{j=1}^{m+1} \lambda^j A_{m,j}(x, z)$.

Here $K_m^0(x, z)$ is the function designated in section 4 as $K_m(x, z)$. The $A_{m,j}(x, z)$
are functions of the same type as $K_m(x, z)$ of section 4 and accordingly satisfy
inequalities of the form (4.20a).

In view of (3.37a), (4.11) and (4.11a) it is observed that $f_{\rho,m}(z)$ is a polynomial
in $\lambda$ of degree $m$, whose coefficients are uniformly bounded (in absolute value) and
integrable over $\Gamma_\rho$. In order to put in evidence dependence on $\lambda$ we shall
write

$f_{\rho,m}(z) = f_{\rho,m}(z, \lambda)$.

The integral equation corresponding to (5.1) is

(5.4) $u(z) - \int_{D}^{(m)} u(x)\left[K_m^0(x, z) + \sum_{j=1}^{m+1} \lambda^j A_{m,j}(x, z)\right] dT_x = f_{\rho,m}(z, \lambda)$.

Before proceeding further we shall note that in consequence of a recent work
of G. Giraud,$^{12}$ in the sequel referred to as (G), the following may be asserted
with regard to the problem

(5.5) $F(u) + \lambda u = f$ ($(x)$ in $D_n$),

(5.5a) $\frac{du}{d\nu} + a_n(x)u = p_n(x)$ ($(x)$ on $S_n$),

where $D_n$ is a domain, for which $D_n \subset D$, and $S_n$ is the limiting surface of $D_n$;
the functions $a_n(x)$, $p_n(x)$ are defined and continuous on $S_n$. It is assumed
that $D_n$ and $S_n$ are “regular” in the sense that $S_n$ can be covered by a finite
number of domains, in each of which one of the Cartesian coordinates of a point
(variable in $S_n$) is expressible with the aid of $m - 1$ other coordinates through a
function whose partial derivatives of the first order belong to the class Lip. $h$
(with $h \le 1$).$^{13}$

In (G) the problem (5.5), (5.5a) (and, in fact, a more general problem) is reduced
to integral equations to which the Fredholm theory applies. The final
conclusion of (G), needed in the sequel, is that “either the only solution of the
homogeneous problem is zero, and then the non homogeneous problem is compatible
for all $f$ (continuous in $D_n + S_n$) and $p_n(x)$, or the homogeneous problem
has solutions not identically zero, deducible from $p$ ($p$ finite) linearly independent
solutions, and then the non homogeneous problem is not compatible,
unless certain $p$ necessary and sufficient conditions are satisfied.” The latter
conditions are supplied by the Fredholm theory.

We shall take $p_n(x) = 0$. Associated with (5.5), (5.5a) there is a set of characteristic
values,

(5.6) $\lambda_{n,i}$ ($i = 1, 2, \cdots$),

the sequence $\lambda_{n,1}, \lambda_{n,2}, \cdots$ possessing no finite limiting values. When $\lambda \ne \lambda_{n,i}$,


12 G. Giraud, Nouvelle méthode pour traiter certains problèmes relatifs aux équations du
type elliptique [Journal de Mathématiques, Tome Dix-huitième (1939), 111-143].

13 This is the type of surfaces considered in (G).





then the non homogeneous problem (5.5), (5.5a) has a solution, $u = u_n$, continuous
in $D_n + S_n$. For $\lambda = \lambda_{n,i}$ the homogeneous problem

(5.7) $F(u) + \lambda_{n,i}u = 0$ (in $D_n$), $\frac{du}{d\nu} + a_n(x)u = 0$ (on $S_n$)

has at least one solution, $u = u_{n,i}$, distinct from zero, continuous in $D_n + S_n$.
We assume that $D$ is such that there exists a sequence of domains

$D_n$ ($n = 1, 2, \cdots$; $D_n \subset D$),

whose limiting surfaces are $S_n$ ($n = 1, 2, \cdots$), respectively, and which are “regular”
in the sense specified above, while

$D_n \subset D_{n+1}$, $\lim_n D_n = D$.$^{14}$

Forthwith the $D_n$ and $S_n$ will have the meaning just indicated.

In connection with (5.1) one may consider the “adjoint” non homogeneous
equation

$G(v) + \lambda v = g$ (continuous $g \subset L_2$ in $D$).

With respect to this equation results will hold precisely analogous to those
stated from (5.1) to (5.4) for the equation (5.1).

Furthermore, applying the work of Giraud to the problem

(5.8) $G(v) + \mu v = g$ (in $D_n$), $\frac{dv}{d\nu} + b_n(x)v = q_n(x)$ (on $S_n$),

where $b_n(x)$, $q_n(x)$ are continuous on $S_n$, we have a situation similar to that
described in connection with (5.5), (5.5a). In particular, the problem (5.8)
has a set of characteristic values

(5.9) $\mu_{n,j}$ ($j = 1, 2, \cdots$),

the sequence $\mu_{n,1}, \mu_{n,2}, \cdots$ having no finite limiting points, such that the
homogeneous problem

(5.10) $G(v) + \mu_{n,j}v = 0$ (in $D_n$), $\frac{dv}{d\nu} + b_n(x)v = 0$ (on $S_n$)

has a solution, $v = v_{n,j}$, distinct from zero, continuous in $D_n + S_n$.


14 Here $\lim D_n = D_1 + D_2 + \cdots$.





Under suitable general conditions on $u$ and $v$, in view of the fundamental
formula (2.7) we have

(5.11) $\int_{D_n}^{(m)} [vF(u) - uG(v)]\,dT = \int_{S_n} \left[v\frac{du}{d\nu} - u\frac{dv}{d\nu} + L_n uv\right] dS$,

where (cf. (2.7a)) $L_n$ is defined on $S_n$ by

(5.12) $L_n = \sum_i \nu_i^{(n)} B_i - \sum_{i,k} \nu_i^{(n)} \frac{\partial A_{i,k}}{\partial x_k}$;

here the $\nu_i^{(n)}$ are direction cosines of the normal to the surface $S_n$. In view of the
previously made suppositions with respect to the $B_i$, $A_{i,k}$ and the surface $S_n$,
the function $L_n$ is continuous on $S_n$.

It is observed that, if $v$ satisfies the second relation (5.10) and $u$ satisfies the
last equation (5.7), one has

$v\frac{du}{d\nu} - u\frac{dv}{d\nu} + L_n uv = (b_n - a_n + L_n)uv$ (on $S_n$).

Hence in order to make the above expression vanish on $S_n$ we shall take

(5.13) $b_n = a_n - L_n$.

Theorem 5.1. 1°. The set of characteristic values $\{\lambda_{n,i}\}$ [(5.6)] is identical
with the set of characteristic values $\{\mu_{n,j}\}$ [(5.9)].$^{16}$ We write $\lambda_{n,i} = \mu_{n,i}$.

2°. When $F$ is self adjoint the characteristic values are all real and the characteristic
functions may be taken real.

3°. There are on hand orthogonality relations:

(5.14) $\int_{D_n}^{(m)} u_{n,i}(x)v_{n,j}(x)\,dT = 0$ (for $\lambda_{n,i} \ne \lambda_{n,j}$);

(5.14a) $\int_{D_n}^{(m)} u_{n,i}(x)u_{n,j}(x)\,dT = 0$ (for $\lambda_{n,i} \ne \lambda_{n,j}$, when $F$ is self adjoint).

To prove (1°) consider a characteristic value $\lambda_{n,i}$ for (5.7); thus,

(5.15) $F(u_{n,i}) + \lambda_{n,i}u_{n,i} = 0$ (in $D_n$), $\frac{du_{n,i}}{d\nu} + a_n(x)u_{n,i} = 0$ (on $S_n$),

where $u_{n,i}$ is a characteristic function. Suppose, if possible, that $\lambda_{n,i}$ is not a
characteristic value for (5.10) (under (5.13)).$^{15}$ Then every non homogeneous
problem

$G(v) + \lambda_{n,i}v = g$ (in $D_n$), $\frac{dv}{d\nu} + b_n v = 0$ (on $S_n$),

15 Nothing is said about the multiplicities of these values.
16 We take $n_1 < n_2 < \cdots$; $n_s \to \infty$ with $s$.



where $\lambda_{n,i}$ is fixed but $g$ is any function continuous in $D_n + S_n$, will have a
solution $v$. Now substitute this, supposedly existent, function $v$, as well as
$u = u_{n,i}$, in (5.11). One obtains

$0 = \int_{D_n}^{(m)} [vF(u_{n,i}) - u_{n,i}G(v)]\,dT = -\int_{D_n}^{(m)} u_{n,i}\,g\,dT$,

which implies that $u_{n,i} = 0$ in $D_n$, contrary to the assertion made subsequent
to (5.7). Hence $\lambda_{n,i}$ is a characteristic value for the “associated” homogeneous
boundary value problem. The converse is proved by a similar method.

Substitute in (5.11) $u = u_{n,i}$, $v = v_{n,j}$. In view of the remark preceding
(5.13) the integrand of the second member in (5.11) is zero on $S_n$; accordingly

(5.16) $0 = \int_{D_n}^{(m)} [v_{n,j}F(u_{n,i}) - u_{n,i}G(v_{n,j})]\,dT = (\lambda_{n,j} - \lambda_{n,i}) \int_{D_n}^{(m)} u_{n,i}v_{n,j}\,dT$.

This implies (5.14).
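
The passage between the two members of (5.16) is a direct substitution. The following sketch spells it out, assuming only the differential equations in (5.7) and (5.10) and writing the characteristic values of (5.10) as $\lambda_{n,j}$, in accordance with part 1° of the Theorem.

```latex
% Sketch of the substitution behind (5.16); notation as in (5.7), (5.10).
\begin{align*}
F(u_{n,i}) &= -\lambda_{n,i}\,u_{n,i}, \qquad
G(v_{n,j}) = -\lambda_{n,j}\,v_{n,j} \qquad (\text{in } D_n),\\
v_{n,j}F(u_{n,i}) - u_{n,i}G(v_{n,j})
  &= (\lambda_{n,j} - \lambda_{n,i})\,u_{n,i}v_{n,j}.
\end{align*}
% Integrating over D_n, the surface term of (5.11) vanishing under (5.13):
\[
0 = (\lambda_{n,j} - \lambda_{n,i}) \int_{D_n}^{(m)} u_{n,i}v_{n,j}\,dT ,
\]
% so that the integral is zero whenever \lambda_{n,i} \neq \lambda_{n,j}.
```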

In view of (4.2) and (4.3) the case when $F$ is self adjoint, that is when $F = G$,
is on hand if and only if

(5.17) $B_i(x) = \sum_k \frac{\partial A_{i,k}}{\partial x_k}$ (in $D$).

Thus in the self adjoint case it follows by (5.12) that

$L_n = 0$.

Under (5.13), the homogeneous boundary value problems (5.7) and (5.10)
become identical. In consequence of these considerations and of (5.14) it is
inferred that (5.14a) holds as stated.

With $F$ self adjoint suppose, if possible, that there is a non real characteristic
value $\lambda_{n,i}$. On taking conjugates of the members in (5.15) we deduce that $\bar\lambda_{n,i}$
will be another characteristic value, while $\bar u_{n,i}$ will be a characteristic function
corresponding to $\bar\lambda_{n,i}$. By virtue of (5.16), where we put

$\lambda_{n,j} = \bar\lambda_{n,i}$, $v_{n,j} = \bar u_{n,i}$,

it is concluded that

$(\bar\lambda_{n,i} - \lambda_{n,i}) \int_{D_n}^{(m)} |u_{n,i}|^2\,dT = 0$

and that, accordingly, $\bar\lambda_{n,i} - \lambda_{n,i} = 0$. This is impossible with $\lambda_{n,i}$ non real.
Thus part 2° of the Theorem has been established.

In the self adjoint case the $u_{n,i}$ ($i = 1, 2, \cdots$) will be always arranged as a set
orthonormal in $D_n$; we define

$u_{n,i}(x) = 0$ (in $D - (D_n + S_n)$);

the $u_{n,i}(x)$ will form a set orthonormal in $D$.

When $F$ is not self adjoint orthonormalization of the $u_{n,i}$ ($i = 1, 2, \cdots$)
will yield a sequence whose members in general will not be characteristic functions.
This apparently involves the main reason for the difficulty of the treatment
of equations which are not necessarily self adjoint, at least in certain
questions relating to such equations.

The set of numbers

$\lambda_{n,i}$ ($n, i = 1, 2, \cdots$)

is at most denumerably infinite. Hence there exists a real number $\lambda_0$ distinct
from all of the above numbers. One may write

$F(u) + \lambda u = F^*(u) + \lambda^* u$, $G(u) + \lambda u = G^*(u) + \lambda^* u$,

where

$F^* = F + \lambda_0$, $G^* = G + \lambda_0$, $\lambda^* = \lambda - \lambda_0$.

This amounts to augmenting the coefficients of $u$ in $F$ and $G$ by $\lambda_0$. In view of
these considerations, augmenting, if necessary, $C$ (and, hence $C' = C'' + C$,
where $C''$ is independent of $C$) by a suitable $\lambda_0$ and retaining the original notation,
we may consider that the $\lambda_{n,i}$ ($n, i = 1, 2, \cdots$) are all distinct from zero;
clearly, this entails no loss of generality.
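
As a hedged illustration of the shift just described ($\lambda_0$ being the real number chosen above), the characteristic values of the shifted problem may be obtained as follows. The symbol $\lambda^*_{n,i}$ is introduced here only for the purpose of this sketch.

```latex
% The shift by \lambda_0, written out for a single equation.
\[
F(u) + \lambda u = \bigl(F(u) + \lambda_0 u\bigr) + (\lambda - \lambda_0)u
                 = F^*(u) + \lambda^* u .
\]
% If F(u_{n,i}) + \lambda_{n,i}u_{n,i} = 0 in D_n, then
% F^*(u_{n,i}) + (\lambda_{n,i} - \lambda_0)u_{n,i} = 0 in D_n,
% the boundary condition on S_n being unaffected. Hence the
% characteristic values of the shifted problem are
\[
\lambda^*_{n,i} = \lambda_{n,i} - \lambda_0 \neq 0 \qquad (n, i = 1, 2, \cdots),
\]
% with the same characteristic functions u_{n,i}.
```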

In the sequel we confine ourselves to the case when $F$ is self adjoint and when,
accordingly, (5.17) holds.

6. The non homogeneous self adjoint problem 

Let us first consider equation (5.1) for a non real value of $\lambda$. Then, in view
of the reality of the characteristic values (cf. 2° of Theorem 5.1) of the problem
(5.7), it can be asserted that the problem (with continuous $f$ possibly complex
valued, $\subset L_2$ in $D$)

(6.1) $F(u) + \lambda u = f$ (in $D_n$), $\frac{du}{d\nu} + a_n(x)u = 0$ (on $S_n$)

has a continuous solution $u_n$ in $D_n + S_n$. The function $\bar u_n$ will satisfy

$F(\bar u_n) + \bar\lambda \bar u_n = \bar f$ (in $D_n$), $\frac{d\bar u_n}{d\nu} + a_n(x)\bar u_n = 0$ (on $S_n$).

Accordingly application of (5.11) to

$u = u_n$, $v = \bar u_n$

will yield

$0 = \int_{D_n}^{(m)} [\bar u_n F(u_n) - u_n F(\bar u_n)]\,dT$

and, hence,

$\int_{D_n}^{(m)} |u_n|^2\,dT = \frac{1}{\lambda - \bar\lambda} \int_{D_n}^{(m)} (\bar u_n f - u_n \bar f)\,dT$.






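By the Schwarzian inequality

$\int_{D_n}^{(m)} |u_n|^2\,dT \le \frac{2}{|\lambda - \bar\lambda|} \left[\int_{D_n}^{(m)} |u_n|^2\,dT\right]^{1/2} \left[\int_{D_n}^{(m)} |f|^2\,dT\right]^{1/2}$

and, finally,

(6.2) $\int_{D_n}^{(m)} |u_n|^2\,dT \le \frac{4}{|\lambda - \bar\lambda|^2} \int_{D_n}^{(m)} |f|^2\,dT \le \frac{4}{|\lambda - \bar\lambda|^2} \int_{D}^{(m)} |f|^2\,dT = \alpha(\lambda)$

($n = 1, 2, \cdots$; $\alpha(\lambda)$ independent of $n$ and $< +\infty$). We define

(6.3) $u_n(x) = 0$ (in $D - (D_n + S_n)$).

Then

(6.4) $\int_{D}^{(m)} |u_n|^2\,dT \le \alpha(\lambda)$ ($n = 1, 2, \cdots$; cf. (6.2)).

This is analogous to a procedure in (C).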
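
The passage from the identity preceding the Schwarzian inequality to (6.2) may be sketched as follows. The shorthand $N_n$ is introduced here for illustration only and is not part of the original notation.

```latex
% Shorthand (an assumption of this sketch): N_n^2 = \int_{D_n} |u_n|^2 dT.
\begin{align*}
N_n^2 &= \frac{1}{|\lambda - \bar\lambda|}
         \left| \int_{D_n}^{(m)} (\bar u_n f - u_n \bar f)\,dT \right|
       \le \frac{2}{|\lambda - \bar\lambda|} \int_{D_n}^{(m)} |u_n|\,|f|\,dT \\
      &\le \frac{2}{|\lambda - \bar\lambda|}\, N_n
         \left[ \int_{D_n}^{(m)} |f|^2\,dT \right]^{1/2}.
\end{align*}
% If N_n > 0, divide by N_n and square (the case N_n = 0 is trivial):
\[
N_n^2 \le \frac{4}{|\lambda - \bar\lambda|^2} \int_{D_n}^{(m)} |f|^2\,dT
      \le \frac{4}{|\lambda - \bar\lambda|^2} \int_{D}^{(m)} |f|^2\,dT .
\]
% Since |\lambda - \bar\lambda| = 2\,|\mathrm{Im}\,\lambda|, the last member
% may also be written (\mathrm{Im}\,\lambda)^{-2} \int_D |f|^2\,dT.
```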

In consequence of a theorem of F. Riesz, inequalities of the form (6.4) imply
that there exists an infinite subsequence

$u_{n_i}(x)$ ($i = 1, 2, \cdots$)

and a function $u(x)$, for which

(6.5) $\int_{D}^{(m)} |u(x)|^2\,dT \le \alpha(\lambda)$,

such that

$u_{n_i}(x) \to u(x)$ (in $D$; as $n_i \to \infty$)

in the weak sense.

Now, inasmuch as $u_{n_r}(x)$ is a solution of (5.1), in $D_{n_r}$, $u_{n_r}$ satisfies the integral
equation (5.4); thus with $r > i$

(6.6) $u_{n_r}(z) = \int_{D_{n_i+1}}^{(m)} u_{n_r}(x)\left[K_m^0(x, z) + \sum_{j=1}^{m+1} \lambda^j A_{m,j}(x, z)\right] dT_x + f_{\rho,m}(z, \lambda)$

for $(z)$ in $D_{n_i}$ ($\rho(z)$, $> 0$, being suitably chosen so that the kernel vanishes for
$(x)$ in $D - D_{n_i+1}$, when $(z)$ is in $D_{n_i}$).

We recall now a theorem of F. Riesz which, for a single variable, asserts
that if

$f_\nu(x) \subset L_2$ (on $(a, b)$; $\nu = 1, 2, \cdots$), $f_\nu(x) \to f(x)$ (weakly)

and if

$g(x) \subset L_2$ (on $(a, b)$)

then

$\lim_\nu \int_a^b f_\nu(x)g(x)\,dx = \int_a^b f(x)g(x)\,dx$.





An obvious extension of this theorem to the $m$-space of the point $(x)$ enables
us to assert that

(6.7) $\lim_r \int_{D_{n_i+1}}^{(m)} u_{n_r}(x)K_m(x, z)\,dT_x = \int_{D_{n_i+1}}^{(m)} u(x)K_m(x, z)\,dT_x$

(for $(z)$ in $D_{n_i}$), if

$K_m(x, z) \subset L_2$ (in $(x)$, over $D$).

Now the latter property certainly holds by the preceding.

Thus, given any $(z)$ in $D$, we choose $i$ so that $D_{n_i}$ contains $(z)$; from (6.6)
and (6.7) we then infer that

(6.8) $\lim_r u_{n_r}(z) = u'(z)$

exists in the ordinary sense for $(z)$ in $D$; clearly

(6.9) $u'(z) = u(z)$ (in $D$)

and

$u(z) - \int_{D}^{(m)} u(x)K_m(x, z)\,dT_x = f_{\rho,m}(z, \lambda)$.

In (6.9) $u(z)$ is the function for which inequality (6.5) has been asserted.

We have the following theorem. 

Theorem 6.1. Consider the partial differential equation (5.1), with $f(x)$ continuous
and $\subset L_2$ in $D$; let $F$ be self adjoint.

For every non real value of the parameter $\lambda$ (5.1) possesses a solution $u(x)$ such
that

$\int_{D}^{(m)} |u(x)|^2\,dT \le \alpha(\lambda)$ ($\alpha(\lambda)$ from (6.2)).

This theorem is analogous to a result established in (C) for the case when
(1.3) holds.

We now wish to investigate the more complicated situation when $\lambda$ is allowed
to be real. In this there will be a certain analogy with an earlier work of
Trjitzinsky.$^{4}$

For $\lambda$ distinct from the $\lambda_{n,i}$ ($i = 1, 2, \cdots$) the problem (6.1) has a solution
$u_n(x)$. On substituting $u = u_n$, $v = u_{n,i}$ in (5.11) and noting (5.15) and the
fact that $L_n = 0$ we obtain

$\int_{D_n}^{(m)} [u_{n,i}F(u_n) - u_nF(u_{n,i})]\,dT = 0$

and

$\int_{D_n}^{(m)} u_n(x)u_{n,i}(x)\,dT = \frac{1}{\lambda - \lambda_{n,i}} \int_{D_n}^{(m)} f(x)u_{n,i}(x)\,dT$.

Thus, by (6.3)

(6.10) $\int_{D}^{(m)} u_n(x)u_{n,i}(x)\,dT = \frac{1}{\lambda - \lambda_{n,i}} \int_{D}^{(m)} f(x)u_{n,i}(x)\,dT$.





It is noted that (6.10) constitutes a relation between the Fourier coefficients,
with respect to the $u_{n,i}$ ($i = 1, 2, \cdots$), of the functions $u_n(x)$ and $f(x)$.
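
The relation (6.10) follows from the two differential equations by a short computation. A sketch, using only (6.1), (5.15) and the vanishing of the surface term of (5.11) in the self adjoint case:

```latex
% u_n satisfies F(u_n) = f - \lambda u_n; u_{n,i} satisfies
% F(u_{n,i}) = -\lambda_{n,i} u_{n,i} (both in D_n).
\begin{align*}
u_{n,i}F(u_n) - u_nF(u_{n,i})
  &= u_{n,i}(f - \lambda u_n) + \lambda_{n,i}\,u_n u_{n,i} \\
  &= f\,u_{n,i} - (\lambda - \lambda_{n,i})\,u_n u_{n,i}.
\end{align*}
% The integral of the first member over D_n being zero, one has
\[
(\lambda - \lambda_{n,i}) \int_{D_n}^{(m)} u_n u_{n,i}\,dT
   = \int_{D_n}^{(m)} f\,u_{n,i}\,dT ,
\]
% and D_n may be replaced by D, since u_n and u_{n,i} vanish
% in D - (D_n + S_n).
```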

Now an orthonormal set $\{u_{n,i}\}$ ($i = 1, 2, \cdots$) is necessarily complete. This
can be established by an extension of the familiar methods, involved in known
analogous, though much simpler, boundary value problems.

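On writing

(6.11) $u_n^i = \int_{D}^{(m)} u_n(y)u_{n,i}(y)\,dT$,

we have

(6.11a) $\int_{D}^{(m)} |u_n(x)|^2\,dT = \sum_i |u_n^i|^2$.

In view of (6.10)

(6.12) $S_n^\lambda = \sum_i |u_n^i|^2 = \sum_i \frac{|f^{n,i}|^2}{|\lambda - \lambda_{n,i}|^2}$, $f^{n,i} = \int_{D}^{(m)} f(x)u_{n,i}(x)\,dT$.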

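Definition 6.1. We designate by $T$ a set of points $\lambda$, in the complex $\lambda$-plane,
such that

(6.13) $\sum_i \frac{|f^{n,i}|^2}{|\lambda - \lambda_{n,i}|^2} \le B(\lambda) < +\infty$ ($n = n_1, n_2, \cdots$; $f^{n,i}$ from (6.12))$^{16}$

for all $\lambda$ in $T$, the function $B(\lambda)$ being independent of $n$.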

Let us form the complement, with respect to the $\lambda$-plane, of the set of points

(6.14) $\lambda_{n,i}$ ($n = n_1, n_2, \cdots$; $i = 1, 2, \cdots$).

We define $O$ as the set of interior points of this complement. Clearly $O$ contains
all the points not on the axis of reals. $O$ will contain also some points on the axis
of reals if the complement with respect to the axis, of the set of points (6.14) contains
interior (with respect to the axis) points; the latter points will belong to $O$.$^{17}$ With
$\delta$ positive, let $O_\delta$ be the subset of $O$ consisting of points at the distance $\ge \delta$ from
the frontier of $O$; then $O_\delta$ will be a closed two dimensional set; moreover,

(6.15) $|\lambda - \lambda_{n,i}| \ge \delta$ ($n = n_1, n_2, \cdots$; $i = 1, 2, \cdots$)

whenever $\lambda$ is in $O_\delta$. In view of (6.15) we have

$\sum_i \frac{|f^{n,i}|^2}{|\lambda - \lambda_{n,i}|^2} \le \frac{1}{\delta^2} \sum_i |f^{n,i}|^2$ (in $O_\delta$)

and, in consequence of the second relation (6.12) as well as of Bessel’s inequality,

$\sum_i \frac{|f^{n,i}|^2}{|\lambda - \lambda_{n,i}|^2} \le \frac{1}{\delta^2} \int_{D}^{(m)} |f(x)|^2\,dT$ (in $O_\delta$).


17 It is conceivable to have the set (6.14) everywhere dense on the axis of reals, in which
case, by definition, $O$ will consist of the $\lambda$-plane minus the axis of reals. However, the set
$T$ of Definition 6.1 may have points on the axis of reals, even under these circumstances,
provided the functions $f$ are suitably chosen.





The set $O$, defined in connection with (6.14), is a set $T$ of Definition 6.1 with

$B(\lambda) = \frac{1}{\delta^2} \int_{D}^{(m)} |f(x)|^2\,dT$

for $\lambda$ in $O$; here $\delta$ is the distance from the point representing $\lambda$ to the frontier of $O$.
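
A short consistency check, offered as a sketch under the assumption that the set (6.14) is not empty: for non real $\lambda$ the bound $B(\lambda)$ just stated is never worse than the bound $\alpha(\lambda)$ of (6.2).

```latex
% O consists of interior points and contains every non real point,
% so every frontier point of O lies on the axis of reals.
% Hence, for non real \lambda, the distance \delta to the frontier satisfies
\[
\delta \ge |\mathrm{Im}\,\lambda| = \tfrac{1}{2}\,|\lambda - \bar\lambda| ,
\]
% and consequently
\[
B(\lambda) = \frac{1}{\delta^2} \int_{D}^{(m)} |f(x)|^2\,dT
  \le \frac{4}{|\lambda - \bar\lambda|^2} \int_{D}^{(m)} |f(x)|^2\,dT
  = \alpha(\lambda).
\]
```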

An important feature of sets $T$ of the special form $O$ lies in the fact that they
are independent of the choice of $f$, in so far as $f \subset L_2$ (in $D$); this is not so for
the general sets $T$. In any case, of course, $T$ includes all the points not on the
axis of reals.

By (6.12) and (6.13)

$S_n^\lambda \le B(\lambda)$ ($n = n_1, n_2, \cdots$; in $T$)

and, in view of (6.11a),

(6.16) $\int_{D}^{(m)} |u_n(x)|^2\,dT \le B(\lambda) < +\infty$ ($n = n_1, n_2, \cdots$)

for $\lambda$ in $T$. The second member, here, being independent of $n$, one may assert
that the sequence $(n_1, n_2, \cdots)$ contains a subsequence $(k_1, k_2, \cdots)$ so that

$u_{k_j}(x) \to u(x)$ (as $k_j \to \infty$; in $D$),

convergence being in the weak sense to a function $u(x)$ for which

$\int_{D}^{(m)} |u(x)|^2\,dT \le B(\lambda)$.

By a reasoning employed before and on making use of the integral equation it is
concluded that $u(x)$ is a solution of the non homogeneous problem (5.1) for
$\lambda$ in $T$.

We have the following Existence Theorem. 

Theorem 6.2. Let $F$ be self adjoint (in $D$). The $\lambda_{n,i}$ ($n = n_1, n_2, \cdots$;
$i = 1, 2, \cdots$) are the characteristic numbers of a sequence of approximating
boundary value problems; together with the characteristic functions $u_{n,i}$ the $\lambda_{n,i}$
satisfy (5.15). In terms of the $u_{n,i}(x)$ ($u_{n,1}, u_{n,2}, \cdots$ orthonormalized in $D$) we
define numbers $f^{n,i}$ by the second relations (6.12). Let $T$ be a set for which (6.13)
of Definition 6.1 holds.

Then the non homogeneous problem

$F(u) + \lambda u = f$

will have a solution $u$, in $D$, $\subset L_2$ in $D$ for every $\lambda$ in $T$.

Definition 6.2. It will be said that $F$ is of class I on a set $O$, if every solution,
$\subset L_2$ in $D$, of the homogeneous problem

$F(u) + \lambda u = 0$

is zero in $D$, this being so for every $\lambda$ in $O$. In the contrary case $F$ will be said
to be of class II.

Designation of classes I, II, here, is suggested by an analogous usage in
Carleman's theory of integral equations.





It is to be recalled that corresponding to the equation (5.1) we have an iterated
integral equation (5.4); the latter may be written in the form

(6.17) $u(z) - \int_{D}^{(m)} u(x) \sum_{j=0}^{m+1} \lambda^j A_{m,j}(x, z)\,dT_x = f_{\rho,m}(z, \lambda)$ ($A_{m,0} = K_m^0$).

On taking account of the text in connection with (4.20) we note that in (6.17)
the field of integration may be taken as $D_{n+1}$ if $(z)$ is in $D_n$. The expression

(6.18) $a(z \mid D) = \sum_{j=0}^{m+1} \left[\int_{D}^{(m)} A_{m,j}^2(z_1, z)\,dT_{z_1}\right]^{1/2}$

defines a function for all $(z)$ in $D$. In fact, let $(z)$ be a fixed point in $D$. Then
for some $n$ the domain $D_n$ contains $(z)$; for all $(z_1)$ in $D - D_{n+1}$ we have
$A_{m,j}(z_1, z) = 0$. Hence

$\int_{D}^{(m)} A_{m,j}^2(z_1, z)\,dT_{z_1} = \int_{D_{n+1}}^{(m)} A_{m,j}^2(z_1, z)\,dT_{z_1}$.

Here

$|A_{m,j}(z_1, z)| \le \gamma_n < \infty$ (all $(z)$ in $D_n$; all $(z_1)$ in $D$)

in view of the statement with respect to (4.20a). Thus the integral above exists.

7. Spectral theory 

The “spectrum” of $F$, with respect to the boundary value problem

(7.1) $F(u) + \lambda u = 0$ (in $D_n$; parameter $\lambda$), $\frac{du}{d\nu} + a_n(x)u = 0$ (on $S_n$),

we define to be the function $\theta_n(x, y \mid \lambda)$ for which

(7.2) $\theta_n(x, y \mid \lambda) = \sum_{0 < \lambda_{n,i} < \lambda} u_{n,i}(x)u_{n,i}(y)$ (for $\lambda > 0$),
$\theta_n(x, y \mid \lambda) = -\sum_{\lambda \le \lambda_{n,i} < 0} u_{n,i}(x)u_{n,i}(y)$ (for $\lambda < 0$),

while $\theta_n(x, y \mid 0) = 0$; this function is defined for $(x)$, $(y)$ in $D_n + S_n$ and for all
real $\lambda$. Clearly $\theta_n(x, y \mid \lambda) = \theta_n(y, x \mid \lambda)$. Such definition is possible inasmuch
as it had been previously arranged to have the $\lambda_{n,i}$ all distinct from zero and
since the $\lambda_{n,i}$ are all real.

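In consequence of the definition of $u_{n,i}$ we have

$\theta_n(x, y \mid \lambda) = 0$

for $(x)$ exterior to $D_n + S_n$, as well as for $(y)$ exterior to $D_n + S_n$.

From the definition of $\theta_n$ it follows that

$\Delta_{\lambda_1}^{\lambda_2}\theta_n = \theta_n(x, y \mid \lambda_2) - \theta_n(x, y \mid \lambda_1) = \sum_{\lambda_1 \le \lambda_{n,i} < \lambda_2} u_{n,i}(x)u_{n,i}(y)$

when $\lambda_2 > \lambda_1$. Thus, on taking

$\lambda_0 < \lambda_1 < \cdots < \lambda_q$,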



we obtain

(7.3) $\delta_n = \sum_{\nu=1}^{q} \left|\Delta_{\lambda_{\nu-1}}^{\lambda_\nu}\theta_n(x, y \mid \lambda)\right| \le \sum_{\lambda_0 \le \lambda_{n,i} < \lambda_q} |u_{n,i}(x)u_{n,i}(y)|$

for $(x)$ and $(y)$ in $D$.

In view of the first relation (5.15) and of the previously established connection
with integral equations we have

(7.4) $u_{n,i}(z) = \sum_{j=0}^{m+1} \lambda_{n,i}^j \int_{D_n}^{(m)} u_{n,i}(x)A_{m,j}(x, z)\,dT_x$ [$A_{m,0} = K_m^0$]

for $(z)$ in $D_{n-1}$ (if $\rho = \rho(z)$, $> 0$, is suitably chosen). The functions in the kernel
of (7.4) have the general properties indicated subsequent to (5.3).

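In consequence of (7.3) and (7.4) one has

(7.5) $\delta_n \le \sum_i \left|\sum_{j_1=0}^{m+1} \lambda_{n,i}^{j_1} \int_{D_n}^{(m)} u_{n,i}(z_1)A_{m,j_1}(z_1, x)\,dT_{z_1}\right| \cdot \left|\sum_{j_2=0}^{m+1} \lambda_{n,i}^{j_2} \int_{D_n}^{(m)} u_{n,i}(z_2)A_{m,j_2}(z_2, y)\,dT_{z_2}\right|$

for $(x)$ and $(y)$ in $D_{n-1}$. Here (as well as in (7.4)) $D_n$ may be replaced by $D$.
On letting $\lambda^*$ denote the greatest of the numbers $1$, $|\lambda_0|$, $|\lambda_q|$ from (7.5) it is
inferred that

(7.6) $\delta_n \le (\lambda^*)^{2m+2} \sum_{j_1, j_2} \sum_i \left|\int_{D}^{(m)} A_{m,j_1}(z_1, x)u_{n,i}(z_1)\,dT_{z_1}\right| \cdot \left|\int_{D}^{(m)} A_{m,j_2}(z_2, y)u_{n,i}(z_2)\,dT_{z_2}\right|$ ($(x)$, $(y)$ in $D_{n-1}$).

Thus

$\delta_n \le (\lambda^*)^{2m+2} \sum_{j_1, j_2} \left\{\left[\sum_i \left|\int_{D}^{(m)} A_{m,j_1}(z_1, x)u_{n,i}(z_1)\,dT_{z_1}\right|^2\right]^{1/2} \left[\sum_i \left|\int_{D}^{(m)} A_{m,j_2}(z_2, y)u_{n,i}(z_2)\,dT_{z_2}\right|^2\right]^{1/2}\right\}$

for $(x)$, $(y)$ in $D_{n-1}$. On making use of Bessel's inequality we obtain

(7.7) $\delta_n \le (\lambda^*)^{2m+2} \sum_{j_1} \left[\int_{D}^{(m)} A_{m,j_1}^2(z_1, x)\,dT_{z_1}\right]^{1/2} \cdot \sum_{j_2} \left[\int_{D}^{(m)} A_{m,j_2}^2(z_2, y)\,dT_{z_2}\right]^{1/2} = (\lambda^*)^{2m+2} a(x, y)$

($(x)$, $(y)$ in $D_{n-1}$), with

(7.7a) $a(x, y) = a(x \mid D)\,a(y \mid D)$,

where $a(x \mid D)$ is defined by (6.18).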
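
The chain of inequalities leading to (7.7) rests on three elementary steps. A sketch is given below; the abbreviations $c_i$, $d_i$ are introduced for this illustration only and do not occur in the original text.

```latex
% Abbreviations (assumptions of this sketch), for fixed j_1, j_2, x, y:
%   c_i = \int_D A_{m,j_1}(z_1,x) u_{n,i}(z_1) dT_{z_1},
%   d_i = \int_D A_{m,j_2}(z_2,y) u_{n,i}(z_2) dT_{z_2}.
% Step 1. For \lambda_0 \le \lambda_{n,i} < \lambda_q and 0 \le j_1, j_2 \le m+1,
%         since \lambda^* \ge 1 and |\lambda_{n,i}| \le \lambda^*:
\[
|\lambda_{n,i}|^{\,j_1 + j_2} \le (\lambda^*)^{2m+2}.
\]
% Step 2. Cauchy's inequality for sums:
\[
\sum_i |c_i|\,|d_i| \le
  \Bigl[\sum_i |c_i|^2\Bigr]^{1/2} \Bigl[\sum_i |d_i|^2\Bigr]^{1/2}.
\]
% Step 3. Bessel's inequality for the set {u_{n,i}}, orthonormal in D:
\[
\sum_i |c_i|^2 \le \int_{D}^{(m)} A_{m,j_1}^2(z_1, x)\,dT_{z_1},
\qquad
\sum_i |d_i|^2 \le \int_{D}^{(m)} A_{m,j_2}^2(z_2, y)\,dT_{z_2}.
\]
% Summing over j_1 and j_2, the double sum factors into the product
% a(x|D) a(y|D) of (7.7a), with a(.|D) from (6.18).
```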





It is essential to note that the last member in (7.7) is independent of the
mode of subdivision of the interval $(\lambda_0, \lambda_q)$ and is independent of $n$.

By (7.7) and in view of the definition of $\delta_n$, given in (7.3), one has

(7.8) $V_\alpha^\beta\,\theta_n(x, y \mid \lambda) \le (\lambda^*)^{2m+2} a(x, y)$

for $(x)$, $(y)$ in $D_{n-1}$. Here $V_\alpha^\beta$ denotes variation, with respect to $\lambda$, on the interval
$(\alpha, \beta)$; $\lambda^*$ is the greatest of the numbers $|\alpha|$, $|\beta|$, $1$. Also, since $\theta_n(x, y \mid 0) = 0$
we have

$|\theta_n(x, y \mid \lambda)| = |\Delta_0^\lambda\,\theta_n|$

so that, in view of (7.7),

(7.9) $|\theta_n(x, y \mid \lambda)| \le (\lambda')^{2m+2} a(x, y)$

($(x)$, $(y)$ in $D_{n-1}$) where $\lambda'$ is the greater of the numbers $1$, $|\lambda|$.

For any integer $r \ge n$, corresponding to (7.4), we have

(7.10) $u_{r,i}(z) = \sum_{j=0}^{m+1} \lambda_{r,i}^j \int_{D_n}^{(m)} u_{r,i}(x)A_{m,j}(x, z)\,dT_x$

for $(z)$ in $D_{n-1}$. This is a consequence of the fact that $D_r$ ($r \ge n$) contains $D_n$
and that for $(x)$ in $D - D_n$

(7.11) $A_{m,j}(x, z) = 0$ (for $(z)$ in $D_{n-1}$).

By a reasoning of the type employed subsequent to (7.4) for the purpose of
derivation of (7.8) and (7.9) we now proceed from (7.10), (7.11) obtaining the
inequalities

(7.12) $V_\alpha^\beta\,\theta_r(x, y \mid \lambda) \le (\lambda^*)^{2m+2} a(x, y)$,

(7.13) $|\theta_r(x, y \mid \lambda)| \le (\lambda')^{2m+2} a(x, y)$

for $(x)$, $(y)$ in $D_{n-1}$ and for $r = n + 1, \cdots$.

If the $\theta_r$ ($r \ge n$) had a property in the nature of equicontinuity in $(x)$, $(y)$,
for $(x)$, $(y)$ in $D_{n-1}$, then the “Compactness” theorem of Carleman$^{2}$ would be
applicable. Under the existing conditions, however, a modification of the developments
in (C$_1$; pp. 21-24), together with (7.12), (7.13), enables assertion
of the following.

Given $n$ ($> 0$), there exists a subsequence

(7.14) $\theta_{r(n,1)}, \theta_{r(n,2)}, \cdots$

such that the limit

$\lim_i \theta_{r(n,i)} = \theta(x, y \mid \lambda)$

exists in $D_{n-1}$, for real $\lambda$; moreover,

(7.15) $V_\alpha^\beta\,\theta(x, y \mid \lambda) \le (\lambda^*)^{2m+2} a(x, y)$,

(7.15a) $|\theta(x, y \mid \lambda)| \le (\lambda')^{2m+2} a(x, y)$




for $(x)$ and $(y)$ in $D_{n-1}$, for all real $\lambda$, except perhaps for a denumerable infinity
of values $\lambda$. Offhand, $\theta$ may depend on $n$; whenever $(x)$ or $(y)$ is in $D - D_{n-1}$,
we define this function $\theta$ as zero.

We select a subsequence (7.14) for $n = 2$, obtaining a limiting function,
${}_1\theta(x, y \mid \lambda)$ for $(x)$, $(y)$ in $D_1$. From this subsequence we choose another subsequence

(7.16) $\theta_{r(3,1)}, \theta_{r(3,2)}, \theta_{r(3,3)}, \cdots$

converging, for $(x)$, $(y)$ in $D_2$, to a limiting function ${}_2\theta$, which is necessarily
identical with ${}_1\theta$ for $(x)$, $(y)$ in $D_1$.

We replace the sequence (7.16) by the sequence

(7.17) $\theta_{r(2,1)}, \theta_{r(3,2)}, \theta_{r(3,3)}, \theta_{r(3,4)}, \cdots$

which has the same limit as the sequence (7.16). From (7.17) we select a
sequence

$\theta_{r(2,1)}, \theta_{r(3,2)}, \theta_{r(4,3)}, \theta_{r(4,4)}, \theta_{r(4,5)}, \cdots$

converging to ${}_3\theta$ in $D_3$; clearly

${}_3\theta = {}_2\theta$ (for $(x)$, $(y)$ in $D_2$), ${}_3\theta = {}_1\theta$ (for $(x)$, $(y)$ in $D_1$).

Continuing this process of consecutive selections one obtains a sequence

$\theta_{r(2,1)}, \theta_{r(3,2)}, \theta_{r(4,3)}, \theta_{r(5,4)}, \cdots$

converging to a limiting function

$\theta(x, y \mid \lambda)$

for $(x)$, $(y)$ in $D$, for all real $\lambda$, except perhaps for a denumerable infinity of
values $\lambda$; $\theta$ is independent of $n$ and satisfies the conditions stated in connection
with (7.15), (7.15a), this being so for all $(x)$, $(y)$ in $D$.

Definition 7.1. A function $\theta(x, y \mid \lambda)$ obtained as described above will be
termed a spectrum of $F$, associated with the sequence of boundary value problems (7.1).

Theorem 7.1. Associated with the sequence of boundary value problems (7.1)
there exists at least one spectrum $\theta$, satisfying the conditions stated in connection
with (7.15), (7.15a) (cf. (7.7a)); we also note the statement preceding Definition 7.1.

In the remainder of this section we shall state a number of properties of
$\theta(x, y \mid \lambda)$. These properties are analogous to those found, for functions designated
as $\theta$, by Carleman in Chapter I of (C$_1$). These results may be proved
with the aid of the theorems referred to in (C$_1$; 7-24), as well as of Theorem 7.1.
We shall omit the details of proof.

According to the preceding, there exists a sequence

$\theta_{m_1}, \theta_{m_2}, \cdots$ ($m_1 < m_2 < \cdots$)

such that

(7.19) $\lim \theta_{m_s} = \theta$ ($(x)$, $(y)$ in $D$; as $m_s \to \infty$).

With (7.19) in view we shall write $n = m_s$. The following is asserted.





For almost all $(x)$ in $D$

(7.20) $\psi(x, \lambda) = \int_{D}^{(m)} \theta(x, y \mid \lambda)h(y)\,dT_y = \lim \psi_n(x, \lambda)$, $\psi_n(x, \lambda) = \int_{D}^{(m)} \theta_n(x, y \mid \lambda)h(y)\,dT_y$,

whenever $h(y) \subset L_2$ in $D$. If, in addition, $g \subset L_2$ in $D$, then

$\int_{D}^{(m)} g(x)\psi(x, \lambda)\,dT_x = \lim \int_{D}^{(m)} g(x)\psi_n(x, \lambda)\,dT_x = \lim \int_{D}^{(m)} \int_{D}^{(m)} \theta_n(x, y \mid \lambda)g(x)h(y)\,dT_x\,dT_y$;

the order of integration in

(7.21) $\int_{D}^{(m)} \int_{D}^{(m)} \theta(x, y \mid \lambda)g(x)h(y)\,dT_x\,dT_y$ ($g, h \subset L_2$ in $D$)

is immaterial.

On using (7.17) (for 0 n ) it is inferred that 


o * r r (m) “i* 

Vif n (x, X), Vi fix, \)£ao | h(y) \ 2 dT v a(x \ D), 

_J D J 


where a 0 = a 0 (a, 0) is finite for a, 0 finite. Moreover, 


(7.22) 

where 


fnix, X) |, | f(x, X) | aia 2 


l h{y) 1 2 dT y 


(7.23) ax = (X') 2m+2 [X' = max. (1, | X |]. 

We have, whenever g, h d L t in D, 

J p(m) pirn) 

/ e n ix,y\\)g(x)h( y )dT x dT y 

d Jd 

and 

1 p tm) p (m) 

/ / 6 n (x,y\\)g(x)h(y)dT x dT y 

I Jd J d 

r pim) “lir pirn) -|i 

| g(x) pdTxJ [/ D |%)| 2 dr,J; 

these inequalities will hold for 6(x, y | X) as well. 

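When $\omega(\lambda)$ is continuous for $\lambda_0 \le \lambda \le \lambda_1$ one has

$\int_{\lambda_0}^{\lambda_1} \omega(\lambda)\,d_\lambda\theta(x, y \mid \lambda) = \lim \int_{\lambda_0}^{\lambda_1} \omega(\lambda)\,d_\lambda\theta_n(x, y \mid \lambda)$,

$\int_{\lambda_0}^{\lambda_1} \omega(\lambda)\,d_\lambda \int_{D}^{(m)} h(y)\theta(x, y \mid \lambda)\,dT_y = \lim \int_{\lambda_0}^{\lambda_1} \omega(\lambda)\,d_\lambda \int_{D}^{(m)} h(y)\theta_n(x, y \mid \lambda)\,dT_y$,

(7.25) $\int_{\lambda_0}^{\lambda_1} \omega(\lambda)\,d_\lambda \int_{D}^{(m)} \int_{D}^{(m)} \theta(x, y \mid \lambda)h(x)g(y)\,dT_x\,dT_y = \lim \int_{\lambda_0}^{\lambda_1} \omega(\lambda)\,d_\lambda \int_{D}^{(m)} \int_{D}^{(m)} \theta_n(x, y \mid \lambda)h(x)g(y)\,dT_x\,dT_y$

for $(x)$, $(y)$ in $D$, whenever $h, g \subset L_2$ in $D$.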
Also 


/ (»*) r r Xi *i r X! r Mm) “i 

J <*>Lix w ^ x ) <***(*> ^ I x ) J = J x <o( x ) dx ^J D 0(a:, y|X)X(j/)dr„J, 

/ (m) Mm) r i»Xi “I 

g(x)h(y) I w(x) d x e(x, y | X) J dT x dT y 

/•x 1 r i»(m) /•(»») r ”1 

~ w(X) j^dx 1^ 6(x,y\\)g(x)h(y)dT x dT v \, 

/ int) ( fXj r f(m) “T\ 

ff(x) | to(X) dx Jn 0(x, y | X)fc(y) dT v J j dT 1 * 

/ Xi r pim) p (m) ”1 

w(X) dx ^ 1^ 1^ e ( x > V I \)g(x)h{y) dT x dT y J. 


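The integral

(7.26) $\int_{-\infty}^{\infty} d_\lambda \int_{D}^{(m)} \int_{D}^{(m)} \theta(x, y \mid \lambda)\bar h(x)h(y)\,dT_x\,dT_y$

converges for all $h \subset L_2$ (in $D$). There exists a function $\psi(x)$, $\subset L_2$ (in $D$),
such that

$\psi_l(x) = \int_{-l}^{l} d_\lambda \left[\int_{D}^{(m)} \theta(x, y \mid \lambda)h(y)\,dT_y\right] \to \psi(x)$

(as $l \to +\infty$), convergence being in the mean square for $(x)$ in $D$.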


8. Developments on the basis of spectra 

For a fixed $n = m_s$ consider the set of functions $u_{n,i}$ ($i = 1, 2, \cdots$). Recalling
the character of this sequence it is observed that the Parseval's equality for
the $u_{n,i}$, expressed with the aid of a Stieltjes integral, is of the form

$\int_{-\infty}^{\infty} d_\lambda \int_{D}^{(m)} \int_{D}^{(m)} \theta_n(x, y \mid \lambda)\bar h(x)h(y)\,dT_x\,dT_y \le \int_{D}^{(m)} |h(y)|^2\,dT_y$.

Here $h$ is any function, $\subset L_2$ in $D$, and $\theta_n$ is the function introduced in (7.2).
With $0 < l < \infty$, we have

(8.1) $B_{n,l} = \int_{-l}^{l} d_\lambda \int_{D}^{(m)} \int_{D}^{(m)} \theta_n(x, y \mid \lambda)\bar h(x)h(y)\,dT_x\,dT_y \le \int_{D}^{(m)} |h(y)|^2\,dT_y$.





This may be asserted in view of the fact that

$\int_{\alpha}^{\beta} d_\lambda \int_{D}^{(m)} \int_{D}^{(m)} \theta_n(x, y \mid \lambda)\bar h(x)h(y)\,dT_x\,dT_y \ge 0$

for all real $\alpha$, $\beta$ ($\alpha < \beta$). In consequence of (7.25)

$\lim_n B_{n,l} = \int_{-l}^{l} d_\lambda \int_{D}^{(m)} \int_{D}^{(m)} \theta(x, y \mid \lambda)\bar h(x)h(y)\,dT_x\,dT_y$.

Hence by (8.1)

$\int_{-l}^{l} d_\lambda \int_{D}^{(m)} \int_{D}^{(m)} \theta(x, y \mid \lambda)\bar h(x)h(y)\,dT_x\,dT_y \le \int_{D}^{(m)} |h(y)|^2\,dT_y$.

Thus, on letting $l \to \infty$ we obtain (note convergence of the integral (7.26))

(8.2) $\int_{-\infty}^{\infty} d_\lambda \int_{D}^{(m)} \int_{D}^{(m)} \theta(x, y \mid \lambda)\bar h(x)h(y)\,dT_x\,dT_y \le \int_{D}^{(m)} |h(y)|^2\,dT_y$,

which is the generalized Bessel’s inequality related to our problem; this inequality
is associated with a certain sequence of homogeneous boundary value problems.
It will be convenient to introduce the following Definition.

Definition 8.1. Suppose $F$ is self adjoint. It will be said that $F$ is closed with respect to a spectrum $\theta$ if

$$\int_{-\infty}^{\infty} d_\lambda \int_D^{(m)} \int_D^{(m)} \theta(x, y \mid \lambda)\, \bar h(x) h(y)\, dT_x\, dT_y = \int_D^{(m)} |h(y)|^2\, dT_y \tag{8.3}$$

for every $h \subset L_2$ (in $D$).

It is observed that (8.3) is a particular instance of (8.2) and constitutes a generalized Parceval's relation.

Suppose now $F$ is closed with respect to a spectrum $\theta$. Let $h$ be real valued. Replacing $h(x)$ in (8.3) by $a(x) + b(x)$, where $a(x), b(x) \subset L_2$ (in $D$), in consequence of the closed character of $F$ we obtain

$$\int_{-\infty}^{\infty} d_\lambda \int_D^{(m)} \int_D^{(m)} \theta(x, y \mid \lambda)\, a(x) b(y)\, dT_x\, dT_y + \int_{-\infty}^{\infty} d_\lambda \int_D^{(m)} \int_D^{(m)} \theta(x, y \mid \lambda)\, b(x) a(y)\, dT_x\, dT_y = 2 \int_D^{(m)} a(y) b(y)\, dT_y . \tag{8.4}$$

Interchanging $(x)$ and $(y)$ in the second term of the first member above, making use of the symmetry of $\theta$ and taking note of the permissibility of the interchange of order of integration in the integral (7.21), it is concluded that this term is equal to the first integral displayed in (8.4). Accordingly

$$\int_{-\infty}^{\infty} d_\lambda \int_D^{(m)} \int_D^{(m)} \theta(x, y \mid \lambda)\, a(x) b(y)\, dT_x\, dT_y = \int_D^{(m)} a(y) b(y)\, dT_y . \tag{8.5}$$

Let $C$ be a "cube" in the $m$-space, defined by inequalities

$$c_j \le x_j \le c_j + L \qquad (j = 1, \cdots m;\ L > 0). \tag{8.6}$$





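We choose the point $(c) = (c_1, \cdots c_m)$ and $L$ sufficiently great so that

$$D \subset C, \tag{8.6a}$$

supposing that $D$ is bounded. We define $\theta$, for $(x)$ and $(y)$ in $C$, by the relations

$$\theta(x, y \mid \lambda) = 0 \quad ((x) \text{ in } C - D;\ (y) \text{ in } C), \qquad \theta(x, y \mid \lambda) = 0 \quad ((y) \text{ in } C - D;\ (x) \text{ in } C). \tag{8.7}$$

One then may express (8.5) in the form

$$\int_{-\infty}^{\infty} d_\lambda \int_C^{(m)} \int_C^{(m)} \theta(x, y \mid \lambda)\, a(x) b(y)\, dT_x\, dT_y = \int_C^{(m)} a(y) b(y)\, dT_y , \tag{8.8}$$

where $a(x)$, $b(y)$ are any functions $\subset L_2$ in $C$, while

$$a(x) = 0 \quad (\text{in } C - D). \tag{8.8a}$$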

Substitute, in particular,

$$b(y) = b_t(y) = b(t_1, \cdots t_m;\ y_1, \cdots y_m),$$

where the function $b_t(y)$ is defined, for $(t)$ and $(y)$ in $C$, by the relations

$$b_t(y) = 1 \quad (c_j \le y_j \le t_j;\ j = 1, \cdots m), \qquad b_t(y) = 0 \quad (\text{elsewhere}).{}^{18} \tag{8.9}$$

From (8.8) we then obtain

$$\int_{-\infty}^{\infty} d_\lambda\, \sigma(t; \lambda) = \int_{y_1 = c_1}^{t_1} \cdots \int_{y_m = c_m}^{t_m} a(y)\, dT_y , \tag{8.10}$$

where, in view of (8.8a),

$$\sigma(t; \lambda) = \int_D^{(m)} a(x) \left\{ \int_{y_1 = c_1}^{t_1} \cdots \int_{y_m = c_m}^{t_m} \theta(x, y \mid \lambda)\, dT_y \right\} dT_x . \tag{8.10a}$$
With the aid of (8.10) the following result is established. 

Theorem 8.1. Suppose $\theta$ is a spectrum, with respect to which $F$ is closed, and that $D$ is bounded. There is then on hand the following generalized Fourier expansion

$$a(t) = \frac{\partial^m}{\partial t_1 \cdots \partial t_m} \int_{-\infty}^{\infty} d_\lambda\, \sigma(t; \lambda) \tag{8.11}$$

for almost all $(t)$ in $D$; this is valid for all $a \subset L_2$ in $D$; here $\sigma(t; \lambda)$ is defined by (8.10a), with $\theta$ subject to (8.7) ($C$ defined in (8.6), (8.6a)).
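The passage from (8.10) to (8.11) can be made explicit in the simplest case. The following sketch takes $m = 1$ and writes $\Phi$ (a symbol not used in the text, introduced here only for illustration) for the left member of (8.10).

```latex
\begin{align*}
\Phi(t) &:= \int_{-\infty}^{\infty} d_\lambda\, \sigma(t;\lambda)
         = \int_{c_1}^{t} a(y)\, dy
         && \text{by (8.10) with } m = 1, \\
\frac{d\Phi}{dt}(t) &= a(t)
         && \text{for almost all } t \text{ in } D .
\end{align*}
```

The second line is Lebesgue's theorem on differentiation of an indefinite integral; it applies because a function $\subset L_2$ on a bounded set is integrable there. For general $m$ the single derivative is replaced by the mixed derivative $\partial^m / \partial t_1 \cdots \partial t_m$, which is the form stated in (8.11).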

An important application of spectra is, as will be now shown, in establishing 
explicit formulas for solutions of the non homogeneous problem (5.1). 

We have 


(8.12) Unix) = 2 W n,< Wn,i(3). 

i 


18 That is, for points $(y)$ such that at least some $y_j$ exceeds $t_j$ ($(y)$, $(t)$ in $C$).





Ordinary convergence, here, instead of convergence in the mean square takes place by reasoning of the same type as is used to prove the analogous fact in certain known simpler classical cases. As has been shown previously $u_n(x) \to u(x)$ (as $n = k_j \to \infty$) in the weak sense as well as in the ordinary sense. Certainly

$$\int_{(c)}^{(x)} u_{k_j}(x)\, dT_x \to \int_{(c)}^{(x)} u(x)\, dT_x \tag{8.13}$$

as $k_j \to \infty$, for $(x)$ in $C$.${}^{19}$ Here and in the sequel

$$\int_{(c)}^{(x)} a(x)\, dT_x = \int_{c_1}^{x_1} \cdots \int_{c_m}^{x_m} a(t_1, \cdots t_m)\, dt_1 \cdots dt_m \tag{8.14}$$

($(x)$ in $C$; $a(x)$ integrable over $C$). Moreover, the functions $u_n(x)$ are taken equal to zero in $C - D$.

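In view of (6.10) and of (6.11), from (8.12) it is deduced that

$$u_n(x) = \sum_i \frac{1}{\lambda - \lambda_{n,i}}\, f_{n,i}\, u_{n,i}(x) = \sum_i \frac{1}{\lambda - \lambda_{n,i}}\, u_{n,i}(x) \int_D^{(m)} f(y) u_{n,i}(y)\, dT_y = \int_{-\infty}^{\infty} \frac{1}{\lambda - \rho}\, d_\rho \int_D^{(m)} f(y)\, \theta_n(x, y \mid \rho)\, dT_y . \tag{8.15}$$

We write

$$u_n(x) = \tau_{n,l}(x) + r_{n,l}(x), \tag{8.16}$$

$$\tau_{n,l}(x) = \int_{-l}^{l} \frac{1}{\lambda - \rho}\, d_\rho \int_D^{(m)} f(y)\, \theta_n(x, y \mid \rho)\, dT_y , \tag{8.16a}$$

$$r_{n,l}(x) = \left( \int_{-\infty}^{-l} + \int_{l}^{\infty} \right) \frac{1}{\lambda - \rho}\, d_\rho \int_D^{(m)} f(y)\, \theta_n(x, y \mid \rho)\, dT_y . \tag{8.16b}$$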
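A finite-dimensional analogue may help fix the structure of the expansion (8.15). The sketch below is an illustration only and is not taken from the paper: the operator $F$ is replaced by the second-difference matrix with zero boundary values, whose characteristic values and orthonormal characteristic vectors are known in closed form, and all function names are hypothetical. It forms the sum of $f_i/(\lambda - \lambda_i)$ times $u_i$ and checks that the result satisfies $F u + \lambda u = f$.

```python
import math

def second_difference(N):
    """Return the N x N matrix tridiag(1, -2, 1), a discrete stand-in for F."""
    return [[-2.0 if j == k else 1.0 if abs(j - k) == 1 else 0.0
             for k in range(N)] for j in range(N)]

def characteristic_pairs(N):
    """Values lam_i and orthonormal vectors u_i with F u_i + lam_i u_i = 0."""
    h = math.pi / (N + 1)
    pairs = []
    for i in range(1, N + 1):
        lam_i = 2.0 - 2.0 * math.cos(i * h)
        u_i = [math.sqrt(2.0 / (N + 1)) * math.sin(i * j * h)
               for j in range(1, N + 1)]
        pairs.append((lam_i, u_i))
    return pairs

def spectral_solution(f, lam, pairs):
    """Discrete analogue of (8.15): sum over i of f_i / (lam - lam_i) * u_i."""
    u = [0.0] * len(f)
    for lam_i, u_i in pairs:
        f_i = sum(a * b for a, b in zip(f, u_i))   # coefficient of f along u_i
        coeff = f_i / (lam - lam_i)
        u = [uj + coeff * uij for uj, uij in zip(u, u_i)]
    return u

def residual(F, u, lam, f):
    """Largest entry of |F u + lam u - f|."""
    n = len(f)
    return max(abs(sum(F[j][k] * u[k] for k in range(n)) + lam * u[j] - f[j])
               for j in range(n))
```

Here `lam` must stay away from the values `lam_i`, which is the discrete counterpart of the condition (8.20); a non real `lam` always qualifies.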

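On making use of the symbol (8.14) and employing the extended definition (8.7) of $\theta_n$ one may express (8.16)-(8.16b) in the form

$$\int_{(c)}^{(x)} u_n(x)\, dT_x = \int_{(c)}^{(x)} \tau_{n,l}(x)\, dT_x + \int_{(c)}^{(x)} r_{n,l}(x)\, dT_x , \tag{8.17}$$

$$\int_{(c)}^{(x)} \tau_{n,l}(x)\, dT_x = \int_{-l}^{l} \frac{1}{\lambda - \rho}\, d_\rho \int_{(c)}^{(x)} \int_C^{(m)} f(y)\, \theta_n(x, y \mid \rho)\, dT_x\, dT_y , \tag{8.17a}$$

$$\int_{(c)}^{(x)} r_{n,l}(x)\, dT_x = \left( \int_{-\infty}^{-l} + \int_{l}^{\infty} \right) \frac{1}{\lambda - \rho}\, d_\rho \int_{(c)}^{(x)} \int_C^{(m)} f(y)\, \theta_n(x, y \mid \rho)\, dT_x\, dT_y . \tag{8.17b}$$

From (8.17a), (8.17b) we finally obtain

$$\int_{(c)}^{(x)} \tau_{n,l}(x)\, dT_x = \int_{-l}^{l} \frac{1}{\lambda - \rho}\, d_\rho \int_C^{(m)} \int_C^{(m)} b_x(t) f(y)\, \theta_n(t, y \mid \rho)\, dT_t\, dT_y , \tag{8.18}$$

$$\int_{(c)}^{(x)} r_{n,l}(x)\, dT_x = \left( \int_{-\infty}^{-l} + \int_{l}^{\infty} \right) \frac{1}{\lambda - \rho}\, d_\rho \int_C^{(m)} \int_C^{(m)} b_x(t) f(y)\, \theta_n(t, y \mid \rho)\, dT_t\, dT_y \tag{8.19}$$

for $(x)$ in $C$; here $b_x(t)$ is from (8.9).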

Let $\Gamma$ (Definition 6.1) be of the form $O$ (cf. text subsequent to (6.14)). For $\lambda$ in $\Gamma$ one has

$$|\lambda - \lambda_{n,i}| \ge \delta = \delta(\lambda) > 0 \qquad (n = k_1, k_2, \cdots;\ i = 1, 2, \cdots), \tag{8.20}$$

where $\delta(\lambda)$ is independent of $n$ and $i$. Let $G(\delta)$ be the set of points whose distance from the interval $(-l, +l)$ is $< \delta$.

Case A. The point represented by $\lambda$ is on the frontier of $G(\delta)$ or is exterior to $G(\delta)$.

Case B. The point $\lambda$ is in $G(\delta)$.

In Case A we have

$$|\lambda - \rho| \ge \delta > 0 \qquad (\text{for all } \rho \text{ on } (-l, l)) \tag{8.21}$$

and, consequently,

$$\frac{1}{\lambda - \rho}, \tag{8.21a}$$

as a function of $\rho$, is continuous on the closed interval $(-l, l)$. In view of (7.25) from (8.18) it can be therefore deduced that

$$\lim_{k_j} \int_{(c)}^{(x)} \tau_{k_j, l}(x)\, dT_x = \int_{-l}^{l} \frac{1}{\lambda - \rho}\, d_\rho \int_C^{(m)} \int_C^{(m)} b_x(t) f(y)\, \theta(t, y \mid \rho)\, dT_t\, dT_y . \tag{8.22}$$

In Case B we take $l$ sufficiently great so that the circle $S(\lambda, \delta)$, whose center is at $\lambda$ and whose radius is $\delta$, intersects the interval in two points, represented by numbers $\alpha$, $\beta$, where

$$-l < \alpha < \beta < l .$$

The discussion, below, will still hold without any essential modifications when $l$ is left unchanged. In view of (8.20) there are no points $\lambda_{n,i}$ $(n = k_1, k_2, \cdots;\ i = 1, 2, \cdots)$ in the open interval $(\alpha, \beta)$. Whence

$$\int_{\alpha}^{\beta} \frac{1}{\lambda - \rho}\, d_\rho \int_C^{(m)} \int_C^{(m)} b_x(t) f(y)\, \theta_n(t, y \mid \rho)\, dT_t\, dT_y = 0$$

$(n = k_1, k_2, \cdots)$ and, by (8.18),

$$\int_{(c)}^{(x)} \tau_{n,l}(x)\, dT_x = \left( \int_{-l}^{\alpha} + \int_{\beta}^{l} \right) \frac{1}{\lambda - \rho}\, d_\rho \int_C^{(m)} \int_C^{(m)} b_x(t) f(y)\, \theta_n(t, y \mid \rho)\, dT_t\, dT_y . \tag{8.23}$$

In the field of integration indicated in (8.23) one has (8.21) and, accordingly, the function (8.21a) is continuous, in $\rho$, for $\rho$ on the closed intervals

$$(-l, \alpha), \qquad (\beta, l).$$

We apply (7.25) to each of the integrals

$$\int_{-l}^{\alpha}, \qquad \int_{\beta}^{l},$$

letting $n = k_j \to \infty$. From (8.23) it is thus inferred that


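$$\lim_{k_j} \int_{(c)}^{(x)} \tau_{k_j, l}(x)\, dT_x = \left( \int_{-l}^{\alpha} + \int_{\beta}^{l} \right) \frac{1}{\lambda - \rho}\, d_\rho \int_C^{(m)} \int_C^{(m)} b_x(t) f(y)\, \theta(t, y \mid \rho)\, dT_t\, dT_y . \tag{8.24}$$

On the other hand, inasmuch as

$$\int_C^{(m)} \int_C^{(m)} b_x(t) f(y)\, \theta_n(t, y \mid \rho)\, dT_t\, dT_y = \gamma_n(x) \qquad (n = k_1, k_2, \cdots),$$

where $\gamma_n(x)$ is independent of $\rho$ for $\alpha < \rho < \beta$, and in so far as the limit

$$\lim_{k_j} \int_C^{(m)} \int_C^{(m)} b_x(t) f(y)\, \theta_{k_j}(t, y \mid \rho)\, dT_t\, dT_y = \int_C^{(m)} \int_C^{(m)} b_x(t) f(y)\, \theta(t, y \mid \rho)\, dT_t\, dT_y = \gamma(x) \tag{8.25}$$

exists (in view of the developments of section 7), it is concluded that $\gamma(x)$ of (8.25) is independent of $\rho$ for $\alpha < \rho < \beta$. The implication of the latter fact is the relation

$$\int_{\alpha}^{\beta} \frac{1}{\lambda - \rho}\, d_\rho \int_C^{(m)} \int_C^{(m)} b_x(t) f(y)\, \theta(t, y \mid \rho)\, dT_t\, dT_y = 0 . \tag{8.26}$$

By virtue of (8.26) one may put (8.24) in the form of (8.22); that is, the equality (8.22) holds in any case (for every $\lambda$ in $\Gamma$).

In order to study the function (8.19) we take $l$ so great that

$$-l < R\lambda < l \qquad (R\lambda = \text{real part of } \lambda).$$

It is then deduced that

$$|\lambda - \rho| \ge l - R\lambda > 0, \tag{8.27}$$

for $\rho \ge l$, and that

$$|\lambda - \rho| \ge l + R\lambda > 0 \tag{8.27a}$$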





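for $\rho \le -l$. In view of (8.27) we find that

$$\left| \int_{l}^{\infty} \frac{1}{\lambda - \rho}\, d_\rho \int_C^{(m)} \int_C^{(m)} b_x(t) f(y)\, \theta_{k_j}(t, y \mid \rho)\, dT_t\, dT_y \right| \le \frac{1}{l - R\lambda}\, V_{l}^{\infty} \int_C^{(m)} \int_C^{(m)} b_x(t) f(y)\, \theta_{k_j}(t, y \mid \rho)\, dT_t\, dT_y$$

$(j = 1, 2, \cdots)$. Now by virtue of (8.9) and (7.24)

$$V_{l}^{\infty} \cdots \le A = L^{m/2} \left[ \int_D^{(m)} |f(y)|^2\, dT_y \right]^{1/2},$$

where $L$ is from (8.6); $A$ is independent of $k_j$. Thus

$$\left| \int_{l}^{\infty} \frac{1}{\lambda - \rho}\, d_\rho \int_C^{(m)} \int_C^{(m)} b_x(t) f(y)\, \theta_{k_j}(t, y \mid \rho)\, dT_t\, dT_y \right| \le A (l - R\lambda)^{-1} \qquad (j = 1, 2, \cdots;\ (x) \text{ in } C). \tag{8.28}$$

Similarly, on making use of (8.27a) it is concluded that

$$\left| \int_{-\infty}^{-l} \text{integrand of the first member in (8.28)} \right| \le \frac{A}{l + R\lambda} \tag{8.28a}$$

in $C$. Thus, by (8.28), (8.28a) and (8.19)

$$\left| \int_{(c)}^{(x)} r_{n,l}(x)\, dT_x \right| \le \left[ \frac{A}{l - R\lambda} + \frac{A}{l + R\lambda} \right] \qquad ((x) \text{ in } C) \tag{8.29}$$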


for $n = k_1, k_2, \cdots$ and for $\lambda$ in $\Gamma$. For any $\lambda$, fixed as specified, the second member in (8.29) can be made arbitrarily small by suitable choice of $l$; this can be done uniformly with respect to $n = k_j$ $(j = 1, 2, \cdots)$. Using this fact, as well as the relation (8.22) (valid in any case), in view of (8.17) it is deduced that

$$\lim_{k_j} \int_{(c)}^{(x)} u_{k_j}(x)\, dT_x = \int_{-\infty}^{\infty} \frac{1}{\lambda - \rho}\, d_\rho \int_C^{(m)} \int_C^{(m)} b_x(t) f(y)\, \theta(t, y \mid \rho)\, dT_t\, dT_y . \tag{8.30}$$

On taking account of (8.30), (8.9) and (8.13) it is inferred that

$$\int_{(c)}^{(x)} u(x)\, dT_x = \int_{-\infty}^{\infty} \frac{1}{\lambda - \rho}\, d_\rho \int_{(c)}^{(x)} \int_C^{(m)} f(y)\, \theta(x, y \mid \rho)\, dT_x\, dT_y . \tag{8.31}$$

Finally

$$u(x) = \frac{\partial^m}{\partial x_1 \cdots \partial x_m} \int_{-\infty}^{\infty} \frac{1}{\lambda - \rho}\, d_\rho \int_{(c)}^{(x)} \int_C^{(m)} f(y)\, \theta(x, y \mid \rho)\, dT_x\, dT_y \tag{8.32}$$

for $(x)$ in $D$.

In view of the above it is possible to formulate the following Theorem.

Theorem 8.2. Suppose $D$ is bounded. Every solution $u(x)$, for $\lambda$ in $O$, referred to in Theorem 6.2 and satisfying the non homogeneous problem (5.1), is expressible in the form (8.32) (cf. (8.14)).





The results of this section can be extended to hold for $D$ unbounded if the integration $\int_{(c)}^{(x)}$ is replaced by integration $\int_e$, where $e$ are Borel measurable subsets of $D$, with frontiers of zero measure, and if the derivation $\partial^m / \partial x_1 \cdots \partial x_m$ is replaced by set-function derivation $D_x$.


9. Some further developments involving spectra 

We shall now essentially modify the procedures of section 8 and shall establish a number of spectral representations involving integrals convergent in the mean square. Throughout, $\sim$ will denote convergence in the mean square. We shall first establish the following result.

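Theorem 9.1. Suppose $F$ is closed with respect to a spectrum $\theta$ (Definition 8.1). Then every function $a(x)$, $\subset L_2$ in $D$, has the following representation

$$a(x) \sim \int_{-\infty}^{\infty} d_\lambda \int_D^{(m)} \theta(x, y \mid \lambda)\, a(y)\, dT_y \qquad ((x) \text{ in } D). \tag{9.1}$$

To prove this we note that by virtue of the last statement of section 7 there exists a function $b(x)$, $\subset L_2$ in $D$, for which

$$b(x) \sim \int_{-\infty}^{\infty} d_\lambda \int_D^{(m)} \theta(x, y \mid \lambda)\, a(y)\, dT_y . \tag{9.2}$$

The function

$$\gamma(x) = b(x) - a(x)$$

belongs to $L_2$; in view of (8.3) and of the stated property of closure

$$\int_D^{(m)} \gamma^2(y)\, dT_y = \int_{-\infty}^{\infty} d_\lambda \int_D^{(m)} \int_D^{(m)} \theta(x, y \mid \lambda)\, \gamma(x) \gamma(y)\, dT_x\, dT_y = \lim_{l} w_l , \tag{9.3}$$

where

$$w_l = \int_{-l}^{l} d_\lambda \int_D^{(m)} \int_D^{(m)} \theta(x, y \mid \lambda)\, \gamma(x) \gamma(y)\, dT_x\, dT_y .$$

Replacing here $\gamma(y)$ by $b(y) - a(y)$ and making use of the formula preceding (7.26) one obtains

$$w_l = w_{l,1} - w_{l,2} , \tag{9.4}$$

where

$$w_{l,1} = \int_{-l}^{l} d_\lambda \int_D^{(m)} \int_D^{(m)} \theta(x, y \mid \lambda)\, \gamma(x) b(y)\, dT_x\, dT_y$$

and

$$w_{l,2} = \int_{-l}^{l} d_\lambda \int_D^{(m)} \int_D^{(m)} \theta(x, y \mid \lambda)\, \gamma(x) a(y)\, dT_x\, dT_y = \int_D^{(m)} \gamma(x)\, a_l(x)\, dT_x ; \tag{9.5}$$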





here

$$a_l(x) = \int_{-l}^{l} d_\lambda \int_D^{(m)} \theta(x, y \mid \lambda)\, a(y)\, dT_y .$$

In consequence of (8.5), which holds by virtue of the closure of $F$, one has

$$\lim_{l} w_{l,1} = \int_D^{(m)} \gamma(x) b(x)\, dT_x . \tag{9.6}$$

Now by (9.2)

$$\lim_{l} \int_D^{(m)} \bigl( b(x) - a_l(x) \bigr)^2\, dT_x = 0 .$$

Hence from (9.5) it is inferred that

$$\lim_{l} w_{l,2} = \int_D^{(m)} \gamma(x) b(x)\, dT_x . \tag{9.6a}$$

From (9.4), (9.6), (9.6a) we finally conclude that

$$\lim_{l} w_l = 0 .$$

Whence, as may be seen from (9.3), $\gamma(y) = 0$; thus $b(x) = a(x)$ and the relation (9.2) becomes the representation (9.1), which was to be established.

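Theorem 9.2. Let $\lambda$ be in $O$. The problem

$$F(u) + \lambda u = f \tag{9.7}$$

has a solution ($\subset L_2$ in $D$)

$$u(x) \sim \int_{-\infty}^{\infty} \frac{1}{\lambda - \rho}\, d_\rho \int_D^{(m)} f(y)\, \theta(x, y \mid \rho)\, dT_y . \tag{9.8}$$

It will be sufficient to give the proof for real $\lambda$ in $O$ and for $f$ real valued. We shall write

$$s_n(x \mid \rho) = \int_D^{(m)} f(y)\, \theta_n(x, y \mid \rho)\, dT_y , \qquad s(x \mid \rho) = \int_D^{(m)} f(y)\, \theta(x, y \mid \rho)\, dT_y .$$

It will be first proved that, for $0 < l < +\infty$ and for $\lambda$ in $O$,

$$\lim_{n_j} \int_{-l}^{l} \frac{1}{\lambda - \rho}\, d_\rho\, s_{n_j}(x \mid \rho) = \int_{-l}^{l} \frac{1}{\lambda - \rho}\, d_\rho\, s(x \mid \rho). \tag{9.9}$$

With $\delta = \delta(\lambda)$ equal to the distance from $\lambda$ to the closure of the set of points $\lambda_{n,i}$ $(n, i = 1, 2, \cdots)$ one has

$$d_\rho\, s_n(x \mid \rho) = 0$$

for

$$\lambda - \frac{\delta}{2} \le \rho \le \lambda + \frac{\delta}{2}$$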





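$$\int_{-l}^{l} \frac{1}{\lambda - \rho}\, d_\rho\, s_{n_j}(x \mid \rho) = \left( \int_{-l}^{\lambda'} + \int_{\lambda''}^{l} \right) \frac{1}{\lambda - \rho}\, d_\rho\, s_{n_j}(x \mid \rho) \tag{9.10}$$

($\lambda' = \lambda - \frac{\delta}{2}$, $\lambda'' = \lambda + \frac{\delta}{2}$; $l$ suitably great). By the formula preceding (7.25)

$$\lim_{n_j} \int_{-l}^{l} \frac{1}{\lambda - \rho}\, d_\rho\, s_{n_j}(x \mid \rho) = \left( \int_{-l}^{\lambda'} + \int_{\lambda''}^{l} \right) \frac{1}{\lambda - \rho}\, d_\rho\, s(x \mid \rho) = \int_{-l}^{l} \frac{1}{\lambda - \rho}\, d_\rho\, s(x \mid \rho), \tag{9.10a}$$

inasmuch as

$$d_\rho\, s(x \mid \rho) = 0 \qquad (\lambda' \le \rho \le \lambda'').$$

Thus (9.9) holds. We let

$$u_n(x, l) = \int_{-l}^{l} \frac{1}{\lambda - \rho}\, d_\rho\, s_n(x \mid \rho), \qquad r_n(x, l) = \left( \int_{-\infty}^{-l} + \int_{l}^{\infty} \right) \frac{1}{\lambda - \rho}\, d_\rho\, s_n(x \mid \rho). \tag{9.11}$$

Then, in view of (8.15),

$$u_n(x) = u_n(x, l) + r_n(x, l). \tag{9.11a}$$

It has been established previously that the limit

$$\lim_{n_j} u_{n_j}(x) = u(x)$$

exists in $D$; also,

$$\lim_{n_j} u_{n_j}(x, l) = u(x, l) = \int_{-l}^{l} \frac{1}{\lambda - \rho}\, d_\rho\, s(x \mid \rho) \tag{9.11b}$$

in consequence of (9.9). Hence, by (9.11a), the limit

$$\lim_{n_j} r_{n_j}(x, l) = r(x, l) \tag{9.11c}$$

similarly exists. With the notation so introduced one has

$$u(x) = u(x, l) + r(x, l); \tag{9.11d}$$

$$\int_D^{(m)} |u(x) - u(x, l)|^2\, dT_x = \int_D^{(m)} |r(x, l)|^2\, dT_x . \tag{9.12}$$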

«/0 •/ £) 


We have (admitting complex values), by virtue of (9.11), 

I 1 /• (m) /^(m) 

I r„(*, i) | 2 = £' L' rrr- rrr~ / / 

a P A A n ,a A A n ,p */z> 

* Wn,a(^)^n,p(ir)Wn t «(y)Wn,p(2) d>Ty dT§ , 

where the primes over the summation symbols indicate that the sums are taken 





corresponding to the intervals (— <», — l), (l, + °°). Using the orthonormality 
relations of the u n ,i (i = 1, 2, • • •) one accordingly obtains 

/ I r„(x, o I 2 dr, = 23' j \ \—12 / / /(j/)/(«)w».a(y)w».«(z) dr„ dr,. 

a j A An,or | 

The spectral form for this relation is 

L 1 ^ z) = (l M + h) |X-p| 2<Z '* b(p >’ 

/»(m) i*(m) 

*»(p) = / / f(y)}{z)d n (y, z | P ) dr tf dr.. 

JjD •//) 


In consequence of (7.24) 


for all finite a, 8. 

With Z > | | (X real) 


Then 


/•(m) 

ViUp)^ \S\'dT 

J D 


1 

1 

C 1 (n > l) 

1 

X - p 

= l - fix 

X - p 


l + -Kx 


(p S — 0- 


r (m) [” i 1 “| /•(»») 

L I rn(x ’ Z) ' 2 - [(I - fix ) 2 + (T+SO 2 J L \f\ 2<iT = * x(z) - 

The limiting function r(x, l ), of (9.11c), will satisfy the inequalities 


J i» (m) /»(m) 

| r(x, Z) | 2 dr, g Ijm | r„(x, Z) [ 2 dT, g a x (0- 

I) n J D 


Hence

$$\lim_{l \to \infty} \int_D^{(m)} |r(x, l)|^2\, dT = 0 .$$

Together with (9.12) and (9.11b), this implies that

$$\int_{-l}^{l} \frac{1}{\lambda - \rho}\, d_\rho\, s(x \mid \rho) \sim u(x) \qquad (\text{in } D;\ \text{as } l \to \infty),$$

which establishes (9.8) of the theorem.

We shall say that $v$ is "admissible" if $v$ is real, $\subset L_2$ (in $D$) and if

$$\frac{\partial v}{\partial \nu} + a_n(x) v = 0 \qquad (\text{on } S_n;\ n = n_1, n_2, \cdots) \tag{9.13}$$

for some functions $a_n(x)$, continuous on $S_n$ $(n = 1, 2, \cdots)$.

It will be of interest to obtain a spectral representation for the differential operator $F$. We form the $u_{n,i}$, $\theta_n$ and $\theta$, corresponding to the $a_n(x)$ $(n = n_1, n_2, \cdots)$ of (9.13). The following can be established (compare with (C)).





If $v$ is "admissible" and $F(v) \subset L_2$ (in $D$), while $F$ is closed with respect to $\theta$, then

$$F(v(x)) \sim - \int_{-\infty}^{\infty} \lambda\, d_\lambda \int_D^{(m)} \theta(x, y \mid \lambda)\, v(y)\, dT_y \tag{9.14}$$

(when there is only one spectrum, it will be independent of $v$, of course); moreover, in order that the integral of the second member in (9.14) should converge in the mean square it is necessary and sufficient that the integral

$$\int_{-\infty}^{\infty} \lambda^2\, d_\lambda \int_D^{(m)} \int_D^{(m)} \theta(x, y \mid \lambda)\, v(x) v(y)\, dT_x\, dT_y \tag{9.15}$$

should converge in the ordinary sense.

To prove (9.14) we apply the Fundamental Formula to $v$, $u_{n,i}$, obtaining the relation

$$\int_D^{(m)} F(v)\, u_{n,i}\, dT_x = - \lambda_{n,i} \int_D^{(m)} v\, u_{n,i}\, dT_x . \tag{9.16}$$

Now by Theorem 9.1

$$F(v(x)) \sim q_l(x) \qquad (\text{as } l \to +\infty), \tag{9.17}$$

where

$$q_l(x) = \int_{-l}^{l} d_\lambda \int_D^{(m)} \theta(x, y \mid \lambda)\, F(v(y))\, dT_y .$$

In view of the formula preceding (7.25)

$$q_l(x) = \lim_{n} q_{l,n}(x) \qquad (\text{as } n = n_j \to \infty),$$

with

$$q_{l,n}(x) = \int_{-l}^{l} d_\lambda \int_D^{(m)} \theta_n(x, y \mid \lambda)\, F(v(y))\, dT_y = \sum_i u_{n,i}(x) \int_D^{(m)} F(v(y))\, u_{n,i}(y)\, dT_y ,$$

where the summation corresponds to the interval $(-l, l)$. By (9.16)

$$q_{l,n}(x) = - \sum_i \lambda_{n,i}\, u_{n,i}(x) \int_D^{(m)} v(y)\, u_{n,i}(y)\, dT_y = - \int_{-l}^{l} \lambda\, d_\lambda \int_D^{(m)} \theta_n(x, y \mid \lambda)\, v(y)\, dT_y .$$

In view of the relation preceding (7.25)

$$q_l(x) = - \int_{-l}^{l} \lambda\, d_\lambda \int_D^{(m)} \theta(x, y \mid \lambda)\, v(y)\, dT_y .$$

This, together with (9.17), implies (9.14). The statement with respect to (9.15) can be proved with the aid of section 7.





10. Operators A and their application 

Let $\lambda$ be in $O$ and let $u_n$ be solution of the problem

$$F(u_n) + \lambda u_n = f \quad (\text{in } D_n), \qquad \frac{\partial u_n}{\partial \nu} + a_n(x) u_n = 0 \quad (\text{on } S_n),$$

while $w_n$ is solution of the problem

$$F(w_n) + \lambda w_n = q \quad (\text{in } D_n), \qquad \frac{\partial w_n}{\partial \nu} + a_n(x) w_n = 0 \quad (\text{on } S_n);$$

here $f$, $q$ are continuous and

$$f, q \subset L_2 \qquad (\text{in } D). \tag{10.1}$$

It is to be noted that the set $O$ is independent of the choice of $f$ and $q$. We define $u_n$, $w_n$ as zero in $D - (D_n + S_n)$.

For $u = u_n$ and $v = w_n$ application of the Fundamental Formula (5.11) will yield

$$\int_D^{(m)} w_n f\, dT_x = \int_D^{(m)} u_n q\, dT_x \qquad (n = n_1, n_2, \cdots). \tag{10.2}$$
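The step from the Fundamental Formula to (10.2) can be sketched as follows. The sketch assumes (as the use made of (5.11) here suggests, though the formula itself is not restated in this section) that for two functions satisfying the same boundary condition on $S_n$ the formula reduces to the symmetric relation in the first line below.

```latex
\begin{align*}
\int_{D_n}^{(m)} \bigl[ w_n F(u_n) - u_n F(w_n) \bigr]\, dT_x &= 0, \\
\int_{D_n}^{(m)} \bigl[ w_n (f - \lambda u_n) - u_n (q - \lambda w_n) \bigr]\, dT_x &= 0, \\
\int_{D_n}^{(m)} w_n f\, dT_x &= \int_{D_n}^{(m)} u_n q\, dT_x .
\end{align*}
```

The two terms $\lambda u_n w_n$ cancel in the second line. Since $u_n$ and $w_n$ are defined as zero in $D - (D_n + S_n)$, the integrals may equally be extended over $D$.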

There exists a subsequence $(k_1, k_2, \cdots)$ of $(n_1, n_2, \cdots)$ such that

$$w_{k_j}(x) \to w(x), \qquad u_{k_j}(x) \to u(x) \qquad (\text{as } k_j \to \infty;\ \text{in } D), \tag{10.3}$$

where

$$u(x), w(x) \subset L_2 \qquad (\text{in } D),$$

while, for $(x)$ in $D$, one has

$$F(u) + \lambda u = f, \tag{10.4}$$

$$F(w) + \lambda w = q; \tag{10.4a}$$

moreover,

$$\int_D^{(m)} |w_{k_j}(x)|^2\, dT_x , \quad \int_D^{(m)} |u_{k_j}(x)|^2\, dT_x \le \sigma(\lambda) < +\infty \tag{10.5}$$

$(j = 1, 2, \cdots)$ with $\sigma(\lambda)$ independent of $j$. These facts are a consequence of the Existence Theorem 6.2.

In view of (10.5), (10.3) and (10.1) from (10.2) we derive

$$\int_D^{(m)} w f\, dT = \int_D^{(m)} u q\, dT . \tag{10.6}$$

By a reasoning of the type employed for an analogous purpose in (C) or using spectral representations one is able to choose the subsequence $(k_1, k_2, \cdots)$ independent of $f$ and $q$.





The limits $u(x)$, $w(x)$ (cf. (10.3)), so obtained, can be represented as

$$u(x) = A_x(\lambda \mid f), \qquad w(x) = A_x(\lambda \mid q). \tag{10.7}$$

Theorem 10.1. Let $O$ be the set defined subsequent to (6.14). Suppose $F$ is self adjoint. There exists an operator

$$A_x(\lambda \mid \cdots)$$

so that, whenever continuous $f, q \subset L_2$ in $D$, formula (10.7) will represent solutions of equations (10.4), (10.4a), respectively, for $\lambda$ in $O$. This operator satisfies the identity

$$\int_D^{(m)} A_x(\lambda \mid q)\, f(x)\, dT_x = \int_D^{(m)} A_x(\lambda \mid f)\, q(x)\, dT_x , \tag{10.8}$$

which is valid for every $\lambda$ in $O$ and for all $f$, $q$ of the described type. $A_x(\lambda \mid \cdots)$ may depend on the choice of the $a_n(x)$.

Theorem 10.1 is analogous to a result in ($C_1$; p. 56).
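The identity (10.8) has an elementary counterpart for matrices, which may serve as a check on the roles of $f$ and $q$. The sketch below is an illustration only, not the operator of Theorem 10.1: $F$ is replaced by a real symmetric matrix, $A(\lambda \mid f)$ by the solution of $(F + \lambda I) u = f$, and the integral by a finite sum without complex conjugation. All names are hypothetical.

```python
def solve(M, b):
    """Solve M x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    A = [list(row) + [b[i]] for i, row in enumerate(M)]   # augmented matrix
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(A[r][c]))  # pivot row
        A[c], A[p] = A[p], A[c]
        for r in range(c + 1, n):
            m = A[r][c] / A[c][c]
            for k in range(c, n + 1):
                A[r][k] -= m * A[c][k]
    x = [0] * n
    for r in range(n - 1, -1, -1):                        # back substitution
        s = A[r][n] - sum(A[r][k] * x[k] for k in range(r + 1, n))
        x[r] = s / A[r][r]
    return x

def resolvent(F, lam, f):
    """Discrete stand-in for A(lam | f): the solution u of F u + lam u = f."""
    n = len(f)
    M = [[F[i][j] + (lam if i == j else 0) for j in range(n)] for i in range(n)]
    return solve(M, f)

def pairing(a, b):
    """Bilinear pairing sum a_j b_j (no conjugation), the analogue of the integral."""
    return sum(x * y for x, y in zip(a, b))
```

For symmetric `F` the matrix `F + lam I` is symmetric, so its inverse is symmetric as well; this is the whole content of the identity in finite dimensions, for real or non real `lam`.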

Let $f, q \subset L_2$ (in $D$) and

$$F(f),\ F(q) \subset L_2 \qquad (\text{in } D). \tag{10.9; 10.10}$$

We then have

$$F(f) + \lambda f = f^* \subset L_2 \qquad (\text{in } D), \tag{10.11}$$

$$F(q) + \lambda q = q^* \subset L_2 \qquad (\text{in } D). \tag{10.11a}$$

Suppose $F$ is of class I, with respect to $O$. Application of (10.8) will enable inversion of (10.11), (10.11a) so as to yield

$$\int_D^{(m)} q(x) f^*(x)\, dT_x = \int_D^{(m)} f(x) q^*(x)\, dT_x . \tag{10.12}$$

Replacing $f^*$, $q^*$ in (10.12) by the expressions from (10.11), (10.11a) one obtains

$$\int_D^{(m)} q(x) F(f)\, dT_x = \int_D^{(m)} f(x) F(q)\, dT_x . \tag{10.13}$$

The condition (10.13) is accordingly necessary in order that $F$ be of class I in $O$.

Conversely, suppose now that (10.13) holds for all functions $f, q \subset L_2$ (in $D$) for which one has (10.10). If $F$ were not of class I (for $\lambda$ in $O$) there would be on hand a function $\varphi(x) \subset L_2$ and a value $\lambda_1$ in $O$ so that

$$F(\varphi) + \lambda_1 \varphi = 0, \tag{10.14}$$

$$\int_D^{(m)} |\varphi(x)|^2\, dT \ne 0 . \tag{10.14a}$$

Now, let us define $q(x)$ as a solution, $\subset L_2$ (in $D$), of the equation

$$F(q) + \lambda_1 q = \varphi . \tag{10.15}$$





By Theorem 6.2 such a solution $q(x)$ exists. Having assumed (10.13), we shall have in particular

$$\int_D^{(m)} q(x) F(\varphi)\, dT_x = \int_D^{(m)} \varphi(x) F(q)\, dT_x . \tag{10.16}$$

Replacing $F(q)$ by the expression obtained from (10.15), in consequence of (10.16) one obtains

$$\int_D^{(m)} |\varphi(x)|^2\, dT_x = \lambda_1 \int_D^{(m)} \varphi(x) q(x)\, dT_x + \int_D^{(m)} q(x) F(\varphi)\, dT_x .$$

Thus, by (10.14)

$$\int_D^{(m)} |\varphi(x)|^2\, dT_x = \int_D^{(m)} q(x) \bigl[ F(\varphi) + \lambda_1 \varphi \bigr]\, dT_x = 0 . \tag{10.17}$$

Now (10.17) is contrary to (10.14a). Hence $F \subset$ I (in $O$). Accordingly the following Theorem has been established.

Theorem 10.2. Consider self adjoint operators $F$. In order that $F$ be of class I (for $\lambda$ in $O$), in accordance with Definition 6.2, it is necessary and sufficient that

$$\int_D^{(m)} q(x) F(f)\, dT_x = \int_D^{(m)} f(x) F(q)\, dT_x$$

for all functions $f$, $q$ belonging to $L_2$ (in $D$), for which $F(f)$, $F(q)$ belong to $L_2$ (in $D$).

On repeatedly using (10.8), it can be shown that the following holds.

Theorem 10.3. If for a fixed non real $\lambda_1$ the equation

$$F(u) + \lambda u = 0 \qquad (\text{in } D) \tag{10.18}$$

has no solutions,${}^{20}$ $\subset L_2$, the same will be true for all non real $\lambda$. For non real values of $\lambda$ the number of distinct${}^{21}$ solutions, $\subset L_2$, of (10.18) is the same.

The proof of this theorem will be omitted, as it may be given following closely the lines of proof of analogous results in ($C_1$; pp. 55, 58); it would be necessary, however, to use some of the previous developments of this section.

We shall now consider questions of uniqueness of solutions for X in O, that is, 
when X may be real. Such considerations would correspond to certain develop¬ 
ments given by Trjitzinsky 4 for singular integral equations. The following will 
be proved. 

Theorem 10.4. 1°. If for a fixed $\lambda_1$, in $O$, the equation (10.18) has no solutions, $\subset L_2$, the same will be true for all non real $\lambda$.

2°. The number $m$ of distinct solutions, $\subset L_2$, of (10.18) for any $\lambda$ fixed in $O$ is equal to or is greater than the number $n$ of distinct solutions for non real values $\lambda$.

Consider part 1°. If $\lambda_1$ is non real the conclusion 1° follows by virtue of


19 Apply theorem ($C_1$; pp. 132, 133).

20 Here and in the sequel trivial, i.e. null, solutions of homogeneous problems are disregarded.

21 I.e., linearly independent.





Theorem 10.3. Take $\lambda_1$ real in $O$.${}^{22}$ If 1° fails, there exists a non real value $\lambda$ and a corresponding function $u$, $\subset L_2$, so that (10.18) holds and

$$\int_D^{(m)} |u|^2\, dT \ne 0 .$$

One has

$$F(u) + \lambda_1 u = (\lambda_1 - \lambda) u . \tag{10.19}$$

Now the equation

$$F(\varphi) + \lambda_1 \varphi = f_1 \qquad (f_1 \subset L_2 \text{ in } D)$$

cannot have two distinct solutions, since otherwise their difference $\omega$ would satisfy (10.18) for $\lambda_1$, while

$$\int_D^{(m)} |\omega|^2\, dT \ne 0 ,$$

contrary to hypothesis. Thus, one may express the relation (10.19) in the form

$$u(x) = A_x\bigl(\lambda_1 \mid (\lambda_1 - \lambda) u\bigr). \tag{10.20}$$

Now by (10.19)

$$F(\bar u) + \lambda_1 \bar u = (\lambda_1 - \bar\lambda) \bar u$$

and, in place of (10.20), one obtains

$$\bar u(x) = A_x\bigl(\lambda_1 \mid (\lambda_1 - \bar\lambda) \bar u\bigr). \tag{10.20a}$$

In consequence of (10.8)

$$\int_D^{(m)} A_x\bigl(\lambda_1 \mid (\lambda_1 - \lambda) u\bigr) (\lambda_1 - \bar\lambda) \bar u\, dT_x = \int_D^{(m)} A_x\bigl(\lambda_1 \mid (\lambda_1 - \bar\lambda) \bar u\bigr) (\lambda_1 - \lambda) u\, dT_x$$

and, substituting from (10.20), (10.20a), we deduce

$$(\lambda_1 - \bar\lambda) \int_D^{(m)} |u|^2\, dT_x = (\lambda_1 - \lambda) \int_D^{(m)} |u|^2\, dT_x ,$$

which implies that $\lambda$ is real, contrary to our supposition.

We now proceed to part 2°. By Theorem 10.3 $n$ is independent of $\lambda$. If 2° does not hold, there is a value $\lambda$, in $O$, for which the number $m$ of distinct solutions,

$$u_1(x), \cdots u_m(x),$$

is less than $n$. By Theorem 10.3 $\lambda$ must be real. Let $\lambda^*$ be a non real value; with $\lambda^*$ there will be associated $n$ distinct solutions,

$$u_1^*(x), \cdots u_n^*(x).$$

We have

$$F(u_j) + \lambda u_j = 0, \qquad F(u_i^*) + \lambda^* u_i^* = 0 \qquad (\text{in } D).$$


22 Supposing that $O$ has real points; cf. text subsequent to (6.14).





Since $\lambda$ is real the $u_j$ may be taken real. Any solution $u^*$ for $\lambda^*$ is expressible in the form

$$u^*(x) = \sum_{\nu = 1}^{n} c_\nu u_\nu^*(x) \qquad (\text{constants } c_\nu).$$

With $m < n$, the $c_\nu$ may be chosen so that $u^*(x)$ will satisfy

$$\int_D^{(m)} |u^*(x)|^2\, dT \ne 0, \qquad \int_D^{(m)} u^*(x) u(x)\, dT = 0 \tag{10.21}$$

for all solutions $u(x)$ for the value $\lambda$.

We have

$$F(u^*) + \lambda^* u^* = 0, \qquad F(\bar u^*) + \bar\lambda^* \bar u^* = 0;$$

whence

$$F(u^*) + \lambda u^* = (\lambda - \lambda^*) u^* = f, \qquad F(\bar u^*) + \lambda \bar u^* = (\lambda - \bar\lambda^*) \bar u^* = \bar f . \tag{10.22}$$

Accordingly

$$u^*(x) = A_x(\lambda \mid f) + w_1(x), \qquad \bar u^*(x) = A_x(\lambda \mid \bar f) + w_2(x); \tag{10.23}$$

here $w_1(x)$, $w_2(x)$ are some solutions of (10.18) for $\lambda$. Since the operator $A$ can be so defined that

$$\overline{A_x(\lambda \mid w)} = A_x(\lambda \mid \bar w) \qquad (\text{all } w \subset L_2),$$

we have

$$w_2(x) = \bar w_1(x).$$

By (10.8)

$$\int_D^{(m)} A_x(\lambda \mid f)\, \bar f\, dT_x = \int_D^{(m)} A_x(\lambda \mid \bar f)\, f\, dT_x .$$

Substitution of (10.23) and of the expression for $f$, $\bar f$ from (10.22) will yield

$$(\lambda - \bar\lambda^*) \int_D^{(m)} (u^* - w_1)\, \bar u^*\, dT = (\lambda - \lambda^*) \int_D^{(m)} (\bar u^* - \bar w_1)\, u^*\, dT .$$

Further, in view of the property stated in connection with (10.21)

$$(\lambda - \bar\lambda^*) \int_D^{(m)} |u^*|^2\, dT_x = (\lambda - \lambda^*) \int_D^{(m)} |u^*|^2\, dT_x .$$

This leads to the conclusion that $\lambda^*$ must be real; thus there arises a contradiction, which completes the proof of the Theorem. This can be extended to the case when the number of solutions could possibly be infinite.

By 1° of Theorem 10.4, if for a fixed $\lambda_1$, in $O$, the equation (10.18) has no solutions, $\subset L_2$ and distinct from zero, $F$ will be of class I for all non real $\lambda$;





by the necessary part of Theorem 10.2 this would imply that the identity of that Theorem holds (for all $f$, $q$ of stated type); in consequence of the sufficient part one then infers that $F \subset$ I in every set $O$. Hence we have the following: if the equation

$$F(u) + \lambda u = 0$$

has no solutions, $\subset L_2$ in $D$ and distinct from zero, for a value $\lambda_1$ in some set $O$, then the same will be true for all $\lambda$ in every conceivable set $O$.

In another paper the present author intends to obtain direct conditions on the $A_{i,j}$, $B_i$, $C$ under which $F$ is of class I, the latter property being important in
connection with questions of uniqueness and the closure of 0. 

University of Illinois 
and 

Institute for Advanced Study 



Annals of Mathematics
Vol. 43, No. 1, January, 1942


ON A THEOREM OF TANNAKA AND KREIN 

By S. Bochner 

(Received June 13, 1941) 

Let $G$ denote an arbitrary group, $C$ the class of almost periodic functions on $G$ and $C_0$ the sub-class of all finite linear combinations of (irreducible) representation coefficients. With the norm

$$\|f\| = \sup_{x \in G} |f(x)|,$$

$C$ is a Banach space, and $C_0$ is a dense but not closed subspace. Therefore a functional $L(f)$ which is additive, that is

$$L(af + bg) = aL(f) + bL(g), \tag{1}$$

need not be bounded on $C_0$ and hence need not be continuable onto all of $C$. However if $L(f)$ is positive, that is

$$L(f) \ge 0 \ \text{ for } f \text{ (real and) } \ge 0,\ f \in C_0 , \tag{2}$$

then $|L(g)| \le 2 \|g\| \cdot L(1)$ and thus $L(f)$ is bounded and has a (unique) extension onto all of $C$ which is again positive. By a recent theorem of M. Krein,${}^1$ assumption (2) can be replaced by the weaker assumption:

$$L(|g|^2) \ge 0 \ \text{ for } g \in C_0 . \tag{3}$$

Thus, if $L(f)$ on $C_0$ has the properties (1) and (3) it also has property (2). This theorem of Krein includes an earlier theorem of T. Tannaka${}^2$ that any functional on $C_0$ having property (1) and the additional property

$$L(f_1 f_2) = L(f_1) \cdot L(f_2)$$

has an extension onto $C$ itself.
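The bound $|L(g)| \le 2\|g\| \cdot L(1)$ quoted above follows from (1) and (2) in a few lines. The sketch below is supplied for the reader and is not part of the original text; it assumes that the constant function $1$ and the real and imaginary parts $g_1$, $g_2$ of $g$ again belong to $C_0$.

```latex
\begin{align*}
&\text{For real } g \in C_0: \quad \|g\| \cdot 1 \pm g \ge 0
  \ \Longrightarrow\ \|g\|\, L(1) \pm L(g) \ge 0
  \ \Longrightarrow\ |L(g)| \le \|g\|\, L(1). \\
&\text{For } g = g_1 + i g_2: \quad
  |L(g)| \le |L(g_1)| + |L(g_2)|
  \le \bigl( \|g_1\| + \|g_2\| \bigr) L(1)
  \le 2 \|g\|\, L(1).
\end{align*}
```

The first line also shows that $L(g)$ is real for real $g$, and the last step uses $\|g_1\|, \|g_2\| \le \|g\|$.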

The proof of Krein is based on ideas of N. Wiener and I. Gelfand which are 
extraneous to the problem, and we are going to give a new proof which stays 
wholly within the technique of uniform approximation to elements of C by 
elements of Co . 

We consider a complete set of irreducible representations

$$\{\varphi^{\rho}_{pq}(x)\}, \qquad x \in G .$$

The letter $\rho$ designates an element of an index set of suitable potency, and for

1 On positive functionals on almost periodic functions, Doklady Moscou 30, pp. 9-12, (1941).
2 Über den Dualitätssatz der nicht-kommutativen topologischen Gruppen, Tohoku J. Math., 46, pp. 1-12, (1938).




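each $\rho$, $p, q = 1, \cdots, h = h(\rho)$, where $h(\rho)$ is the (finite) dimension of the $\rho$th representation. Any element of $C_0$ can be written in the form

$$f(x) = \sum_{\rho, p, q} a^{\rho}_{pq}\, \varphi^{\rho}_{pq}(x), \tag{4}$$

where only a finite number of the constants $a^{\rho}_{pq}$ are $\ne 0$. Introducing

$$\gamma^{\rho}_{pq} = L(\varphi^{\rho}_{pq}) \tag{5}$$

we have

$$L(f) = \sum_{\rho, p, q} a^{\rho}_{pq}\, \gamma^{\rho}_{pq} .$$

We call $L(f)$ a special functional if only a finite number of the constants (5) are $\ne 0$. In this case, since

$$|a^{\rho}_{pq}| \le \|f\|,$$

we have

$$|L(f)| \le \|f\| \cdot \sum_{\rho, p, q} |\gamma^{\rho}_{pq}| .$$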

P.P.a 

Thus L(f) is bounded and has a continuous extension onto C. Furthermore 
every non-negative element of C is a uniform limit of squares of elements of Co . 
Hence we obtain 

Lemma. For any special $L(f)$, property (3) implies property (2).

We will next consider a non-negative almost periodic function $\Lambda(t)$ and its Fourier coefficients

$$\lambda^{\rho}_{pq} = M_t \{ \Lambda(t)\, \varphi^{\rho}_{pq}(t) \} .$$

If the function $\Lambda(t)$ is a class function then

$$\lambda^{\rho}_{pq} = \lambda^{\rho} \delta_{pq} .$$

In this case, $\Lambda(t)$ is called a weight-function, and, if only a finite number of the values $\lambda^{\rho}$ is $\ne 0$, a special weight function.

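Take any element (4) of $C_0$ which can be put in the form $f = |g|^2$, $g \in C_0$, and introduce the family of elements

$$f_y = f(xy^{-1}).$$

Since $f_y = |g(xy^{-1})|^2$, and

$$\varphi^{\rho}_{pq}(xy^{-1}) = \sum_{r=1}^{h} \varphi^{\rho}_{pr}(x)\, \overline{\varphi^{\rho}_{qr}(y)},$$

we see that (3) implies $L(f_y) \ge 0$ and therefore $M_t \{ \Lambda(t) L(f_t) \} \ge 0$, the latter inequality being explicitly

$$\sum_{\rho, p, q} a^{\rho}_{pq}\, \lambda^{\rho}\, \gamma^{\rho}_{pq} \ge 0 . \tag{6}$$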




Thus given the functional $L(f)$ and the weight function $\Lambda(t)$ there exists a functional $\Lambda L$ such that

$$\Lambda L(\varphi^{\rho}_{pq}) = \lambda^{\rho}\, \gamma^{\rho}_{pq}$$

and, what is decisive, if $L$ has property (3) then so does $\Lambda L$. Now if $\Lambda(t)$ is a special weight function then $\Lambda L$ is a special functional, and by our lemma, (6) holds for any $f \ge 0$ belonging to $C_0$.

Now let $f$ be any fixed non-negative element from $C_0$, and consider a sequence of weight functions, then

$$\sum_{\rho, p, q} a^{\rho}_{pq}\, \lambda^{\rho}_{(n)}\, \gamma^{\rho}_{pq} \ge 0,$$

where $\rho$ varies over a fixed finite index set, say $\rho = 1, \cdots, m$, which is independent of $n$, and $n = 1, 2, \cdots$. However, given any finite index set $\{\rho\}$ there exists a sequence of special weight functions such that for each $\rho$ from the set, $\lim_{n \to \infty} \lambda^{\rho}_{(n)} = 1$.${}^3$ Hence we obtain

$$\sum_{\rho, p, q} a^{\rho}_{pq}\, \gamma^{\rho}_{pq} \ge 0$$

for each non-negative element of $C_0$, and this completes the proof of (2).
Princeton University 

3 S. Bochner and J. von Neumann, Almost periodic functions in groups, Trans. Amer. Math. Soc., 37, pp. 21-60 (1935), especially pp. 37-40.



Annals of Mathematics
Vol. 43, No. 1, January, 1942 


ON THE UNIFORM DISTRIBUTION OF THE ROOTS 
OF CERTAIN POLYNOMIALS 


By P. Erdös
(Received July 21, 1941) 


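Let

$$\begin{matrix} x_1^{(1)} & & & \\ x_1^{(2)} & x_2^{(2)} & & \\ \cdots & & & \\ x_1^{(n)} & x_2^{(n)} & \cdots & x_n^{(n)} \\ \cdots & & & \end{matrix}$$

be a triangular matrix, where, for each $n$,

$$1 \ge x_1^{(n)} > x_2^{(n)} > \cdots > x_n^{(n)} \ge -1 .$$

Since $x_i^{(n)}$ may be written in the form $x_i^{(n)} = \cos(\vartheta_i^{(n)})$, where $0 \le \vartheta_i^{(n)} \le \pi$, we may define another triangular matrix

$$\begin{matrix} \vartheta_1^{(1)} & & & \\ \vartheta_1^{(2)} & \vartheta_2^{(2)} & & \\ \cdots & & & \\ \vartheta_1^{(n)} & \vartheta_2^{(n)} & \cdots & \vartheta_n^{(n)} \\ \cdots & & & \end{matrix}$$

with

$$0 \le \vartheta_1^{(n)} < \vartheta_2^{(n)} < \cdots < \vartheta_n^{(n)} \le \pi .$$

Put $\omega_n(x) = \prod_{i=1}^{n} (x - x_i)$.${}^1$ Suppose $0 \le A < B \le \pi$. We denote by $N_n(A, B)$ the number of the $\vartheta_i$ in $(A, B)$. Let $-1 \le a < b \le 1$. Then we denote by $M_n(a, b)$ the number of the $x_i$ in $(a, b)$. It does not matter whether the intervals $(A, B)$ and $(a, b)$ are open or closed.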

In a previous paper${}^2$ Turán and the author proved that if

$$|\omega_n(x)| < \frac{f(n)}{2^n}$$

then

$$N_n(A, B) = \frac{B - A}{\pi}\, n + O\bigl( n^{1/2} (\log f(n))^{1/2} \bigr).$$
7T 


1 We omit the upper index $n$ where there is no danger of confusion.

2 On the uniformly dense distribution of certain sequences of points, Annals of Math. Vol. 41 (1940), pp. 162-173.




In another paper${}^3$ we proved that if $|l_k^{(n)}(x)| < c_1$ then

Nn(A, B) = - n + 0[{B - A)n]‘ + \ 

r 

($l_k^{(n)}(x)$ denotes the fundamental polynomials, i.e. $l_k^{(n)}(x) = \omega(x) / [\omega'(x_k)(x - x_k)]$ is of degree $n - 1$, and $l_k(x_k) = 1$, $l_k(x_i) = 0$, $i \ne k$.)

In the present paper we are going to improve these results. First we prove

Theorem 1. Put $x_0 = -1$, $x_{n+1} = 1$, and let

$$\max_{-1 \le x \le 1} |\omega_n(x)| < \frac{c_2}{2^n} \quad \text{and} \quad \max_{x_k \le x \le x_{k+1}} |\omega_n(x)| > \frac{c_3}{2^n}, \qquad k = 0, 1, \cdots n. \tag{1}$$

Then

$$N_n(A, B) = \frac{B - A}{\pi}\, n + O[\log n(B - A)].$$

This result is the best possible.
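A standard example may make the counting function $N_n(A, B)$ concrete. The sketch below is an illustration added here, not taken from the paper: it uses the roots of the Chebyshev polynomial $T_n$, for which $\vartheta_k = (2k - 1)\pi / 2n$ are equally spaced and $\omega_n = T_n / 2^{n-1}$, so that every maximum of $|\omega_n|$ between consecutive points equals $2 / 2^n$. The function names are hypothetical.

```python
import math

def chebyshev_angles(n):
    """Angles theta_k = (2k - 1) pi / (2n), k = 1..n, of the roots cos(theta_k) of T_n."""
    return [(2 * k - 1) * math.pi / (2 * n) for k in range(1, n + 1)]

def count_in(angles, A, B):
    """N_n(A, B): the number of angles lying in the interval [A, B]."""
    return sum(1 for t in angles if A <= t <= B)

def discrepancy(n, A, B):
    """N_n(A, B) minus the main term (B - A) n / pi."""
    return count_in(chebyshev_angles(n), A, B) - (B - A) / math.pi * n
```

Because the angles form an arithmetic progression with step $\pi / n$, the discrepancy stays bounded by 1 for every $n$ and every interval, so for this particular system the error term is far smaller than the general bound of Theorem 1 permits.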

Next we prove

Theorem 2. Let $|l_k(x)| < c_4$; then

$$N_n(A, B) = \frac{B - A}{\pi}\, n + O[(\log n)(\log n(B - A))];$$

if $|l_k(x)| < n^{c_5}$ then

$$N_n(A, B) = \frac{B - A}{\pi}\, n + O[(\log n)^2].$$

Theorem 2 is also the best possible. Theorems 1 and 2 can be generalized to
Theorem 3. Let o){x) be such that 


a ' W '< max 


2 n 


*^<*^**+i 


2 n 


k =s 0, 1, 2 • • • ft; 


then

$$N_n(A, B) = \frac{B - A}{\pi}\, n + O[(\log n)(\log f(n))].$$

Similarly, if $|l_k(x)| < c_8 f(n)$ then

$$N_n(A, B) = \frac{B - A}{\pi}\, n + O[(\log n)(\log n f(n))].$$


To prove Theorem 1 we first have to prove two lemmas.

Lemma 1. Suppose that (1) holds; then

$$\frac{c_9}{n} < \vartheta_{k+1} - \vartheta_k < \frac{c_{10}}{n}, \qquad k = 0, 1, \dots, n. \tag{2}$$

3 On interpolation III, ibid., pp. 510-553.


Proof. A theorem of M. Riesz states that if $h(x)$ is a polynomial of degree $n$
which assumes its absolute maximum in $(-1, 1)$ at the point $x_0$, and if
$y_1, \dots, y_r$ are the roots of $h(y) = 0$ in the interval $(-1, 1)$, then $|\theta_i - \theta_0| \ge \pi/2n$,
where $\cos\theta_i = y_i$ and $\cos\theta_0 = x_0$. Thus if $x_0$ lies between the roots $y_i$
and $y_{i+1}$, then $\theta_{i+1} - \theta_i \ge \pi/n$. Also if $\max_{y_i \le x \le y_{i+1}} |h(x)|$ assumes its smallest
value for $i = k$ then

$$\theta_{k+1} - \theta_k \le \pi/n.$$

Suppose that (2) does not hold, for example assume that

$$\vartheta_{k+1} - \vartheta_k > r(n)/n,$$

where $\lim r(n) = \infty$. Take $\epsilon > 0$, and define $u$ and $v$ by the relations: $u$ and $v$
are symmetric with respect to $(x_k + x_{k+1})/2$, and $\arccos u - \arccos v = \pi/n + \epsilon$.
Consider the polynomial $\varphi(x) = \omega(x)(x - u)(x - v)/(x - x_k)(x - x_{k+1})$.
It can be seen that if $u \le x \le v$ then

$$\left| \frac{(x - u)(x - v)}{(x - x_k)(x - x_{k+1})} \right| < c_{11}/r(n);$$

hence

$$\max_{u \le x \le v} |\varphi(x)| < (c_{11}/r(n)) \max_{x_k \le x \le x_{k+1}} |\omega_n(x)|. \tag{3}$$

Also, since the product of two quantities whose sum is fixed increases as they tend
to equality, we have, in the intervals $(-1, x_k)$ and $(x_{k+1}, 1)$,

$$|\varphi(x)| > |\omega(x)|. \tag{4}$$

We have $\arccos v - \arccos u > \pi/n$; and a simple calculation shows that,
if $r(n)$ is large enough, $\vartheta_{k+1} - \arccos v > \pi/n$ and $\arccos u - \vartheta_k > \pi/n$; thus
it follows from the lemma of M. Riesz (applied to $\varphi(x)$) that $\max |\varphi(x)|$ between
two consecutive roots of $\varphi(x)$ assumes its smallest value between the roots $x_i$
and $x_{i+1}$, where either $i \le k - 2$ or $i \ge k + 2$. Thus, from (3) and (4),

$$\max_{x_k \le x \le x_{k+1}} |\omega(x)| > \frac{r(n)}{c_{11}} \cdot \min_{j = 0, 1, \dots, n}\ \max_{x_j \le x \le x_{j+1}} |\omega(x)|.$$

This contradicts (1), which completes the proof. By the same argument we
could prove the other inequality in (2).

Corollary. We obtain from Lemma 1, by a simple computation, that

$$\frac{c_{12}}{n} (1 - x_k^2)^{1/2} < x_{k+1} - x_k < \frac{c_{13}}{n} (1 - x_k^2)^{1/2} \qquad (k = 1, 2, \dots, n - 1).$$

Lemma 2. Suppose that (1) holds; then for $-1 \le x \le 1$,

$$|l_k(x)| < c_{14}\, \frac{(1 - x_k^2)^{1/2}}{(x - x_k)\, n}.$$


Proof. We have $l_k(x) = \omega(x)/[\omega'(x_k)(x - x_k)]$; thus by (1) it suffices to
show that

$$|\omega'(x_k)| > c_{15}\, \frac{n}{2^n (1 - x_k^2)^{1/2}}.$$

Consider the polynomial $\psi(x) = \omega(x)/(x - x_k)$. It is clear that either $|\omega'(x_k)| = |\psi(x_k)| \ge |\psi(y)|$ if $x_{k-1} \le y \le x_k$, or $|\omega'(x_k)| = |\psi(x_k)| \ge |\psi(y)|$ if $x_k \le y \le x_{k+1}$.
Without loss of generality we can assume that the first inequality holds. Then
by (1) and the corollary to Lemma 1 we have

$$|\omega'(x_k)| \ge \frac{\max_{x_{k-1} \le x \le x_k} |\omega(x)|}{x_k - x_{k-1}} > c_{15}\, \frac{n}{2^n (1 - x_k^2)^{1/2}},$$

which completes the proof.

Now we can prove Theorem 1. To simplify the calculations we assume that
$a = 0$, $b = 1$. Then we have to show that, assuming (1),

$$\frac{n}{2} - c_{16} \log n < M_n(0, 1) < \frac{n}{2} + c_{17} \log n.$$

It will be sufficient to prove the first inequality. Suppose that it does not
hold; then

$$M_n(0, 1) < \frac{n}{2} - r(n) \log n, \qquad \lim r(n) = \infty.$$


Consider the polynomial g(x) whose roots are defined as follows: In the interval 
(— 1, log n/n), g(x) has the same roots as T n -i(x)(T n (x) denotes the n th Tchebi- 
cheff polynomial); at the points (f) r log n/n, r = 1,2, • • • s where s is such that 


g(x) has a root of multiplicity 



; and finally g(x) vanishes at the roots of 


w(x) in the interval (0, 1). Clearly the degree of g(x) does not exceed 


| + log n + 3 r(n) + | - r(n) log n < n - 1 


if r(n ) > 10. Thus, by the lemma of M. Riesz, g(x) assumes its absolute 
maximum in the interval (log n/n, 1). Suppose that it assumes its absolute 
maximum at x 0 , log n/n ^ z 0 |1. We have for some r 

(!)'¥<*•<(!)"¥• 

(If (f) r+1 log n/n > 1, we replace it by 1.) Put (|) r log n/n = q; we consider 
the polynomial 





By the Lagrange interpolation formula we evidently have 


Qi{x) = ]£ Qi(xk)lk(x) 

Jk-l 


where the x* are the roots of co(x). Thus 
(5) 


0l(*o) = 2Z gi(xk)lk(xo). 

it-1 


Now gi(x k ) = 0 for 0 ^ x ^ 1; and since x 0 was the place where g(x) takes its 
absolute maximum, we have 

gi(xo) ^ [2 it + l)] p 0i(x) 

if x satisfies 


(6) + »($-. 

(6) may be verified by noting that 


t = 0,1,2 ••• 


0l(*o) = 


g(x o) 


g(x) 


(x 0 - q) v (x 0 - q) p 
Hence from (5) and (6), by putting 


= gi(x) 


u t , 



we obtain 


* E 

t 


M n (u t , u,+i) max | h(x 0 ) 

_ Ut^X k ^X h +1 


2 (t + l) p 


= 


where in t is restricted by u t 5: — Now by the corollary to Lemma 1, 
and Lemma 2. 


< X Ci»n(«(+1 M|)ci9 - -7-——rrr~ < Cjo 2 ") fTTT:—j—rr-,- 

igo n ( +i [2(< + 1)]” «To [2(< + l)] p 


for sufficiently large p. 

For the Xk in we clearly have Xk < — Thus by lemma 2. 


Es < max | h(x 0 ) | < i 

for sufficiently large p. Thus 23» + E* < 1» and this contradiction establishes 
the proof. 

In the proof we did not use the full strength of Lemma 2; in fact we only used
$|l_k(x)| < c_{21}/[(x - x_k)\, n]$. We would have had to use the sharper estimate if
we had not restricted ourselves to the interval $(0, 1)$ but had considered a
"small" interval near $-1$ or $+1$.

Now we have to prove that the error term in Theorem 1 is the best possible. 
Put 




kir 

n 


k 1 
+ z , 


-1 1 



where k and l take all positive integral values such that #* < ir — n“ 2 , and 
t h > n~ 2 it is easy to see that the number of the #’s is n + 0(1). Consider the 
polynomial $\omega(x)$ whose roots are the $\cos\vartheta$'s. It can be shown by elementary
computations that $\omega(x)$ satisfies (1). We do not give the details. On the other
hand it is easy to see that

$$M_n(0, 1) < \frac{n}{2} - c_{23} \log n,$$

which shows that the error term in Theorem 1 is the best possible.

The proof of Theorem 2 is very similar to that of Theorem 1. The difference 

is that, in defiiiing g(x), g(x) now has roots of order |^r(n)^g^J ^ tkg points 

(f ) r log n/n. The proof of Theorem 3 also runs along the same lines. 


University of Pennsylvania 




Annals of Mathematics 
Vol. 43, No. 1, January, 1942 


ON THE ASYMPTOTIC DENSITY OF THE SUM OF TWO SEQUENCES 

By P. Erdős
(Received June 24, 1941) 

Let $a_1 < a_2 < \cdots$ be an infinite sequence, $A$, of positive integers. Denote
the number of $a$'s not exceeding $n$ by $f(n)$. Schnirelmann has defined the
density of $A$ as G.L.B. $f(n)/n$.¹ Now let $a_1 < a_2 < \cdots$; $b_1 < b_2 < \cdots$ be two sequences.
We define the sum $A + B$ of these two sequences as the set of integers
of the form $a_i$ or $b_j$ or $\{a_i + b_j\}$. Schnirelmann proved that if the density of $A$
is $\alpha$ and that of $B$ is $\beta$ then the density of $A + B$ is $\ge \alpha + \beta - \alpha\beta$.

Khintchine² proved that, provided that $\alpha = \beta \le \tfrac12$, the density of $A + B$
is $\ge 2\alpha$. He conjectured more generally that if $\alpha + \beta \le 1$ the density of $A + B$
is $\ge \alpha + \beta$. It is easy to see that if $\alpha + \beta \ge 1$ then every integer is in $A + B$,
so the density of $A + B$ is 1. Khintchine's conjecture seems very deep.

Besicovitch³ defined $\beta' = $ G.L.B. $\varphi(n)/(n + 1)$ where $\varphi(n)$ denotes the number
of the $b$'s not exceeding $n$, and proved that the Schnirelmann density of the
sequence of numbers $\{a_i, a_i + b_j\}$ is $\ge \alpha + \beta'$. An example of Rado showed
that this result is the best possible.

Define the asymptotic density of $A$ as $\liminf f(n)/n$. Then if $\alpha \le \tfrac12$ and $a_1 = 1$
I have proved that the asymptotic density of $A + B$ is $\ge \tfrac32\alpha$.⁴ The following
simple example of Heilbronn shows that this result is the best possible: Let the
$a$'s be the integers $\equiv 0, 1 \pmod 4$. Then $A + A$ contains the integers $\equiv 0, 1, 2 \pmod 4$.
In the present note we prove the following

Theorem: Let the asymptotic density of $A$ be $\alpha$ and that of $B$ be $\beta$, where
$\alpha + \beta \le 1$, $\beta \le \alpha$, $b_1 = 1$. Then the asymptotic density of $A + B$ is not less
than $\alpha + \tfrac12\beta$ and, in fact, one of the sequences $\{a_i, a_i + 1\}$ or $\{a_i + b_j\}$ has
asymptotic density $\ge \alpha + \tfrac12\beta$.
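Heilbronn's example can be checked by direct enumeration. The sketch below is only an illustration: it counts up to a finite bound, which approximates the asymptotic density rather than computing it, and the function names are hypothetical. With $A = B$ the integers $\equiv 0, 1 \pmod 4$ we have $\alpha = \beta = \tfrac12$, and the bound $\alpha + \tfrac12\beta = \tfrac34$ of the theorem is attained.

```python
def sum_set(A, B, limit):
    """Integers <= limit of the form a, b, or a + b (the definition of A + B used in the text)."""
    result = {a for a in A if a <= limit} | {b for b in B if b <= limit}
    for a in A:
        for b in B:
            if a + b <= limit:
                result.add(a + b)
    return result

def counting_ratio(S, limit):
    """Proportion of the integers 1..limit that lie in S (a finite stand-in for density)."""
    return sum(1 for s in S if 1 <= s <= limit) / limit

def heilbronn_sequence(limit):
    """The integers <= limit congruent to 0 or 1 modulo 4."""
    return [n for n in range(1, limit + 1) if n % 4 in (0, 1)]

if __name__ == "__main__":
    limit = 400
    A = heilbronn_sequence(limit)
    S = sum_set(A, A, limit)
    print(sorted({s % 4 for s in S}), counting_ratio(set(A), limit), counting_ratio(S, limit))
```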

It is easy to see that if $\alpha + \beta > 1$ then all large integers are in $A + B$. For
if not, then none of the integers $n - a_i$ belong to $B$, and the asymptotic density
of $B$ would be not greater than $1 - \alpha < \beta$.

To prove our theorem we first need a slight sharpening of the theorem of 
Besicovitch; in fact, we prove the following 

Lemma: Define the modified density of B as follows: 

$$\beta_1 = \mathop{\mathrm{G.L.B.}}_{n > k} \frac{\varphi(n)}{n + 1}, \tag{1}$$

where the integers $1, 2, \dots, k$ belong to $B$, but $k + 1$ does not belong to $B$. Clearly
$\beta_1 \ge \beta'$. Then the Schnirelmann density of the sequence $\{a_i, a_i + b_j\}$ is not less
than $\alpha + \beta_1$.

1 Schnirelmann, Über additive Eigenschaften der Zahlen, Math. Annalen 107 (1933), pp.
649-690.

2 Khintchine, Zur additiven Zahlentheorie, Recueil math. de la soc. Moscow 39 (1932),
pp. 27-34.

3 Besicovitch, On the density of the sum of two sequences of integers, Journ. of the London
math. soc. 10 (1936), pp. 246-248.

4 Erdős, On the asymptotic density of the sum of two sequences one of which forms a basis
for the integers. II., Travaux de l'institut math. de Tbilissi 3 (1938), pp. 217-223.

The proof of this lemma follows closely the proof of Besicovitch. Denote by
$f(u, v)$, $\varphi(u, v)$, $\psi(u, v)$ respectively the number of $a$'s, $b$'s, and terms of the
sequence $\{a_i, a_i + b_j\}$ in the interval $(u, v)$, that is, among the integers $u + 1$,
$u + 2, \dots, v$. We first observe that if $r + 1$ is any integer which does not
belong to the sequence $\{a_i, a_i + b_j\}$ then

$$f(u, v) + \varphi(r - v, r - u) \le v - u. \tag{2}$$

For as $t$ runs through $(u, v)$, $r + 1 - t$ runs through $(r - v, r - u)$, and if
$t$ belongs to $A$ then $r + 1 - t$ does not belong to $B$.
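Inequality (2) is easy to test by brute force on small sets. The following sketch is an illustration only, with hypothetical function names and randomly generated sets standing in for $A$ and $B$; it checks the inequality for every admissible $r$ and every $0 \le u \le v \le r$ below a small bound.

```python
import random

def count(S, u, v):
    """Number of elements of the set S among the integers u + 1, ..., v."""
    return sum(1 for t in range(u + 1, v + 1) if t in S)

def check_inequality_2(A, B, limit):
    """Check f(u, v) + phi(r - v, r - u) <= v - u whenever r + 1 is not in {a_i, a_i + b_j}."""
    covered = set(A) | {a + b for a in A for b in B}
    for r in range(limit):
        if r + 1 in covered:
            continue  # (2) is only claimed when r + 1 is missing from the sequence
        for u in range(r + 1):
            for v in range(u, r + 1):
                if count(A, u, v) + count(B, r - v, r - u) > v - u:
                    return False
    return True

if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so the run is reproducible
    A = {n for n in range(1, 31) if rng.random() < 0.3}
    B = {n for n in range(1, 31) if rng.random() < 0.3}
    print(check_inequality_2(A, B, 30))
```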

We may assume that the Schnirelmann density of the sequence $\{a_i, a_i + b_j\}$
is less than 1, and that $\alpha > 0$, so that $a_1 = 1$. Define $m_0 = 0$, define $r_0 + 1$
as the least positive integer not belonging to $\{a_i, a_i + b_j\}$, define $m_1 + 1$ as the
least integer greater than $r_0$ belonging to $A$, define $r_1 + 1$ as the least integer
greater than $m_1$ not belonging to $\{a_i, a_i + b_j\}$, and so on.

It suffices to prove that for each $x$ in $(r_{i-1}, m_i)$ we have

$$\psi(0, x) \ge (\alpha + \beta_1)x, \tag{3}$$

for if (3) holds, suppose that for some $y$ in $(m_j, r_j)$ we had

$$\psi(0, y) < (\alpha + \beta_1)y.$$

(We may suppose $j > 0$; else $y \le r_0$, so that $\psi(0, y) = y$.) Then since all the
integers $m_j + 1, \dots, y$ belong to $\{a_i, a_i + b_j\}$ and $\alpha + \beta_1 \le 1$ we should have

$$\psi(0, m_j) < (\alpha + \beta_1)m_j,$$

which contradicts (3).

It follows from the definition of $k$ and the definition of $m_i$ and $r_i$ that

$$r_i - m_i > k \qquad (i = 0, 1, 2, \dots). \tag{4}$$

Let $r_{i-1} < x \le m_i$; we have

$$\psi(r_{i-1}, x) \ge \varphi(r_{i-1} - m_{i-1} - 1,\ x - m_{i-1} - 1), \tag{5}$$

since any number $m_{i-1} + 1 + w$, where $w$ belongs to $B$, is in $\{a_i, a_i + b_j\}$. Also

$$\psi(m_{i-1}, r_{i-1}) = r_{i-1} - m_{i-1} \ge f(m_{i-1}, r_{i-1}) + \varphi(0, r_{i-1} - m_{i-1}) \tag{6}$$

by (2). Clearly by the definition of the numbers $r_j$, $m_j$ we have for $r_{i-1} <
x \le m_i$, $f(m_{i-1}, x) = f(m_{i-1}, r_{i-1})$. Hence by adding (5) and (6)

$$\psi(m_{i-1}, x) \ge f(m_{i-1}, x) + \varphi(0, x - m_{i-1} - 1) \ge f(m_{i-1}, x) + \beta_1(x - m_{i-1}), \tag{7}$$

since by (4) $x - m_{i-1} - 1 \ge r_{i-1} - m_{i-1} > k$. In particular

$$\psi(m_j, m_{j+1}) \ge f(m_j, m_{j+1}) + \beta_1(m_{j+1} - m_j) \qquad (j = 0, 1, \dots). \tag{8}$$

Summing (8) for $j = 0, 1, \dots, i - 1$ and adding (7) we have

$$\psi(0, x) \ge f(0, x) + \beta_1 x \ge (\alpha + \beta_1)x,$$

which completes the proof of the Lemma.

Now we can prove our theorem. We may assume $\beta > 0$. Suppose first that
there exists an $x$ belonging to $A$, such that the modified density of (the positive
terms of) $a_i - x$ is $\ge \alpha - \tfrac12\beta$. Clearly $x + 1$ has to be in $A$ since $\alpha - \tfrac12\beta > 0$.
It follows that there exists for every positive real $\epsilon$ a $y$ such that the Schnirelmann
density of the positive terms of the sequence $\{b_j - y\}$ is $\ge \beta - \epsilon$. To see
this choose $y$ to be the greatest integer with

$$\frac{\varphi(y)}{y} \le \beta - \epsilon.$$

(Since $\liminf \varphi(y)/y = \beta$ such a $y$ exists, unless $\varphi(y)/y > \beta - \epsilon$ for all positive $y$;
in this case we have $y = 0$.) Then by the definition of $y$ it is clear that $\varphi(y, z)$,
i.e. the number of $\{b_j - y\}$'s in $(0, z - y)$, is not less than $(\beta - \epsilon)(z - y)$, which
proves our assertion.

Now consider the sequence $\{b_j - y,\ b_j - y + a_i - x\}$. By our lemma its
Schnirelmann density is $\ge \alpha + \tfrac12\beta - \epsilon$; hence by adding $x + y$ to its members
we obtain the sequence $\{b_j + x,\ a_i + b_j\}$ whose asymptotic density is clearly
$\ge \alpha + \tfrac12\beta - \epsilon$ for every $\epsilon > 0$. But since $x$ is in $A$, $b_j + x$ is in $\{a_i + b_j\}$.
Hence the asymptotic density of the sequence $\{a_i + b_j\}$ is $\ge \alpha + \tfrac12\beta$, which
proves our theorem in the first case.

Suppose next that Case 1 is not satisfied. We may suppose that there exist
arbitrarily large values of $i$ such that $a_i$ and $a_i + 1$ are both in $A$; otherwise
$\{a_i, a_i + 1\}$ has asymptotic density $2\alpha > \alpha + \tfrac12\beta$. Let $a_{k_1}$ be the first $a_i$ such
that $a_{k_1} + 1$ is also in $A$. Then since Case 1 is not satisfied and since $\alpha =
\liminf f(n)/n$, there exists a largest integer $m_1$ such that $f(a_{k_1}, m_1) <
(\alpha - \tfrac12\beta)(m_1 - a_{k_1} + 1)$. Again let $a_{k_2}$ be the least $a_i$ greater than $m_1$ such
that $a_{k_2} + 1$ is also in $A$; there exists as before a largest $m_2$ such that $f(a_{k_2}, m_2) <
(\alpha - \tfrac12\beta)(m_2 - a_{k_2} + 1)$ and so on. Take $n$ large and let $m_r$ be the least $m_i \ge n$.
It is clear that the intervals $(a_{k_i} - 1, m_i)$, $i = 1, 2, \dots, r$ do not overlap; thus

$$\sum_{i=1}^{r} f(a_{k_i}, m_i) < (\alpha - \tfrac12\beta)\, m_r.$$

Now since the asymptotic density of $A$ is $\alpha$, we have $f(0, m_r) > (\alpha - \epsilon)m_r$,
if $n$ is large enough, and therefore the number of $a_i$'s in $(0, n)$ outside the intervals
$(a_{k_i}, m_i)$, $i = 1, 2, \dots, r$ is not less than

$$(\tfrac12\beta - \epsilon)\, m_r \ge (\tfrac12\beta - \epsilon)\, n.$$

But for all these $a_i$'s with the exception of $a_{k_1}, a_{k_2}, \dots, a_{k_r}$, $a_i + 1$ is not in $A$.






Moreover, the intervals $(a_{k_i}, m_i)$ do not contain only $a$'s; else, whenever $p > a_{k_i}$
is such that $(a_{k_i}, p)$ does contain integers not in $A$, we have $p > m_i$. Therefore
$f(a_{k_i}, p) \ge (\alpha - \tfrac12\beta)(p - a_{k_i} + 1)$ (by definition of $m_i$); so that the modified
density of the positive terms of $\{a_j - a_{k_i}\}$ $(j = 1, 2, \dots)$ is $\ge \alpha - \tfrac12\beta$, and we
are in Case 1. Thus each of the intervals $(a_{k_i}, m_i)$ has to contain an $x$ which
is in $A$, such that $x + 1$ is not in $A$. Hence, finally, the number of integers $\le n$
of the form $a_i + 1$ which are not in $A$ is $\ge (\tfrac12\beta - \epsilon)(n - 1)$. Hence the number
of integers $\le n$ of the form $\{a_i, a_i + 1\}$ is not less than $(\alpha + \tfrac12\beta - \epsilon)n - 1$,
if $n$ is large enough, which completes the proof of our theorem.

University of Pennsylvania 



Annals of Mathematics 
Vol. 43, No. 1, January, 1942


SOME NEW SUMMABILITY METHODS WITH APPLICATIONS 

By Otto Szász¹


(Received November 4, 1941) 

1. Given an infinite sequence of functions

$$\varphi_0(x),\ \varphi_1(x),\ \dots,\ \varphi_\nu(x),\ \dots \tag{1.1}$$

defined on a finite or infinite range $R$ of the real or complex variable $x$ with the
limit point $\xi$; the range $R$ may be discrete or continuous, but must contain
infinitely many points. The series-to-function transform

$$\Phi(x) = \sum_{\nu=0}^{\infty} u_\nu \varphi_\nu(x) \tag{1.2}$$

defines, under certain assumptions for (1.1), a summability method, and
$\lim_{x \to \xi} \Phi(x)$, if it exists, is called the generalized sum of the series $\sum_0^\infty u_\nu$. If it
is true that for every convergent series $\sum u_\nu$, $\Phi(x)$ exists in $R$ and $\lim \Phi(x) =
\sum u_\nu$, then the method is called regular. The necessary and sufficient conditions
for regularity are:

$$\lim_{x \to \xi} \varphi_\nu(x) = 1, \quad\text{for } \nu = 0, 1, 2, \dots \tag{1.3}$$

$$\sum_{\nu=0}^{\infty} |\varphi_\nu(x) - \varphi_{\nu+1}(x)| \quad\text{uniformly bounded on } R. \tag{1.4}$$

We then say that (1.2) is a regular transform; the summability method defined by

$$\lim_{x \to \xi} \Phi(x) = s \tag{1.5}$$

is called the method of convergence factors (according to C. N. Moore).² Corresponding
to this method we obtain new methods in the following way:

We select a sequence of values $x = x_n \to \xi$, and associate with (1.2) a series-to-sequence
transform

$$A_n(x_n) = A_n = \sum_{\nu=0}^{n} u_\nu \varphi_\nu(x_n), \qquad n = 0, 1, 2, \dots; \tag{1.6}$$

its matrix is of triangular type. The summability method defined by


1 Presented to the American Mathematical Society, September 2,1941. 

2 For a comprehensive discussion of the general regular transforms cf. Szász [8], Agnew
[1]; particular methods of the type (1.5) have been discussed by Perron [3]. Numbers in
brackets refer to the literature listed at the end of this paper.


$$\lim_{n \to \infty} A_n = s \tag{1.7}$$

is regular (by the theorem of Silverman and Toeplitz) if and only if:

$$\lim_{n \to \infty} \varphi_\nu(x_n) = 1 \quad\text{for } \nu = 0, 1, 2, \dots, \tag{1.8}$$

$$\sum_{\nu=0}^{n-1} |\varphi_\nu(x_n) - \varphi_{\nu+1}(x_n)| + |\varphi_n(x_n)| \quad\text{is uniformly bounded in } n. \tag{1.9}$$

It is obvious from our regularity conditions that the method (1.7) is regular
whenever (1.5) is, but the converse is not true in general.

We can apply the same generating process to a sequence-to-function transform:

$$\Psi(x) = \sum_{\nu=0}^{\infty} s_\nu \psi_\nu(x), \tag{1.10}$$

where $\{\psi_\nu(x)\}$ is a suitably given sequence of functions. From (1.10) we get the
summability method:

$$\lim_{x \to \xi} \Psi(x) = s = \text{gen. lim. } s_n. \tag{1.11}$$

This method is regular (i.e. $\lim s_n = s$ implies (1.11)) if and only if:

$$\lim_{x \to \xi} \psi_\nu(x) = 0, \quad\text{for } \nu = 0, 1, 2, \dots, \tag{1.12}$$

$$\sum_{\nu=0}^{\infty} |\psi_\nu(x)| \quad\text{uniformly bounded on } R, \tag{1.13}$$

$$\sum_{\nu=0}^{\infty} \psi_\nu(x) \to 1 \quad\text{as } x \to \xi. \tag{1.14}$$

We now associate with (1.10) and with a sequence of values $x = x_n \to \xi$
a triangular-type sequence-to-sequence transform:

$$B_n(x_n) = B_n = \sum_{\nu=0}^{n} s_\nu \psi_\nu(x_n), \qquad n = 0, 1, 2, \dots. \tag{1.15}$$

The summability method

$$\lim B_n = s = \text{gen. lim } s_n \tag{1.16}$$

is regular if and only if:

$$\lim_{n \to \infty} \psi_\nu(x_n) = 0, \quad\text{for } \nu = 0, 1, 2, \dots, \tag{1.17}$$

$$\sum_{\nu=0}^{n} |\psi_\nu(x_n)| \quad\text{uniformly bounded in } n, \tag{1.18}$$

$$\lim_{n \to \infty} \sum_{\nu=0}^{n} \psi_\nu(x_n) = 1. \tag{1.19}$$

It is seen easily that regularity of the method (1.11) does not imply regularity
of the method (1.15), and conversely.

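Note that $A_n$ can also be written as a sequence-to-sequence transform:

$$A_n = \sum_{\nu=0}^{n-1} s_\nu \{\varphi_\nu(x_n) - \varphi_{\nu+1}(x_n)\} + s_n \varphi_n(x_n),$$

where

$$s_n = \sum_{\nu=0}^{n} u_\nu, \qquad n = 0, 1, 2, \dots.$$

A generalization of (1.6) is

$$A_m(x_n) = \sum_{\nu=0}^{m} u_\nu \varphi_\nu(x_n), \quad\text{where } m = m(n) \to \infty,$$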

and the analogous sequence-to-f unction transform 

/^(x) = v> v ^p v {x), 

^w(l) 

These generalizations will not be discussed in the present paper. 

2. We consider first the important case of Abel-summability (generalized by
Stolz to complex values of $x$); it can be written either as a convergence-factor
method:

$$\lim_{x \to 1} \sum_{\nu=0}^{\infty} u_\nu x^\nu = s,$$

or as a sequence-to-function transform:

$$\lim_{x \to 1} \sum_{\nu=0}^{\infty} s_\nu x^\nu (1 - x) = s.$$

Accordingly:

$$\varphi_n(x) = x^n, \qquad n = 0, 1, 2, \dots, \tag{2.1}$$

and

$$\psi_n(x) = (1 - x)x^n, \qquad n = 0, 1, 2, \dots. \tag{2.2}$$

The method is known to be regular if and only if $x$ approaches 1 along a path
inside and non-tangential to the unit circle, i.e.

$$|x| < 1 \quad\text{and}\quad 1 - x = O(1 - |x|) \quad\text{as } x \to 1.$$

Writing $x = \rho e^{i\theta}$ the latter condition becomes $\theta = O(1 - \rho)$.

Now the associated series-to-sequence transform is:

$$A_n = \sum_{\nu=0}^{n} u_\nu x_n^\nu = \sum_{\nu=0}^{n-1} s_\nu (x_n^\nu - x_n^{\nu+1}) + s_n x_n^n, \qquad n = 0, 1, 2, \dots. \tag{2.3}$$

The regularity conditions are (from (1.8) and (1.9)):

$$x_n \to 1 \quad\text{as } n \to \infty, \tag{2.4}$$

$$|x_n|^n + |1 - x_n|\, \frac{1 - |x_n|^n}{1 - |x_n|} = O(1) \quad\text{as } n \to \infty.^3 \tag{2.5}$$

On the other hand, using (2.2) we get the sequence-to-sequence transform

$$B_n = \sum_{\nu=0}^{n} s_\nu x_n^\nu (1 - x_n), \tag{2.6}$$

and the summability method

$$\lim_{n \to \infty} B_n = s = \text{gen. lim } s_n,$$

which is regular if and only if:

$$x_n \to 1, \quad x_n^n \to 0, \quad\text{and}\quad 1 - x_n = O(1 - |x_n|) \quad\text{as } n \to \infty. \tag{2.7}$$

Although the methods $\lim A_n$ and $\lim B_n$ both originate from Abel's method in
much the same way, they have quite different regularity conditions.

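On writing

$$x_n = \rho_n e^{i\theta_n}, \qquad -\pi < \theta_n \le \pi,$$

the conditions (2.4) and (2.5) take the form

$$\rho_n \to 1, \quad \theta_n \to 0, \quad \rho_n^n = O(1), \tag{2.8}$$

and

$$1 - 2\rho_n \cos\theta_n + \rho_n^2 = O\left( \left( \frac{1 - \rho_n}{1 - \rho_n^n} \right)^2 \right). \tag{2.9}$$

But

$$1 - 2\rho_n \cos\theta_n + \rho_n^2 = (1 - \rho_n)^2 + 4\rho_n \sin^2 \tfrac12\theta_n,$$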


and in view of (2.8) 


and. 

hence (2.9) reduces to 



as n —> °o. 


as n —> oo, 


as n —> oo. 


3 If $|x_n| = 1$, then $\dfrac{1 - |x_n|^n}{1 - |x_n|}$ is to be replaced by $n$.


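Note that

$$\frac{1 - \rho_n}{1 - \rho_n^n} = \frac{1}{1 + \rho_n + \cdots + \rho_n^{n-1}} = O(1) \quad\text{as } n \to \infty.$$

Finally, putting $\rho_n = 1 + \delta_n$ we have $\delta_n \to 0$, and $\rho_n^n = e^{n \log(1 + \delta_n)}$; but

$$\log(1 + \delta) = \delta(1 + o(1)) \quad\text{as } \delta \to 0. \tag{2.10}$$

Hence $\rho_n^n = O(1)$ if and only if $n(\rho_n - 1) < k$.⁴ Summarizing we have

Theorem 1. Necessary and sufficient conditions for the regularity of the transform
(2.3) (where $x_n = \rho_n e^{i\theta_n}$) are:

$$\rho_n \to 1, \quad \limsup_{n \to \infty}\, n(\rho_n - 1) < +\infty, \quad \theta_n = O\left( \frac{1 - \rho_n}{1 - \rho_n^n} \right), \quad\text{as } n \to \infty. \tag{2.11}$$

The last condition is certainly satisfied if $\theta_n = O(1/n)$; this condition is also a
necessary one if $\rho_n \ge 1$ for all large $n$.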
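The two forms of the transform (2.3) and its behaviour for an admissible sequence $x_n$ can be illustrated numerically. This is only a sketch under stated assumptions: the test series $\sum (-1)^\nu/(\nu+1) = \log 2$ and the real choice $x_n = 1 - 1/n$ (so that $\rho_n \to 1$, $n(\rho_n - 1) = -1$, $\theta_n = 0$) are picked for illustration, and the function names are hypothetical.

```python
import math

def transform_series_form(u, x):
    """A_n = sum_{v=0}^{n} u_v x^v for a finite list u = [u_0, ..., u_n]."""
    return sum(u_v * x ** v for v, u_v in enumerate(u))

def transform_partial_sum_form(u, x):
    """The same A_n written as sum_{v<n} s_v (x^v - x^(v+1)) + s_n x^n, s_v being partial sums."""
    n = len(u) - 1
    partial, s = [], 0.0
    for u_v in u:
        s += u_v
        partial.append(s)
    return sum(partial[v] * (x ** v - x ** (v + 1)) for v in range(n)) + partial[n] * x ** n

def alternating_harmonic_terms(n):
    """Terms u_v = (-1)^v / (v + 1), v = 0..n, of a series converging to log 2."""
    return [(-1) ** v / (v + 1) for v in range(n + 1)]

if __name__ == "__main__":
    for n in (10, 100, 2000):
        u = alternating_harmonic_terms(n)
        x_n = 1 - 1 / n
        print(n, transform_series_form(u, x_n), transform_partial_sum_form(u, x_n), math.log(2))
```

Both forms agree up to rounding, and for this choice of $x_n$ the values approach $\log 2$, as regularity requires.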

For the regularity of the method (2.6) the necessary and sufficient conditions
are (from (2.7))

$$\rho_n \to 1, \quad \rho_n^n \to 0 \quad\text{and}\quad \theta_n = O(1 - \rho_n) \quad\text{as } n \to \infty;$$

here the second condition can be replaced by (using (2.10))

$$n(1 - \rho_n) \to +\infty \quad\text{as } n \to \infty.$$

It is easily seen that the last condition of (2.11) can be replaced by $\theta_n =
O(n^{-1} + |1 - \rho_n|)$.


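3. We get a summability method related to Abel's on choosing

$$\varphi_n(x) = \Re\, x^n = \rho^n \cos n\theta, \qquad n = 0, 1, 2, \dots.$$

The regularity conditions (1.3) and (1.4) now become:

$\rho \to 1$, $\theta \to 0$ and

$$\sum_{\nu=0}^{\infty} \rho^\nu\, |\cos\nu\theta - \rho\cos(\nu + 1)\theta| \quad\text{uniformly bounded in } \rho \text{ and } \theta. \tag{3.1}$$

In particular we must have $\rho < 1$; furthermore, using the formulae

$$\cos\nu\theta - \rho\cos(\nu + 1)\theta = (1 - \rho)\cos\nu\theta + 2\rho \sin\tfrac12\theta\, \sin(\nu + \tfrac12)\theta, \tag{3.2}$$

and

$$\sum_{\nu=0}^{\infty} \rho^\nu (1 - \rho)\, |\cos\nu\theta| \le 1,$$

(3.1) is equivalent to:

$$|\theta| \sum_{\nu=0}^{\infty} \rho^\nu\, |\sin(\nu + \tfrac12)\theta| \quad\text{uniformly bounded in } \rho \text{ and } \theta.$$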
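Formula (3.2) is an elementary trigonometric identity and can be spot-checked numerically, together with the bound on $\sum \rho^\nu(1-\rho)|\cos\nu\theta|$. The sketch below is an illustration only; the function names and the sample values are hypothetical, and the infinite sum is truncated.

```python
import math

def lhs_3_2(rho, theta, v):
    """cos(v theta) - rho cos((v + 1) theta)."""
    return math.cos(v * theta) - rho * math.cos((v + 1) * theta)

def rhs_3_2(rho, theta, v):
    """(1 - rho) cos(v theta) + 2 rho sin(theta / 2) sin((v + 1/2) theta)."""
    return (1 - rho) * math.cos(v * theta) + 2 * rho * math.sin(theta / 2) * math.sin((v + 0.5) * theta)

def truncated_cosine_sum(rho, theta, terms=2000):
    """Partial sum of sum_v rho^v (1 - rho) |cos(v theta)|, which the text bounds by 1."""
    return sum(rho ** v * (1 - rho) * abs(math.cos(v * theta)) for v in range(terms))

if __name__ == "__main__":
    for rho, theta, v in [(0.5, 0.3, 0), (0.9, 1.2, 7), (0.99, -0.4, 25)]:
        print(lhs_3_2(rho, theta, v), rhs_3_2(rho, theta, v), truncated_cosine_sum(rho, theta))
```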


4 $k, k_1, k_2, \dots$ denote absolute constants.

Thus a sufficient condition is:

$$\frac{\theta}{1 - \rho} = O(1) \quad\text{for } \rho \to 1, \text{ or for } \theta \to 0.$$

This condition is also necessary, as is seen from


?'’i™ ( ’ + * WI > (> + ~ s{rb “ 1-2,17+% } 

__ 1 — (1 + p 2 ) cos 0 + p 2 _ (1 4- p) 2 (1 — cos 0) 

“ 2(1 - p)(l - 2p cos 0 + P 2 ) “ 2(1 - p) {(1 - p) 2 + 2p(l - cos 0)}' 

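Thus the regularity condition here is the same as for Abel-summability.

The method corresponding to (1.6) is now

$$A_n(\rho_n, \theta_n) = \sum_{\nu=0}^{n} u_\nu \rho_n^\nu \cos\nu\theta_n = \sum_{\nu=0}^{n-1} s_\nu \rho_n^\nu \{\cos\nu\theta_n - \rho_n \cos(\nu + 1)\theta_n\} + s_n \rho_n^n \cos n\theta_n; \tag{3.3}$$

the regularity conditions for this method are:

$$\rho_n \to 1, \quad \theta_n \to 0 \quad\text{as } n \to \infty, \tag{3.4}$$

$$\rho_n^n\, |\cos n\theta_n| + \sum_{\nu=0}^{n-1} \rho_n^\nu\, |\cos\nu\theta_n - \rho_n \cos(\nu + 1)\theta_n| = O(1) \quad\text{as } n \to \infty. \tag{3.5}$$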

0 

From $|\Re(x^\nu - x^{\nu+1})| \le |x^\nu - x^{\nu+1}|$ it is clear that the conditions (2.11) are
sufficient for the regularity of the method (3.3). We shall prove that they are
also necessary. We first show that $\rho_n^n = O(1)$ is a necessary condition. If

n | On | g then pi | cos nO n | ^ pI If w | 0 n | > then define X = 

Xn ^ 1 by 

(2X - \)\ < n | 0 n | g (2X + 1)^, 
and define k as the smallest integer for which 


*|0n| (2X- iy~. 

Thus (k — 1) | 0„ | < (2X — 1) | < n | 0„ |, <c < n; and 

„ ■>. fo\ i ^ * _ (2X ■+■ l)ir 2X 1 ^ 2X 1 n 

(2X - 1)^ - £ 2X"+"l” * 3- 


(3.6) 





Furthermore 


n—1 

£ Pn | COS V$ n 


and 

hence 


Pn COS (v + 1 )0n I 
*-1 

£= I S (Pn COS V0 n — Pn +1 COS (V + l)0 n ) | = | 1 — p£ COS K0 n |, 
0 

(2A — 1) | £ * j 0 n | < (2X - 1) | + | 0 B |, 


*|0«| « (2X - 1) 0£«.<|ft,|, 


COS K \0n\ ^ sin 


in (l- l 0 nl)- 


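This and (3.6) proves that (3.5) implies $\rho_n^n = O(1)$. Now using (3.2) we see
that (3.5) is equivalent to

$$\theta_n \sum_{\nu=0}^{n-1} \rho_n^\nu\, |\sin(\nu + \tfrac12)\theta_n| = O(1) \quad\text{as } n \to \infty.$$

Hence we must have

$$\theta_n \sum_{\nu=0}^{n-1} \rho_n^\nu \sin^2(\nu + \tfrac12)\theta_n = O(1) \quad\text{as } n \to \infty. \tag{3.7}$$

But

$$2 \sum_{\nu=0}^{n-1} \rho^\nu \sin^2(\nu + \tfrac12)\theta = \sum_{\nu=0}^{n-1} \rho^\nu (1 - \cos(2\nu + 1)\theta)$$

$$= \frac{1 - \rho^n}{1 - \rho} - \frac{(1 - \rho)\cos\theta}{1 + \rho^2 - 2\rho\cos 2\theta} + \frac{\rho^n [\cos(2n + 1)\theta - \rho\cos(2n - 1)\theta]}{1 + \rho^2 - 2\rho\cos 2\theta}$$

$$= \frac{1 - \rho^n}{1 - \rho} - \frac{(1 - \rho)\cos\theta}{(1 - \rho)^2 + 4\rho\sin^2\theta} + \frac{\rho^n [\cos(2n + 1)\theta - \cos(2n - 1)\theta]}{(1 - \rho)^2 + 4\rho\sin^2\theta} + \frac{\rho^n (1 - \rho)\cos(2n - 1)\theta}{(1 - \rho)^2 + 4\rho\sin^2\theta}.$$

Now

$$\frac{|1 - \rho_n|\, |\cos\theta_n|}{(1 - \rho_n)^2 + 4\rho_n \sin^2\theta_n} \le \frac{|1 - \rho_n|}{4\rho_n^{1/2}\, |1 - \rho_n|\, |\sin\theta_n|} = O\left( \frac{1}{\theta_n} \right),$$

$$\frac{|\cos(2n + 1)\theta_n - \cos(2n - 1)\theta_n|}{(1 - \rho_n)^2 + 4\rho_n \sin^2\theta_n} \le \frac{2\, |\sin\theta_n|}{4\rho_n \sin^2\theta_n} = O\left( \frac{1}{\theta_n} \right),$$

and we find that (3.7) implies $\theta_n\, \dfrac{1 - \rho_n^n}{1 - \rho_n} = O(1)$. We have thus proved: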

Theorem 2. Necessary and sufficient conditions for the regularity of the transform
(3.3) are conditions (2.11). In the special case $\rho_n \equiv 1$ these conditions
reduce to $\theta_n = O(1/n)$ as $n \to \infty$. The corresponding transform $A_n(1, \theta_n) =
\sum_{\nu=0}^{n} u_\nu \cos\nu\theta_n$ was first introduced by Rogosinski [5, 6], in connection with
trigonometric series.


4. Finally let

$$\varphi_n(x) = \frac{\sin nx}{nx}, \qquad n = 0, 1, 2, \dots, \quad (\varphi_0 \equiv 1).$$

Now (1.2) with $x \to 0$ is Lebesgue's summability; it is not regular, as (1.4) is not
satisfied. The associated transform (1.6) becomes

$$A_n(x_n) = \sum_{\nu=0}^{n} u_\nu\, \frac{\sin\nu x_n}{\nu x_n}, \qquad n = 0, 1, 2, \dots. \tag{4.1}$$

The necessary and sufficient conditions for regularity are (from (1.8) and (1.9)
for real $x_n$):

$$x_n \to 0$$

and

$$\sum_{\nu=0}^{n-1} \left| \frac{\sin\nu x_n}{\nu x_n} - \frac{\sin(\nu + 1)x_n}{(\nu + 1)x_n} \right| = O(1) \quad\text{as } n \to \infty.$$


We restrict our discussion to real positive $x_n = \theta_n \downarrow 0$; we shall prove that as in
Theorem 2, $n\theta_n = O(1)$ is the necessary and sufficient condition for regularity.
Elementary calculus shows that $\sin\theta/\theta$ is monotonic in the successive intervals
$t_{\nu-1} \le \theta \le t_\nu$, $\nu = 1, 2, 3, \dots$, where $t_0 = 0$ and $t_1, t_2, t_3, \dots$ are the positive
roots of the equation

$$\sin\theta = \theta\cos\theta; \tag{4.2}$$

also

$$t_\nu = \nu\pi + \alpha_\nu, \quad\text{where } 0 < \alpha_\nu < \frac{\pi}{2},$$

and substitution into (4.2) yields easily



in particular $\alpha_\nu \to \pi/2$. Subdividing in

$$\sum_{\nu=0}^{n-1} \left| \frac{\sin\nu\theta_n}{\nu\theta_n} - \frac{\sin(\nu + 1)\theta_n}{(\nu + 1)\theta_n} \right|$$

the summation into parts, in each of which the differences have constant sign, we
find now that $n\theta_n = O(1)$ is the necessary and sufficient condition for uniform
boundedness. Thus we have proved:

Theorem 3. The transform (4.1) with $x_n = \theta_n \downarrow 0$ is regular if and only if
$n\theta_n = O(1)$ as $n \to \infty$.
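The role of the condition $n\theta_n = O(1)$ can be seen by computing the variation sum directly. The sketch below is only illustrative: the two sample choices $\theta_n = 1/n$ and $\theta_n = n^{-1/2}$ are assumptions made here, and the function names are hypothetical.

```python
import math

def convergence_factor(v, theta):
    """phi_v(theta) = sin(v theta) / (v theta), with phi_0 = 1."""
    return 1.0 if v == 0 else math.sin(v * theta) / (v * theta)

def variation_sum(n, theta):
    """sum_{v=0}^{n-1} |phi_v(theta) - phi_{v+1}(theta)|, which must stay bounded for regularity."""
    return sum(abs(convergence_factor(v, theta) - convergence_factor(v + 1, theta)) for v in range(n))

if __name__ == "__main__":
    for n in (100, 10000):
        # theta_n = 1/n keeps n * theta_n bounded; theta_n = n^(-1/2) does not.
        print(n, variation_sum(n, 1 / n), variation_sum(n, n ** -0.5))
```

With $\theta_n = 1/n$ the factors decrease monotonically from 1 to $\sin 1$, so the sum stays below 1; with $\theta_n = n^{-1/2}$ the sum picks up every oscillation of $\sin\theta/\theta$ on $(0, \sqrt n)$ and keeps growing.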

5. In this section we establish a lemma for later application. We restrict
ourselves to real positive sequences $\{x_n\}$. We have seen already (§2) that the
class

$$x_n^n = O(1) \quad\text{as } n \to \infty, \tag{5.1}$$

is characterized by

$$\liminf_{n \to \infty}\, n(1 - x_n) > -\infty; \tag{5.2}$$

and the smaller class

$$x_n^n = o(1) \quad\text{as } n \to \infty \tag{5.3}$$

is characterized by

$$n(1 - x_n) \to +\infty \quad\text{as } n \to \infty. \tag{5.4}$$

We now prove the

Lemma. Let $0 < x_n < 1$ for all large $n$; then any one of the three conditions:

$$x_n^n \log\frac{1}{1 - x_n} = O(1), \tag{5.5}$$

$$x_n^n \log n = O(1), \tag{5.6}$$

$$\liminf_{n \to \infty}\, \{n(1 - x_n) - \log\log n\} > -\infty, \tag{5.7}$$

implies the two others.

First assume (5.5); then

$$k_1 x_n^{-n} + \log(1 - x_n) > 0 \quad\text{for all large } n. \tag{5.8}$$

Let

$$g(x) = k_1 x^{-n} + \log(1 - x), \qquad 0 < x < 1,$$

then

$$g'(x) = -n k_1 x^{-n-1} - \frac{1}{1 - x} < 0,$$

hence $g(x) \downarrow -\infty$ as $x \uparrow 1$. From (5.8): $g(x_n) > 0$ for all large $n$. We introduce
for a constant $k$

$$x_n(k) = 1 - \frac{1}{n}(\log\log n - k);$$

(2.10) yields

$$(x_n(k))^n = \exp\{(k - \log\log n)(1 + o(1))\} = \frac{e^k}{\log n}(1 + o(1)) \tag{5.9}$$

as $n \to \infty$. Hence $g(x_n(k)) < 0$ for $e^{-k} < 1/k_1$ and all large $n$. Thus

$$x_n < x_n(k) \quad\text{for } k > \log k_1, \text{ and for all large } n, \tag{5.10}$$

which is (5.7); moreover (5.9) and (5.10) yield (5.6). We now assume (5.6);
thus

$$x_n^n < k_2/\log n \quad\text{for all large } n,$$

or

$$x_n^{-n} > \log n/k_2, \quad\text{and}\quad e^{n(x_n^{-1} - 1)} > x_n^{-n} > \log n/k_2 \quad\text{for } n > n_0.$$

Hence

$$n(x_n^{-1} - 1) > \log\log n - \log k_2,$$

or

$$x_n < \frac{1}{1 + \frac{1}{n}(\log\log n - \log k_2)} < 1 - \frac{1}{n}(\log\log n - \log k_3), \tag{5.11}$$

which gives (5.7). From here we get easily

$$\log\frac{1}{1 - x_n} = \log n + O(1), \tag{5.12}$$

which together with (5.6) yields (5.5). Finally assume (5.7); then (5.11) holds,
and the inequality

$$1 - x < e^{-x} \quad\text{for } 0 < x < 1$$

gives $x_n^n < k_2/\log n$, which is (5.6). This and (5.12) finally give (5.5). This
proves the lemma.


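6. We now apply the summability methods introduced in §§2, 3, 4, to Fourier
series. Let $f(\theta)$ be an integrable function of period $2\pi$ with the Fourier series:

$$f(\theta) \sim \tfrac12 a_0 + \sum_{\nu=1}^{\infty} (a_\nu \cos\nu\theta + b_\nu \sin\nu\theta), \tag{6.1}$$

so that

$$b_0 = 0, \qquad a_\nu + i b_\nu = \frac{1}{\pi} \int_{-\pi}^{\pi} f(\theta) e^{i\nu\theta}\, d\theta, \qquad \nu = 0, 1, 2, \dots.$$

Applying (2.3) to the series (6.1) yields

$$A_n(x_n, f) = \tfrac12 a_0 + \sum_{\nu=1}^{n} x_n^\nu (a_\nu \cos\nu\theta + b_\nu \sin\nu\theta) = \frac{1}{\pi} \int_{-\pi}^{\pi} f(\theta + t) \Big\{ \tfrac12 + \sum_{\nu=1}^{n} x_n^\nu \cos\nu t \Big\}\, dt = \frac{1}{\pi} \int_{-\pi}^{\pi} f(\theta + t)\, T_n(x_n, t)\, dt, \tag{6.2}$$

where

$$T_n(x_n, t) = \tfrac12 + \sum_{\nu=1}^{n} x_n^\nu \cos\nu t.$$

If we put

$$H(x, \theta) = \tfrac12 a_0 + \sum_{\nu=1}^{\infty} x^\nu (a_\nu \cos\nu\theta + b_\nu \sin\nu\theta),$$

$$s_n(x, \theta) = \tfrac12 a_0 + \sum_{\nu=1}^{n} x^\nu (a_\nu \cos\nu\theta + b_\nu \sin\nu\theta),$$

then $A_n(x_n, f) = s_n(x_n, \theta)$.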

(6.2) is an integral-transform of the function $f(\theta)$ into a sequence $A_n$ by the
kernel $T_n(x_n, t)$. Such a transform is called regular, if $A_n \to \tfrac12\{f(\theta+) + f(\theta-)\}$
at every point of continuity. On defining [cf. Szász [8], chapter 3]

$$K(x, t) = \frac{2}{\pi}\, T_n(x_n, t) \quad\text{for } n \le x < n + 1, \text{ and } 0 \le t \le \pi,$$

and $K(x, t) = 0$ for $x \ge 0$, $t > \pi$,

furthermore

$$\chi(t) = \tfrac12 \{f(\theta + t) + f(\theta - t)\},$$

we get for the transform (6.2) $\int_0^\infty K(x, t)\chi(t)\, dt$, a step function of $x$. The
necessary and sufficient conditions for regularity of this transform are [cf.
Agnew 1]:

$$\lim_{n \to \infty} \int_h^\pi T_n(x_n, t) g(t)\, dt = 0 \quad\text{for any integrable } g(t) \text{ and any } h > 0, \tag{6.3}$$

$$\limsup_{n \to \infty} \int_0^\pi |T_n(x_n, t)|\, dt < \infty, \tag{6.4}$$

$$\lim_{n \to \infty} \int_0^\pi T_n(x_n, t)\, dt = \frac{\pi}{2}. \tag{6.5}$$

It is of interest to consider real positive sequences $x_n$, for which $T_n(x_n, t)$
is non negative for all $t$. Evidently there exists an $r_n$ such that $T_n(x_n, t) \ge 0$
for all $t$, when $0 < x_n \le r_n$, whereas for any $\epsilon > 0$ $T_n(r_n + \epsilon, t)$ becomes $< 0$
for some $t$. I. Schur and G. Szegő [7] have proved that:

$$r_n \uparrow 1, \qquad \frac{n}{\log n}(1 - r_n) \to 1 \quad\text{as } n \to \infty, \tag{6.6}$$

$$r_n > 1 - \frac{1}{n}\log 2n \quad\text{for } n \ge 1. \tag{6.7}$$

For odd $n$, $r_n$ is the positive root of the equation

$$1 - r - 2r^{n+1} = 0. \tag{6.8}$$
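For odd $n$ the root of (6.8) is easy to compute, which gives a numerical check of (6.7) and a feel for how slowly the ratio in (6.6) approaches 1. This is a sketch only; bisection is a choice made here for illustration, and the function name is hypothetical.

```python
import math

def schur_szego_root(n, tol=1e-14):
    """Positive root of 1 - r - 2 r^(n+1) = 0, found by bisection on (0, 1)."""
    lo, hi = 0.0, 1.0  # the left side equals 1 at r = 0, -2 at r = 1, and is decreasing
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if 1 - mid - 2 * mid ** (n + 1) > 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

if __name__ == "__main__":
    for n in (1, 3, 11, 101, 1001):
        r = schur_szego_root(n)
        # Columns: n, r_n, the lower bound of (6.7), and the ratio appearing in (6.6).
        ratio = n * (1 - r) / math.log(n) if n > 1 else float("nan")
        print(n, r, 1 - math.log(2 * n) / n, ratio)
```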

It follows from (6.8) and (6.6) that

$$r_n^n = O(1 - r_n) \quad\text{as } n \to \infty.$$

Thus (from §2) the transforms (2.3) and (2.6) are both regular for $x_n = r_n$.
Moreover $A_n(r_n, f)$ is a regular transform; this follows from the statement:

Theorem 4. The function-to-sequence transform (6.2) (where $0 < x_n < 1$)
is regular if and only if $x_n \to 1$ and if (5.7) holds.

Evidently for any $x_n$

$$\int_0^\pi T_n(x_n, t)\, dt = \frac{\pi}{2},$$

hence we need only satisfy (6.3) and (6.4). We restrict ourselves to $0 < x_n < 1$,
and make use of the formula [7, §1]:

$$2T_n(x, t) = (1 - 2x\cos t + x^2)^{-1} \{1 - x^2 + 2x^{n+2}\cos nt - 2x^{n+1}\cos(n + 1)t\}.$$
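The closed form quoted from [7, §1] can be compared with the defining sum $T_n(x, t) = \tfrac12 + \sum_{\nu=1}^{n} x^\nu \cos\nu t$. The following sketch is an illustration with hypothetical function names and arbitrarily chosen sample points.

```python
import math

def kernel_by_sum(n, x, t):
    """T_n(x, t) = 1/2 + sum_{v=1}^{n} x^v cos(v t)."""
    return 0.5 + sum(x ** v * math.cos(v * t) for v in range(1, n + 1))

def kernel_closed_form(n, x, t):
    """Half of (1 - 2x cos t + x^2)^(-1) {1 - x^2 + 2x^(n+2) cos nt - 2x^(n+1) cos (n+1)t}."""
    denominator = 1 - 2 * x * math.cos(t) + x * x
    numerator = (1 - x * x
                 + 2 * x ** (n + 2) * math.cos(n * t)
                 - 2 * x ** (n + 1) * math.cos((n + 1) * t))
    return numerator / (2 * denominator)

if __name__ == "__main__":
    for n, x, t in [(1, 0.3, 0.1), (4, 0.9, 1.0), (9, 0.5, 3.0)]:
        print(kernel_by_sum(n, x, t), kernel_closed_form(n, x, t))
```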
The expression $\frac{2}{\pi} \int_0^\pi |T_n(x_n, t)|\, dt = l(x_n)$ may be called the $n$-th Lebesgue
constant of our summability method. As

$$\int_0^\pi \frac{1 - x^2}{1 - 2x\cos t + x^2}\, dt = \pi,$$

it is clear that (6.4) is satisfied if and only if

$$x_n^n \int_0^\pi \frac{|x_n \cos nt - \cos(n + 1)t|}{1 - 2x_n\cos t + x_n^2}\, dt = O(1) \quad\text{as } n \to \infty. \tag{6.11}$$

But from the identity

$$x\cos nt - \cos(n + 1)t = x(\cos nt - \cos(n + 1)t) - (1 - x)\cos(n + 1)t$$

it follows that (6.11) holds if and only if

$$x_n^n \int_0^\pi |\cos nt - \cos(n + 1)t|\, (1 - 2x_n\cos t + x_n^2)^{-1}\, dt = O(1). \tag{6.12}$$

We now prove that the necessary and sufficient condition for (6.12) is (5.5).
To show that (5.5) implies (6.12) write

f I cos nt — cos (n + 1)21 (1 — 2x n cos 2 + x 2 n )~ l dt 
Jo 


= 2 / 
Jo 


* . 2 | sin (n + $)21 dt r r tdt 

sin --- < / -2- 

(1 - x n )* + 4:c„ sin* ^ 4 (1 - a;*)* + ^z»<* 

. f 1- ** tdt ,2 r .. 1 , 2 . «- 

I (1 - X„y + X„ Jls n d 2 + Xn l ° g l -X*’ 






for 0 < x n < 1; this proves the first part of our statement. The converse is 
seen from the estimate: 


r T . t | sin (n + i)t\dt _ 1 f w t | sin (n + h)t ! dt . , , 1 

/ srnz- > r- / J > fc log -. 

Jo 2 (1 - x n ) 2 + 4x„ sin 8l - 2 * J, - I “ 1 1 “ * n 

We finally show that (6.3) is satisfied whenever x n —> 1 (assuming that (5.5) 
holds). We have 

Xn I x n cos nt — cos (n + l)t 11 git) | (1 — 2x n cos t + x 2 n ) -1 dt 

J h 

f I git) I dt = o(l) 
Jo 


cos h 


as $n \to \infty$ (from (5.6)). Hence, using (6.10), (6.3) holds if and only if

$$(1 - x_n^2) \int_h^\pi (1 - 2x_n\cos t + x_n^2)^{-1} g(t)\, dt \to 0 \quad\text{as } n \to \infty,$$

and this is equivalent to $x_n \to 1$. From the lemma of §5 we finally get
Theorem 4.

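We now apply the summability method (2.6) to Fourier series. Introducing
the notation

$s_0 = \tfrac12 a_0,\quad s_n(f, \theta) = s_n = \tfrac12 a_0 + \sum_{\nu=1}^{n}(a_\nu\cos\nu\theta + b_\nu\sin\nu\theta),\quad n \ge 1,$

and using the formula

$s_n = \frac{1}{2\pi}\int_0^\pi \{f(\theta+t) + f(\theta-t)\}\,\frac{\sin (n+\frac12)t}{\sin\frac12 t}\,dt,$

we get from (2.6) the function-to-sequence transform:

(6.13) $B_n(x_n, f) = \frac{1-x_n}{2\pi}\int_0^\pi \{f(\theta+t) + f(\theta-t)\}\,U_n(x_n, t)\,dt,$

where

$U_n(x_n, t) = \frac{1}{\sin\frac12 t}\sum_{\nu=0}^{n} x_n^\nu\sin (\nu+\tfrac12)t.$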

For the regularity of the transform (6.13) we have the necessary and sufficient
conditions:

(6.14) $\lim_{n\to\infty} (1 - x_n)\int_h^\pi U_n(x_n, t)\,g(t)\,dt = 0$ for any $g(t) \in L$, and any given $h > 0$,

(6.15) $\limsup_{n\to\infty} \frac{1-x_n}{\pi}\int_0^\pi |U_n(x_n, t)|\,dt < \infty,$

(6.16) $\lim_{n\to\infty} \frac{1-x_n}{\pi}\int_0^\pi U_n(x_n, t)\,dt = 1.$





For $f(t) \equiv 1$ formula (6.13) yields

$1 - x_n^{n+1} = \frac{1-x_n}{\pi}\int_0^\pi U_n(x_n, t)\,dt,$

and condition (6.16) becomes $x_n^n \to 0$ as $n \to \infty$; assume again $0 < x_n < 1$,
then we must have $n(1 - x_n) \to +\infty$. But Fejér [2, p. 20] remarked that for
any $c_0 > c_1 > \cdots > c_n > 0$, $\sum_{\nu=0}^{n} c_\nu \sin (\nu+\tfrac12)t > 0$ for $0 < t < 2\pi$; hence
the kernel of the transform (6.13) is positive, and the condition (6.15) is satisfied.
We finally discuss condition (6.14). From the harmonic development

$\sum_{\nu=0}^{\infty} x^\nu \sin (\nu+\tfrac12)t = \frac{(1+x)\sin\tfrac12 t}{1 - 2x\cos t + x^2},\quad 0 < x < 1,$

we find easily the formula

$U_n(x, t)\,\sin\tfrac12 t\,(1 - 2x\cos t + x^2) = (1+x)\sin\tfrac12 t + x^{n+2}\sin (n+\tfrac12)t - x^{n+1}\sin (n+\tfrac32)t.$
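As a numerical sanity check (not part of the original argument), the following Python sketch compares the defining sum for the kernel $U_n$ with the closed formula just stated, and samples the positivity that Fejér's remark asserts. The names `U` and `closed_form_gap` are illustrative choices, and the sample values of n, x, t are arbitrary.

```python
import math

def U(n, x, t):
    """U_n(x, t) = (1 / sin(t/2)) * sum_{v=0}^{n} x^v sin((v + 1/2) t)."""
    total = sum(x**v * math.sin((v + 0.5) * t) for v in range(n + 1))
    return total / math.sin(t / 2)

def closed_form_gap(n, x, t):
    """Absolute difference between the two sides of the closed formula for U_n."""
    lhs = U(n, x, t) * math.sin(t / 2) * (1 - 2 * x * math.cos(t) + x * x)
    rhs = ((1 + x) * math.sin(t / 2)
           + x**(n + 2) * math.sin((n + 0.5) * t)
           - x**(n + 1) * math.sin((n + 1.5) * t))
    return abs(lhs - rhs)

if __name__ == "__main__":
    print(closed_form_gap(7, 0.9, 1.3))   # expected to be at rounding-error level
    print(min(U(7, 0.9, 0.1 + 0.3 * k) for k in range(20)))  # sampled kernel values
```

The sampled kernel values lie in 0 < t < 2π, where the coefficients $x^\nu$ with 0 < x < 1 are positive and decreasing, which is the situation covered by Fejér's remark.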

Using this formula and the condition $x_n^n \to 0$, it follows immediately that (6.14)
holds if and only if

$(1 - x_n)\int_h^\pi (1 - 2x_n\cos t + x_n^2)^{-1} g(t)\,dt \to 0,$

i.e. if $x_n \to 1$. Thus we have

Theorem 5. The necessary and sufficient conditions for the regularity of the
transform (6.13) (where $0 < x_n < 1$) are:

$x_n \to 1$ and $n(1 - x_n) \to +\infty$, as $n \to \infty$.


7. We give here a Tauberian type theorem for the method $A_n(x_n)$ of §2.
Theorem 6. Suppose $x_n$ real or complex, and

(7.1) $\limsup_{n\to\infty} |1 - x_n|\,\frac{|x_n|^{-n} - 1}{1 - |x_n|} = \lambda < 1,$

then $A_n(x_n) \to s$ implies $s_n \to s$.

Note that (7.1) implies $\limsup_{n\to\infty} \bigl|\,|x_n|^{-n} - 1\bigr| \le \lambda$, and in particular $x_n \to 1$, and

$\frac{1}{1+\lambda} \le \liminf_{n\to\infty} |x_n|^n \le \limsup_{n\to\infty} |x_n|^n \le \frac{1}{1-\lambda}.$

The theorem can be deduced from recent results of R. Rado [4]; we give a direct
proof, following his device.

Suppose first that $s = 0$ and $s_n = O(1)$; then $0 \le \limsup_{n\to\infty} |s_n| = \delta < \infty$; we
shall prove that the assumption $\delta > 0$ leads to a contradiction. To a given
$\varepsilon > 0$ choose $n = n(\varepsilon)$ so that $|s_\nu| < \delta + \varepsilon$ for $\nu > n$; then choose $m > n$ so
that $|s_m| > \delta - \varepsilon$. Now, using (2.3)





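$\delta - \varepsilon < |s_m| = \Bigl|A_m x_m^{-m} - \sum_{\nu=0}^{m-1} s_\nu x_m^{\nu-m}(1 - x_m)\Bigr|$

$\le |A_m x_m^{-m}| + \Bigl|\sum_{\nu=0}^{n} s_\nu x_m^{\nu-m}(1 - x_m)\Bigr| + (\delta + \varepsilon)\,|1 - x_m|\,\frac{|x_m|^{-m} - 1}{1 - |x_m|};$

thus, if m is large enough

$\delta - \varepsilon < \varepsilon + \varepsilon + (\delta + \varepsilon)(\lambda + \varepsilon) < \delta\lambda + \varepsilon(3 + \delta + \varepsilon),$

where $\lambda < 1$. This is a contradiction for $\varepsilon$ small enough.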

We next assume $s = 0$ and $\limsup |s_n| = \infty$; choose $0 < \varepsilon$ small and $l$ large;
denote by $m = m(l)$ the least $n$ for which $|s_n| > l$. Then (2.3) yields

$l < |s_m| \le |A_m x_m^{-m}| + l\,|1 - x_m|\,\frac{|x_m|^{-m} - 1}{1 - |x_m|} < \varepsilon + l(\lambda + \varepsilon),$

which again is impossible for small $\varepsilon$. Thus the theorem is proved for $s = 0$.
Finally for $s \ne 0$ we apply our result to the sequence $s_n - s$, and its transform

$(s_n - s)x_n^n + (1 - x_n)\sum_{\nu=0}^{n-1}(s_\nu - s)x_n^\nu = A_n - s \to 0,$

which yields $s_n \to s$; our theorem is proved.
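The quantities in Theorem 6 are easy to experiment with. The Python sketch below is an illustration only: it assumes that the transform has the form $A_n = s_n x_n^n + (1 - x_n)\sum_{\nu<n} s_\nu x_n^\nu$, as the last display of the proof suggests, and the names `transform` and `tauberian_ratio` are hypothetical.

```python
import math

def transform(s, x):
    """A_n = s_n x^n + (1 - x) * sum_{v < n} s_v x^v for s = [s_0, ..., s_n].

    Assumed form of the transform, read off from the proof of Theorem 6.
    """
    n = len(s) - 1
    return s[n] * x**n + (1 - x) * sum(s[v] * x**v for v in range(n))

def tauberian_ratio(n, x):
    """The expression |1 - x| (|x|^{-n} - 1) / (1 - |x|) whose upper limit appears in (7.1)."""
    return abs(1 - x) * (abs(x) ** (-n) - 1) / (1 - abs(x))

if __name__ == "__main__":
    # A constant sequence is reproduced by the transform.
    print(transform([3.0] * 20, 0.9))
    # For real x_n = 1 - c/n the ratio approaches e^c - 1.
    n, c = 1000, 0.5
    print(tauberian_ratio(n, 1 - c / n), math.exp(c) - 1)
```

For real $x_n = 1 - c/n$ the ratio reduces to $x_n^{-n} - 1$, which tends to $e^c - 1$, so the hypothesis $\lambda < 1$ of (7.1) holds in this family exactly when $c < \log 2$.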

Final note. The argument of §6 can be used to prove that the transforms
$A_n(x_n, f)$, $B_n(x_n, f)$ under the regularity conditions of Theorems 4 and 5
converge to $f(\theta)$ if only

$\int_0^t |f(\theta + u) + f(\theta - u) - 2f(\theta)|\,du = o(t)$ as $t \to 0$,

thus for almost all $\theta$.

University of Cincinnati 

LITERATURE 

1. R. P. Agnew, Properties of generalized definitions of limit, Bull. Am. Math. Soc., vol. 45, 1939, pp. 690-730.

2. L. Fejér, Trigonometrische Reihen und Potenzreihen mit mehrfach monotoner Koeffizientenfolge, Trans. Am. Math. Soc., vol. 39, 1936, pp. 18-59.

3. O. Perron, Beitrag zur Theorie der divergenten Reihen, Math. Zeitschrift, vol. 6, 1920, pp. 286-310.

4. R. Rado, Some elementary Tauberian Theorems (I), Quarterly Jour. of Math., Oxford Series, vol. 9, 1938, pp. 274-282.

5. W. Rogosinski, Über die Abschnitte trigonometrischer Reihen, Math. Annalen, vol. 95, 1926, pp. 110-134.

6. W. Rogosinski, Abschnittsverhalten bei trigonometrischen und insbesondere Fourierschen Reihen, Math. Zeitschrift, vol. 41, 1936, pp. 75-136.

7. I. Schur und G. Szegö, Über die Abschnitte einer im Einheitskreise beschraenkten Potenzreihe, Sitzungsberichte der preussischen Akademie, 1923, pp. 545-560.

8. O. Szász, Lectures on summability methods, University of Cincinnati, 1936-1937.



Annals of Mathematics
Vol. 43, No. 1, January, 1942 


GENERALIZED SURFACES IN THE CALCULUS OF VARIATIONS 

By L. C. Young 
(Received September 30, 1941) 

I. Generalized Lipschitzian Surfaces 

1. Introduction 

This note deals with an extension of the idea of surface, which is similar to 
the writer’s generalization 1 of the notion of curve in the Calculus of Variations. 
The primary object of such generalizations is to ensure, as far as possible, that 
every problem of the Calculus of Variations has at least one solution. Their 
need has been apparent ever since the illuminating remark made by Hilbert: 2
Eine jede Aufgabe der Variationsrechnung besitzt eine Lösung, sobald hinsichtlich
der Natur der Grenzbedingungen geeignete einschränkende Annahmen erfüllt sind,
und, nötigenfalls, der Begriff der Lösung eine sinngemässe Erweiterung erfährt.

In the existence theorems for variational problems concerning an unknown 
curve, the customary restriction to regular problems can now be removed by 
using generalized curves , and applications of these notions have lately been made 
by McShane. 3 

Problems of minima of variational problems concerning surfaces are essentially
far more complicated than those concerning curves. In this note, however,
the additional complication appears only in the proof of certain theorems,
not in their statement.

The definition of generalized surface given here, and the proof of the existence 
of a (generalized) solution, are framed for a problem of minimum originally 
stated in a class $\Sigma$ of ordinary surfaces

$z = z(x, y)$

with a given boundary. In the existence theorem, the principal further hypothesis
is the restriction to a Lipschitzian class of surfaces, by which we mean
that the function z(x, y) satisfies the condition

$|z(x', y') - z(x'', y'')| \le K\rho,$

where K is a constant depending only on the class $\Sigma$ and where $\rho$ is the distance
of the arbitrary points (x', y') and (x'', y''). The restriction expressed by this
Lipschitz condition is rendered necessary by the ordinary (as opposed to the


1 Young [12], [13], [14].

2 Hilbert [2] p. 184; cf. also Lebesgue [3] p. 372.

3 McShane [4], [5], [6].




parametric) formulation of the problem, not by the fact that we have surfaces 
rather than curves. 

It is perhaps superfluous to add that the results of the present note merely 
constitute a beginning. The Lipschitz condition, although needed in the general 
type of problem considered here, is inconvenient in the applications to classical 
problems such as that of Dirichlet or Plateau, where a Lipschitzian boundary 
function may correspond to one or more singularities of the solution. And in 
any case, the usefulness of existence theorems which are based on a generalization,
devised for the purpose, of the notion of surface, is naturally dependent on
a new extension of the necessary conditions for attained minima such as that
studied by the writer and by McShane in the case of curves. These questions
we reserve for a later date.

2. An elementary unattained minimum 

We shall follow the example set by McShane in his treatment of generalized
curves 4 and begin with a “heuristic discussion.” For the purpose of this discussion,
let us compare our variational problem with an elementary minimum
problem which consists in finding the least value of the function

m = t - 8 1, 

the latter being supposed defined in the first instance for rational t only. Let 
us analyze briefly this elementary problem as treated in the Differential Calculus, 
the term “least value” being first reinterpreted in the obvious way. 

We first define convergence of a sequence of rational numbers, so as to make
every such simple function f(t) continuous. More precisely, whenever $\{t_n\}$ is a
convergent sequence of rational numbers so is $\{f(t_n)\}$. Such a definition is
furnished by the classical convergence criterion of Cauchy; indeed the classical
idea of convergence may fairly be said to have been devised primarily for this
very purpose. Next we define irrational numbers as limits of rational ones, and
we extend to them the definition of the function f(t) by continuity. In so doing
we need not refer to properties special to numbers, except those implied in the
definition of convergent sequence. 5 Finally we apply the methods of elementary
Analysis to show that in its extended range the given function attains its least
value, and to determine the latter in the usual way.

Can we proceed similarly in the Calculus of Variations? The above comparison
strongly suggests that we can, and this without restricting ourselves in

4 McShane [4] p. 514.

5 The writer has in mind a definition “by abstraction” according to the terminology of 
Weyl [10] p. 7; and this definition is perhaps free from the criticism which Weyl himself [11] 
has directed against irrationals. Two sequences have the property of limiting equality if 
they are subsequences of a same convergent sequence; a system of sequences between any 
two of which the above property holds defines a real number, which is termed rational 
when the system contains a sequence of repetitions, otherwise irrational. With this 
definition, only the property of continuity actually stated is needed to extend f(t) to
irrational values of t.





any way to regular problems as past writers have done, if only we can produce a 
suitable definition of convergence. Now it is quite true that the definition of 
convergence normally applied to surfaces S in Analysis, renders the functions 
F(S) which occur in variational problems, discontinuous; this is so even in
regular problems, when the functions are merely semi-continuous. But, as
remarked by McShane, we need not look very far for a more suitable definition: 
it is sufficient to term the sequence of surfaces {S n } convergent if and only if, 
for every F(S ) of the Calculus of Variations, the sequence of numbers {F(S n )} 
converges. The functions F(S) are then, automatically, continuous. 

3. Connection with linear operations 

In classical problems of the Calculus of Variations, the functions F(S) have
the form

$L_S(f) = \iint_A f(x, y, z, p, q)\,dx\,dy = \iint_A \bar f(x, y)\,dx\,dy,$

where the integrand f is a continuous function of five real variables and $\bar f$ is its
value at the point x, y of the domain 6 of definition A when we substitute for the
three variables z, p, q the function z(x, y) and its gradient p(x, y), q(x, y). We
shall suppose for simplicity that the domain A is convex. Actually only minor
verbal changes are needed to make the discussion applicable to domains of the
most general kind. 6a

Just as in the writer’s work on rectifiable curves, it is possible to treat problems
of minimum in which a number of such double integrals occur simultaneously,
by allowing the functions f and F to assume values in a vector-space instead of
only real values; we shall content ourselves with the case of functions taking
real values, and leave the reader to make the modifications appropriate to
vector-functions which are suggested by the writer’s treatment for curves. 7

With the above form for F(S), the definition of convergence of surfaces suggested
in the preceding paragraph leads us to study linear operations L(f) which
are defined for every continuous function f. The definition itself, for the sequence
of surfaces $\{S_n\}$, coincides with what is known as weak convergence of
the operations $L_{S_n}(f)$, that is to say the convergence for each f of the sequence
of numbers defined by these operations on f.

We shall suppose the functions f defined in the whole space of the five variables
x, y, z, p, q as we clearly may; but we shall see in the next paragraph that no
genuine linear operations, and no genuine weakly convergent sequences of
linear operations, can exist, which are not wholly independent of the values taken
by the functions f outside some fixed sphere $x^2 + y^2 + z^2 + p^2 + q^2 \le k^2$. This

6 By a domain we mean a closed set of positive plane measure whose boundary is of
plane measure zero.

6a The principal of these changes consists in interpreting distance of two points to mean
minimum length of a path in A joining them.

7 Young [13].





requires on the one hand that the domain A must be interior to a fixed circle
of the x, y plane, and on the other hand that p, q be bounded, i.e. that the class
of surfaces considered be Lipschitzian.

4. Linear operations in a certain type of space 

Among the spaces which have become of interest in recent years, the best
known are the Banach spaces, of which a particular case is the space whose
elements are the continuous functions defined in a closed n-dimensional sphere.
The space which is relevant in this note is one whose elements are the continuous
functions defined in the whole of a five-dimensional space; in this case, any
element f can be regarded as specified by means of a sequence $\{f_n\}$, where $f_n$
denotes the continuous function defined in a sphere of radius n and fixed center,
which coincides in its sphere with the corresponding values of f.

We formulate the results of this paragraph rather more generally, in a space
whose elements f are specified by sequences $\{f_n\}$, where $f_n$ is a point which lies
in a Banach space, possibly variable with n. Writing $|f_n|$ for the norm, or
magnitude, of $f_n$ in its space, we denote by $Q_n(f)$ the greatest of the first n numbers
$|f_\nu|$; in the special case which mainly concerns us $Q_n(f)$ is the maximum
absolute value of the function in the sphere of radius n. We write further

(4.1) $Q(f) = \sum_{n=1}^{\infty} 2^{-n}\,\frac{Q_n(f)}{1 + Q_n(f)},$

and we define the distance of two elements f, g to be $Q(f - g)$. In addition to
verifying the usual laws of a distance, we observe at once from the definition of
Q(f) and from the fact that $Q_n(f)$ increases with n, that
(4.2) If $Q_N(f) \le 2^{-N}$ for some N, then $Q(f) \le 2^{-(N-1)}$.
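The implication (4.2) can be checked numerically. The sketch below assumes that the distance (4.1) has the usual Fréchet form $Q(f) = \sum_n 2^{-n} Q_n(f)/(1 + Q_n(f))$; this form is an assumption made for illustration, chosen because it is consistent with (4.2). The helper name `Q_upper` is hypothetical.

```python
def Q_upper(q_values):
    """Upper bound for Q(f) computed from the first numbers Q_1(f) <= Q_2(f) <= ...

    Assumes Q(f) = sum_{n >= 1} 2^{-n} Q_n(f) / (1 + Q_n(f)).
    Each omitted term is below 2^{-n}, so the omitted tail is below 2^{-len(q_values)}.
    """
    head = sum(2.0 ** -(n + 1) * q / (1.0 + q) for n, q in enumerate(q_values))
    tail = 2.0 ** -len(q_values)
    return head + tail

if __name__ == "__main__":
    N = 5
    # Q_n(f) = 2^{-N} for n <= N, and large afterwards.
    q = [2.0 ** -N] * N + [10.0] * 40
    print(Q_upper(q), 2.0 ** -(N - 1))
```

Under this assumption the first N terms contribute less than $2^{-N}$ in total and the remaining terms at most $2^{-N}$, which is how the bound $2^{-(N-1)}$ arises.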

This being so, let us consider the linear functions L(f) defined in our space of
elements f. We shall suppose the space complete when the distance is defined as
above, a condition that is satisfied by the space of the continuous functions of
five unlimited real variables. We shall further restrict each L(f) to be measurable B,
and therefore continuous 8 in f. Moreover, the absolute value |L(f)|
being continuous also, an important result 9 enables us to assert that, for a sequence
$\{L_m(f)\}$ of such functions, which is bounded for each f of a set of the
second category, there exists a sphere of elements f in which the sequence is
uniformly bounded.

Writing $f = f_0 + g$ where $f_0$ is the center of this sphere, we see that the sequence
$\{L_m(g)\} = \{L_m(f) - L_m(f_0)\}$ is uniformly bounded in the appropriate sphere of
elements g with the origin as center, and a fortiori in the set $[Q_N(g) \le 2^{-N}]$,
which by (4.2) is a subset of this sphere if N is large. By homogeneity, it
follows that there is a constant K such that for every f, and for all m,

$|L_m(f)| \le K\cdot Q_N(f).$


8 Banach [1] p. 23 theorem 4.

9 Banach [1] p. 19 theorem 11.





We have thus proved the following theorem. 

(4.3) If the sequence of the values of the linear functions $L_m(f)$ is bounded for
each f of a set of the second category, then there exist constants K and N
independent of f and m, such that

(4.4) $|L_m(f)| \le K\cdot Q_N(f).$

As an immediate corollary, obtained by taking $\{L_m(f)\}$ to consist of repetitions
of the linear function L(f) defined for all f, we get

(4.5) A linear function L(f) defined for all the sequences $f = \{f_n\}$ depends only
on a finite number of the $f_n$.

In fact, if the points f and g denote two such sequences having the same N
first terms, (4.4) applied to the point $f - g$, for which we have $Q_N(f - g) = 0$,
gives $L_m(f - g) = 0$ where $L_m(f)$ is now a repetition of L(f), and so L(f) = L(g).

A set of linear functions L(f) will be termed compact if every sequence of linear
functions belonging to the set contains a subsequence which converges weakly.
The following result is again a corollary of theorem (4.3).

(4.6) The linear functions L(f) of a compact set depend only on a bounded number
of the $f_n$.

For otherwise, we could form a sequence $L_n(f)$ of functions of the set such
that (4.4) were false when m = N = n. This sequence would have to be
unbounded at some f and so could not be compact, contrary to the hypothesis.

5. Generalized surfaces 

We have already associated the notion of convergence of surfaces with the 
expressions F(S) rather than with a geometrical representation of the surfaces 
concerned. Since the notion of generalized surface is to be attached to that of 
convergence, it is the process of calculation of the relevant F(S) that we shall 
really be generalizing, rather than the geometrical representation. 

Now this process of calculation is mainly that of passing from an arbitrary
function of five variables f(x, y, z, p, q) which we supposed continuous, to a
function of two variables $\bar f(x, y)$ which we rendered measurable in A, by substituting
for the three variables z, p, q the corresponding functions of x, y on S.
From the function $\bar f$ we then passed to F(S) by integration. In generalizing this
process, we place, for reasons sufficiently explained elsewhere, 10 the substitution
of z(x, y) for z on a wholly different footing to that of its gradient for p, q. We
still substitute for z a function z(x, y) of which we say that it defines the track
of the generalized surface S*, but instead of substituting similarly for p, q we
form a certain average of the values assumed by f at the various p, q.

By an average, or more precisely a linear average or linear mean, of a continuous
function g(p, q) of the variables p, q, we designate a functional M(g),


10 Young [13] p. 231. 





linear in g, non-negative for non-negative g, and which satisfies the condition
M(g) = 1 when g is the constant unity.

We term measurable in x, y, an average which varies according to the values
of the additional parameters x, y, if for every continuous g which depends on
p, q only, the number M(g) obtained by forming the average at x, y defines a
measurable function of x, y. As shown by McShane, 11 this number then still
defines a measurable function when g is a continuous function of all four
variables x, y, p, q.

This being so, we are now in a position to state our definitions. A generalized
surface is the system defined by the following pair of elements: (i) a real function
z(x, y), absolutely continuous in the sense of Tonelli; 12 (ii) an average M defined
for points x, y of A and measurable in x, y, which operates on each continuous
function of p, q, and which satisfies for almost all x, y of A the condition that for
the two functions g(p, q) = p and g(p, q) = q the averages M(p), M(q) are the
components of the gradient of z(x, y). It should be added that we identify two
generalized surfaces when they have identical tracks z(x, y) and for almost all
x, y identical averages M. In the sequel, we shall often assume, without saying
so explicitly, that the average has been suitably modified at the points x, y of a
set of plane measure zero.

The generalized surface S* will be termed Lipschitzian with the constant K,
if at each point x, y the average M(g) of an arbitrary function g(p, q) is wholly
independent of the values taken by g(p, q) outside the circle of radius K whose
center is the origin of the p, q plane.

With the definition of a generalized surface S*, we give also that of integral
over S* of the continuous function f(x, y, z, p, q), i.e. the meaning to be attached
to the symbols

$F(S^*) = L_{S^*}(f) = \iint_A \bar f(x, y)\,dx\,dy.$

We do this by stipulating that $\bar f(x, y) = M(g)$ where g = g(p, q) is the value at
the point x, y of the function g(x, y, p, q) obtained by substituting z(x, y) for z
in f(x, y, z, p, q). The fact that M is measurable in x, y ensures that the function
$\bar f$ is so too.

Let us observe that an ordinary surface S and the corresponding function F(S)
are special cases of the above definitions: the average M then consists for almost
all x, y in substituting in the function concerned the gradient of z(x, y) for p, q.
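A small numerical experiment may make the definition concrete. The Python sketch below is an illustration under simplifying assumptions, not a construction taken from the text: the surfaces depend on x only over the unit square, so the double integral reduces to a single integral and the variable q is dropped. The ordinary surfaces are sawtooth functions with slopes +1 and -1, and the candidate generalized surface has track z = 0 and average M(g) = (g(1) + g(-1))/2, so that M(p) = 0 is the gradient of the track. All function names are hypothetical.

```python
import math

def sawtooth(n, x):
    """Value z and slope p at x of a sawtooth with n teeth on [0, 1], slopes +1 and -1."""
    u = (x * n) % 1.0
    return (u / n, 1.0) if u < 0.5 else ((1.0 - u) / n, -1.0)

def F_ordinary(f, n, m=20000):
    """Midpoint rule for the integral over [0, 1] of f(x, z_n(x), z_n'(x))."""
    total = 0.0
    for i in range(m):
        x = (i + 0.5) / m
        z, p = sawtooth(n, x)
        total += f(x, z, p)
    return total / m

def F_generalized(f, m=20000):
    """Same integral for the generalized surface: track z = 0, M(g) = (g(1) + g(-1)) / 2."""
    total = 0.0
    for i in range(m):
        x = (i + 0.5) / m
        total += 0.5 * (f(x, 0.0, 1.0) + f(x, 0.0, -1.0))
    return total / m

if __name__ == "__main__":
    f = lambda x, z, p: math.cos(p) + z * z + x * p
    for n in (5, 50):
        print(n, F_ordinary(f, n) - F_generalized(f))
```

The tracks of the sawtooth surfaces tend uniformly to zero, yet their gradients do not converge; the integrals nevertheless approach the value computed with the average M, which is the behaviour the definition of generalized surface is designed to capture.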

6. Convergence of generalized surfaces 

We now extend to the case of generalized surfaces the definition of convergence
proposed in §2, and at the same time avoid conflict with the definition
which is usual in Analysis. We shall consider only Lipschitzian generalized surfaces,
this restriction being necessary in view of (4.7), in order that the repetitions
of a given generalized surface constitute a sequence which converges
according to our definition. We say that the sequence of generalized surfaces
$\{S_n^*\}$ converges if the sequence $\{L_{S_n^*}(f)\}$ converges weakly, i.e. converges for every
continuous function f(x, y, z, p, q).

11 McShane [4] p. 517 Lemma 3.1; the proof is given in the case of a single parameter t,
but extends without difficulty to two parameters x, y.

12 Saks [8] p. 169.

Just as we distinguish between a generalized surface S* and its track, we distinguish
also between convergence of generalized surfaces and convergence of
their tracks, the latter notion being defined to be uniform convergence of the
corresponding functions z(x, y), as in the case of the usual definition of convergence
of surfaces, which we thus identify with, and shall refer to in the
sequel as, convergence of the tracks of surfaces.

Let us observe further that our definition of convergent sequence of generalized
surfaces contains no reference to a limit. We shall see in §7, that the space of
Lipschitzian generalized surfaces is complete, so that this kind of convergence
may eventually be identified with the more familiar notion of convergence to a
limit. In the meantime let us state explicitly that the term compact set denotes
a set in which all infinite subsets contain corresponding convergent sequences,
and that we do not restrict these sequences to possess limits at the moment.
In the writer's opinion, this interpretation of the notion of compactness is in
any case preferable to the one usually adopted, since this notion then becomes
an intrinsic property of a set instead of a property which depends partly on a
set and partly on the space.

This being so, we have the following theorem of compactness. 

(6.1) A system of generalized surfaces is compact if and only if their tracks lie in a
fixed sphere and their Lipschitz constants are bounded.

The pair of conditions stated is clearly equivalent to the single condition
which restricts the corresponding linear operations $L_{S^*}(f)$ to be wholly independent
of the values of the continuous functions f outside a fixed sphere of the
x, y, z, p, q space. The necessity of these conditions therefore follows from (4.6).
To prove their sufficiency, we suppose them satisfied, so that the linear operations
$L_{S^*}(f)$ may be regarded as defined in the Banach space consisting of all
functions of x, y, z, p, q which are continuous in a fixed sphere; the conclusion
then reduces to a corollary of a well known theorem. 13

We now consider some consequences of the notion of convergence. If S* is
any generalized surface, and we write as usual

$L_{S^*}(f) = \iint_A \bar f(x, y)\,dx\,dy,$

for the corresponding linear operation, we define more generally, for any interval
$\Delta$ of the x, y plane, the linear operation additive in $\Delta$

$L_{S^*}(f; \Delta) = \iint_{A\cdot\Delta} \bar f(x, y)\,dx\,dy.$


13 Banach [1] p. 123 theorem 3.





We remark that for any fixed function f the operations corresponding to a compact
system of generalized surfaces are equi-continuous in $\Delta$. In fact, denoting
by K(f) the maximum modulus of the function f in a fixed sphere of the x, y, z,
p, q space outside which the values of f are irrelevant by the preceding theorem,
we clearly have

(6.2) $|L_{S^*}(f; \Delta) - L_{S^*}(f; \Delta')| \le K(f)\cdot|\Delta - \Delta'|.$

We shall now establish the following result.

(6.3) If the generalized surfaces $S_n^*$ converge, then the operations $L_{S_n^*}(f; \Delta)$ converge
for each f and uniformly in $\Delta$.

It is sufficient by equi-continuity, to establish the convergence for each f and
each $\Delta$. We enclose the interval $\Delta$ in a concentric similar interval $\Delta'$ of slightly
larger area, and we denote by g a continuous function of x, y, z, p, q whose
absolute value is majorized by that of f, such that for all z, p, q we have g = f
when x, y lies in $\Delta$, and g = 0 when x, y lies outside $\Delta'$. The existence of a
function g with these properties is clear.

This being so, it follows from (6.2) that the expressions

$L_{S_n^*}(f; \Delta) = L_{S_n^*}(g; \Delta)$

differ from the corresponding terms of the convergent sequence formed by the
expressions

$L_{S_n^*}(g) = L_{S_n^*}(g; \Delta'),$

at most by the arbitrarily small amount

$K(g)\cdot|\Delta' - \Delta| \le K(f)\cdot|\Delta' - \Delta|,$

and hence that they too must converge by Cauchy’s convergence test. This
completes the proof.

(6.4) If the generalized surfaces $S_n^*$ converge, so do their tracks.

To see this, we observe in the first place that the functions $z_n$ defining these
tracks have by (6.1) uniformly bounded gradient, since their gradients at any
point are averages of bounded sets of values of p, q. These functions are therefore
equi-continuous, so that to show that they converge uniformly we need only
verify that they converge at an everywhere dense set or that they converge in
some average sense in the x, y plane. The latter is the case by (6.3) applied to
the function f(x, y, z, p, q) = z, which states that the double integrals over $\Delta$
of the functions $z_n(x, y)$ converge uniformly.

7. The completeness theorem 

We say that the sequence $\{S_n^*\}$ has the limit S* if the operations $L_{S_n^*}(f)$ have
the limit $L_{S^*}(f)$ for each f. Evidently the existence of the limit implies the
convergence of the sequence; we shall establish the converse.

(7.1) A convergent sequence of generalized surfaces has a limit.

We denote by z(x, y) the limit of the corresponding tracks, and by $L(f; \Delta)$
that of the corresponding operations additive in $\Delta$, the limits existing by (6.4)
and (6.3). We denote further by $Q(f; \Delta)$ the maximum modulus of the function
f when z = z(x, y), when x, y lies in $A\cdot\Delta$, and when p, q is bounded in
magnitude by the upper bound of the Lipschitz constants of our sequence, finite
by (6.1).

Clearly the operation $L(f; \Delta)$ is linear in f, additive in $\Delta$, and we have

(7.2) $|L(f; \Delta)| \le |\Delta|\cdot Q(f; \Delta),$

as the limit of similar relations.

It follows from (7.2) that $L(f; \Delta)$ is the double integral over $\Delta$ of any network
derivative M(f; x, y), and that the latter exists almost everywhere, as the limit,
for a suitable sequence of intervals $\Delta$ tending to the point x, y, of the expression

(7.3) $L(f; \Delta)/|\Delta|$

when the function f is kept fixed.

For almost every x, y, this limit exists simultaneously for all functions f of a
given enumerable everywhere dense set of functions. Since the functionals
(7.3) are, by (7.2), equi-continuous in f for the various $\Delta$, the derivative in
question also exists simultaneously for all f at any such point x, y; and it thus
represents, for almost any fixed x, y, a linear operation in f which, as we deduce
at once from (7.2), is majorized by the maximum modulus, for the same system
of p, q as before, of the function f when x, y and z = z(x, y) are fixed. Since
it has this majorant, the operation M(f; x, y) depends on the values of f as
function of p, q only, the other variables being kept fixed.

This being so, let us write simply M(f) for the operation just defined at almost
every x, y, and let us complete its definition for the remaining points x, y by
choosing it for instance to mean the value of f for p = q = 0.

We verify at once that M(f), at the point x, y and z = z(x, y), is a non-negative
operation when f is non-negative for all p, q at this point, and that for the
function f = 1 we have M(f) = 1. Hence M(f) is an average. Moreover we
have already remarked that its double integral over any $\Delta$ coincides with $L(f; \Delta)$.
In order to identify the latter with the operation $L_{S^*}(f; \Delta)$, and thus to complete
the proof of (7.1), it is now sufficient to verify that a generalized surface S* is
defined by the track z(x, y) and the average M(f). This will be the case if the
averages of the functions f = p and f = q are the components of the gradient of
z(x, y) at almost all points x, y of the domain of definition A.

To establish this last result, it is sufficient to show that for these two functions
f and for an arbitrary interval $\Delta$ contained in the domain A, the expression
$L(f; \Delta)$ coincides with the double integral over $\Delta$ of the corresponding component
of the gradient. Since the corresponding relations are true for the sequence of
which L is the limit, it only remains to verify that the double integral over $\Delta$
of the gradients of the tracks of the generalized surfaces of our sequence tend to
the double integral of the gradient of z(x, y). This is evidently the case in
view of the uniform convergence of these tracks.





8. Closure of the class of ordinary surfaces 

From the theorem of completeness just established, it follows in particular 
that the limit of any convergent sequence of ordinary surfaces necessarily exists 
and represents a generalized surface whose Lipschitz constant is majorized by 
the upper bound of those occurring in the sequence. We now consider the 
converse question, whether every Lipschitzian generalized surface is expressible 
as the limit of a suitable sequence of ordinary surfaces. Our next theorem 
shows that the answer is in the affirmative, even with additional restrictions on 
this sequence. 

(8.1) The class of the generalized surfaces whose tracks have a fixed boundary and
whose Lipschitz constants do not exceed N, is identical with the closure of the
subclass consisting of ordinary surfaces.

It is sufficient to prove that, given any generalized surface S* with a Lipschitz
constant not exceeding N, there exists a sequence of ordinary surfaces $S_n$ with
Lipschitz constants not exceeding N and boundary coinciding with that of the
track of S*, such that, for all continuous functions f of the variables x, y, z, p, q,

(8.2) $L_{S^*}(f) = \lim_n L_{S_n}(f).$

Let us say that a generalized surface $S^{*\prime}$ is an $\varepsilon$-approximation to the generalized
surface S*, if there exists a positive function $\eta$ of $\varepsilon$ only, which tends to 0
with $\varepsilon$, such that

(8.3) $|L_{S^{*\prime}}(f) - L_{S^*}(f)| \le K\eta\,Q(f) + K\cdot\omega(f; \eta),$

where K is some fixed constant, and where Q(f) and $\omega(f; \eta)$ denote, for the
function f, the maxima, in some fixed sphere of x, y, z, p, q, of the modulus and
of the oscillation between two points distant less than $\eta$.

In order to establish (8.2), and so (8.1), it is evidently enough to show that,
for any positive $\varepsilon$, there is an $\varepsilon$-approximation to the given generalized surface S*
provided by an ordinary surface subject to the conditions stated for $S_n$. We
shall divide the proof into several stages which occupy §§9-13.

9. A lemma due to McShane 14 

In the sequel we frequently have to enlarge and fit together disconnected 
portions of surfaces, without unduly increasing their Lipschitz constants. In so 
doing, we make use of the following lemma. 

(9.1) Suppose that a function $z(x, y)$, originally given in the plane set $E$, satisfies,
for all pairs of points of this set, the Lipschitz condition

(9.2) $|z(x', y') - z(x'', y'')| \le K \cdot \rho,$

where $\rho$ denotes the distance of the pair of points, and where $K$ is a fixed constant.
Then there exists a function $z(x, y)$, defined in the whole plane and coinciding with

14 McShane [7] p. 838, Theorem I.





L. C. YOUNG 


the given function in the set $E$, such that (9.2) continues to hold with this same value
of $K$ for all pairs of points of the plane.

For a proof of this lemma, we refer the reader to McShane.¹⁴ We shall deduce
a slight refinement of some importance to us. We remark that the continuation
whose existence is asserted is by no means unique, although there are certain
points of the plane, those of the original set $E$ in particular, at which all such
continuations necessarily coincide. These points constitute a set $\bar{E}$ that we
shall call the set of unicity for the required continuation.
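The following is a minimal sketch, not taken from the paper, of how a continuation of the type (9.1) and the set of unicity can be computed for a finite set $E$. It assumes the standard explicit formulas for the smallest and largest $K$-Lipschitz continuations (supremum of $z(Q) - K\rho$ and infimum of $z(Q) + K\rho$ over $Q$ in $E$); every continuation lies between these two, so the set of unicity is where they agree. The sample points, the values, and all function names are hypothetical.

```python
import math

def lower_extension(points, values, K):
    """Smallest K-Lipschitz continuation: max over E of z(Q) - K*|P - Q|."""
    def z(x, y):
        return max(v - K * math.hypot(x - px, y - py)
                   for (px, py), v in zip(points, values))
    return z

def upper_extension(points, values, K):
    """Largest K-Lipschitz continuation: min over E of z(Q) + K*|P - Q|."""
    def z(x, y):
        return min(v + K * math.hypot(x - px, y - py)
                   for (px, py), v in zip(points, values))
    return z

# Hypothetical data: three points of E whose values satisfy (9.2) with K = 1.
E = [(0.0, 0.0), (2.0, 0.0), (0.0, 1.0)]
vals = [0.0, 2.0, 0.5]
K = 1.0
z_lo = lower_extension(E, vals, K)
z_hi = upper_extension(E, vals, K)

def in_set_of_unicity(x, y, tol=1e-12):
    """A point belongs to the set of unicity when all continuations agree there."""
    return z_hi(x, y) - z_lo(x, y) <= tol
```

In this example the first two points of $E$ realize equality in (9.2), so the whole segment joining them belongs to the set of unicity, while a point such as (1, 1) does not.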

(9.3) There exists a continuation for which, in addition, every point outside
the set of unicity is the center of a circle in which (9.2) is valid with some smaller
constant in place of $K$.

To see this, we first observe that, for any given point not in the set of unicity $\bar{E}$,
the validity of (9.2) with two distinct values $c \pm \varepsilon K$ for $z(x', y')$ when we take
$x', y'$ at this point and $x'', y''$ to lie in the set $E$, leads to the inequality

$|z(x'', y'') - c| \le K \cdot (\rho - \varepsilon),$

from which it follows that the function with the constant value $c$ in the circle of
radius $\varepsilon$ around the given point, and with the value of the original function
$z(x, y)$ at each point of $E$, satisfies (9.2) in the set consisting of $E$ together with
this circle. Consequently, applying (9.1) to this function, we see that given
any point not in $\bar{E}$ there exists a continuation which is constant in some
neighborhood of this point.

This being so, we can cover the complement of the set of unicity $\bar{E}$ by a sequence
of such neighborhoods. Denoting by $z_n(x, y)$ the continuation which has a constant
value in the $n$-th neighborhood, the function

$\sum_n 2^{-n} \cdot z_n(x, y)$

constitutes a continuation for which in the $n$-th neighborhood (9.2) is valid with
the constant $K$ reduced to $(1 - 2^{-n}) \cdot K$. This proves our assertion (9.3).

The reduction of $K$ is not in general possible in the set of unicity $\bar{E}$. In fact,
we have the following result.

(9.4) A point of the set of unicity $\bar{E}$ at a positive distance from the set $E$ is always
contained in a segment consisting of points of $\bar{E}$ for every pair of which the
relation (9.2) is an equality.

Moreover, at such a point, the gradient of $z(x, y)$, if it exists, has magnitude $K$.

We remark, in the first place, that at a point $x', y'$ of the set of unicity the
value $z(x', y')$ is the only one which satisfies (9.2) for every $x'', y''$ of $E$; for
otherwise we could add this point to the set $E$ with either of two values for
$z(x', y')$ and use (9.1) to form two different continuations corresponding to these
two values.

Hence, if $P$ is a point of the set of unicity $\bar{E}$ distant $d > 0$ from the set $E$, then
given any positive $\varepsilon$ there exists a point $x'', y''$ of $E$ such that, if $\rho''$ is its
distance from $P$,

$|z(x'', y'') - z(P)| > K \cdot (\rho'' - \varepsilon);$



GENERALIZED SURFACES IN CALCULUS OF VARIATIONS 




and from this last relation together with (9.2) we deduce that if $x', y'$ is the
point on the segment joining $P$ to $x'', y''$ at distance $d$ from the point $P$,

$|z(x', y') - z(P)| > K \cdot (d - \varepsilon).$

These relations are true for any continuation $z(x, y)$. Making $\varepsilon$ tend to 0,
we see by continuity that there exists a point $Q$ independent of the continuation
chosen such that

$|z(Q) - z(P)| = K \cdot d,$

the distance of the points $P$ and $Q$ being again equal to $d$. It is clear moreover
that the values of $z(x', y')$ corresponding to the various continuations can only
differ by at most $K\varepsilon$, and hence that the value $z(Q)$ is unique. Finally it is clear
from our last equation that (9.2) becomes an equality for every pair of points of
the segment $PQ$, and that this segment lies in the set of unicity $\bar{E}$.

It now only remains to prove the second part of (9.4).¹⁵ Let $R$ denote the
point at distance $h$ from $P$ along a parallel to the $x$-axis, and let $k$ denote the
distance $RQ$. We write further $\varphi$ and $\theta$ for the angles made by $RQ$ and $PQ$
with the $x$-axis, and $m$ for the ratio $|z(P) - z(R)|/h$. We have then

$h \cdot m = |z(P) - z(R)| \ge |z(P) - z(Q)| - |z(R) - z(Q)|$

$\ge K \cdot d - K \cdot k = K \cdot \{(h^2 + k^2 + 2hk\cos\varphi)^{1/2} - k\}.$

Hence, dividing by $h$ and making $h$ tend to 0, we find, since $k$ then tends to $d$
and $\varphi$ to $\theta$, that the lower limit of $m$ is not less than the value

$K \cdot \lim \dfrac{(h^2 + k^2 + 2hk\cos\varphi)^{1/2} - k}{h} = K \cdot \cos\theta.$

Hence the $x$-component of the gradient of $z(x, y)$ at $P$ is not less than $K\cos\theta$,
and similarly the $y$-component is not less than $K\sin\theta$. This completes the
proof.


10. Preliminary reduction 

We say that the generalized surface $S^*$ coincides with its track at the point
$x, y$ or in the subset $B$ of $A$ if at this point, or almost everywhere in this subset $B$,
the average $M(f)$ defining $S^*$ coincides for every continuous $f$ with the result of
substituting for $p, q$ in $f$ the gradient of the track of $S^*$. We remark that this is
certainly the case at any point $x, y$ at which the gradient has the magnitude $N$
of the Lipschitz constant of $S^*$.

At such a point the gradient has the form $N\cos\alpha, N\sin\alpha$, whence for the function
$g(p, q) = N - p\cos\alpha - q\sin\alpha$ we have $M(g) = 0$; and since, in the circle of relevant
$p, q$, this function is non-negative and vanishes only for $p, q = N\cos\alpha, N\sin\alpha$, it follows
in the usual way that $M(g) = 0$ for every $g(p, q)$ which vanishes at this point; so that, for
any $g$ with the value $c$ at this point, we have $M(g - c) = 0$, i.e. $M(g) = c$, as asserted.

15 The writer owes the second part of this proof to Mr. J. H. Whiteman.





This being so, we denote by $E$ the boundary of $A$. The function $z(x, y)$ which
defines the track of $S^*$ is a continuation of the type (9.1) for its boundary values
in $E$, the constant $K$ of (9.2) being replaced by $N$. We form also a second
continuation $Z(x, y)$, possibly identical with $z(x, y)$, of the type (9.3).

In the set of unicity $\bar{E}$ we certainly have $z(x, y) = Z(x, y)$. In this set,
moreover, since $E$ is closed and of measure zero, almost all points have positive
distances from $E$; and since the gradient exists almost everywhere, its magnitude
is $N$ by (9.4) for almost all points of $\bar{E}$. Thus $S^*$ coincides with its track in $\bar{E}$.

We write $P, Q$ for the gradient of $Z(x, y)$ where it exists, and choose $P = Q = 0$
otherwise. We write further

$p', q' = \varepsilon P + (1 - \varepsilon)p,\ \varepsilon Q + (1 - \varepsilon)q$

and for any function $g(p, q)$ we define

$M'(g) = M(g')$ where $g'(p, q) = g(p', q')$.

With these definitions, the function

$z'(x, y) = \varepsilon Z(x, y) + (1 - \varepsilon)z(x, y)$

has almost everywhere the gradient $M'(p), M'(q)$. Finally we denote by $A'$
a subset of $A - \bar{E}$, where $\bar{E}$ is the set of unicity, consisting of a finite sum of
squares in each of which $Z(x, y)$ satisfies a Lipschitz condition with a constant
smaller than $N$. Since every point of $A - \bar{E}$ is the center of such a square by (9.3)
we can choose $A'$ so that its measure differs by less than $\varepsilon$ from that of $A - \bar{E}$.

In $A'$ the magnitude of $P, Q$ has an upper bound $N''$ less than $N$, so that for
any $p, q$ of magnitude not exceeding $N$ the magnitude of $p', q'$ cannot exceed
the value $N' = \varepsilon N'' + (1 - \varepsilon)N$ which is less than $N$. Hence for the points
$x, y$ of $A'$, the average $M'(g)$ is wholly independent of the values taken by
$g(p, q)$ at the points $p, q$ of magnitude exceeding $N'$, where $N'$ is less than $N$.

This being so, we now define the generalized surface $S^{*\prime}$ in the following
manner: the track of $S^{*\prime}$ is $z'(x, y)$ in $A$; the average is $M'(f)$ at each point $x, y$
of $A'$; while in $A - A'$ the surface $S^{*\prime}$ coincides with its track. This generalized
surface has the property that its portion in $A'$ has a Lipschitz constant less than $N$,
while the remaining portion is an ordinary surface with the Lipschitz constant $N$.
We shall see that it constitutes an $\varepsilon$-approximation to $S^*$.

To this effect we observe that the relevant pairs of values $p', q'$ and $p, q$ have
the difference $\varepsilon(P - p), \varepsilon(Q - q)$ whose magnitude cannot exceed the number
$\eta = 2\varepsilon N$; therefore in $A'$ we have

$|M'(f) - M(f)| = |M(f' - f)| \le \operatorname{Max} |f(p', q') - f(p, q)| \le \omega(f; \eta).$

In the set of unicity $\bar{E}$ on the other hand the two generalized surfaces coincide
with the same track, while the remainder of $A$, the set $A - A' - \bar{E}$, has small
measure. From these facts, we easily obtain a relation of the type (8.3), which
shows that $S^{*\prime}$ is an $\varepsilon$-approximation to $S^*$, as asserted.

Since this new generalized surface has moreover the same boundary as the 





original one, we may substitute it for the latter in the proof of (8.1). In doing
so we may at the same time replace the domain $A$ by the finite sum of squares $A'$,
and since the latter can be approximated by a sum of squares without common
points it will be sufficient to deal with the case in which $A'$ consists of a single
square.

11. Approximation by averages restricted to a finite set 

We now make a second reduction which has an interest independent of (8.1).
We consider a triangulation of the $p, q$ plane by equilateral triangles of side $\varepsilon$,
and denote by $p_m, q_m$, where $m = 1, 2, \dots, n$, the vertices whose magnitude
does not exceed $N$. An average $M(g)$ will be said to be restricted to the $p_m, q_m$
if it is of the form

$\sum a_m\, g(p_m, q_m)$ where $a_m \ge 0$ for each $m$ and where $\sum a_m = 1$.

We shall obtain the following result.

(11.1) A generalized surface $S^*$ with Lipschitz constant less than $N$ has an
$\varepsilon$-approximation $S^{*\prime}$ with the same track and with a Lipschitz constant less than $N$,
such that, at each point $x, y$, the average $M'$ defining $S^{*\prime}$ is restricted to the $p_m, q_m$.

We shall suppose $\varepsilon$ so small that the triangles with the vertices $p_m, q_m$ enclose
all values of $p, q$ which are relevant to the averages $M(f)$ of the definition of $S^*$.
We write in the form

$(p, q) = \sum b_m \cdot (p_m, q_m),$

where only the vertices of the triangle containing $p, q$ have non-zero coefficients,
the barycentric representation of the point $p, q$ in its triangle or edge of triangle.
We define further, for any $g(p, q)$,

$M'(g) = M(g')$, where $g'(p, q) = \sum b_m\, g(p_m, q_m);$

here $M$ is the average occurring in the definition of $S^*$ at any point $x, y$ and this
average is defined for the function $g'$ since the latter is continuous, as we easily
verify.
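As an illustration only, the barycentric representation and the function $g'$ can be sketched as follows for a single triangle of the triangulation. The triangle, the sample point, the side length, and the function names are hypothetical; the coefficient formula is the usual one for barycentric coordinates in the plane.

```python
import math

def barycentric(pt, tri):
    """Coefficients b_1, b_2, b_3 with pt = sum b_m * vertex_m and sum b_m = 1."""
    x, y = pt
    (x1, y1), (x2, y2), (x3, y3) = tri
    det = (y2 - y3) * (x1 - x3) + (x3 - x2) * (y1 - y3)
    b1 = ((y2 - y3) * (x - x3) + (x3 - x2) * (y - y3)) / det
    b2 = ((y3 - y1) * (x - x3) + (x1 - x3) * (y - y3)) / det
    return b1, b2, 1.0 - b1 - b2

def g_prime(g, pt, tri):
    """g'(p, q) = sum b_m * g(p_m, q_m): the value of g interpolated from the vertices."""
    return sum(bm * g(pm, qm) for bm, (pm, qm) in zip(barycentric(pt, tri), tri))

# Hypothetical equilateral triangle of side eps in the p, q plane.
eps = 0.1
tri = [(0.0, 0.0), (eps, 0.0), (eps / 2, eps * math.sqrt(3) / 2)]
pt = (0.04, 0.02)  # a point inside the triangle

def affine(p, q):
    """An affine function of p, q; such functions satisfy g' = g."""
    return 2.0 * p - 3.0 * q + 1.0
```

For an affine $g$, in particular for $g = p$, $g = q$, and $g = 1$, the interpolated value `g_prime(g, pt, tri)` equals `g(*pt)`, which is the property used to show that $M'$ has the same track as $M$.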

Now in the particular case where $g$ is one of the functions $p$, $q$ or the constant 1,
we see at once that $g' = g$, and so $M(g) = M'(g)$. Hence, remembering that
the coefficients of the barycentric expression are non-negative, $M'$ is an average
with which we can associate the same track as that of $S^*$ to form an $S^{*\prime}$.
Evidently $S^{*\prime}$ has a Lipschitz constant less than $N$, so that it only remains to prove
that $S^{*\prime}$ is an $\varepsilon$-approximation to $S^*$. This however follows at once from the
relations

$|M'(g) - M(g)| = |M(g - g')| \le \sum b_m \cdot \omega(g; \varepsilon) = \omega(g; \varepsilon).$

It is convenient to prove also the following more special result. 

(11.2) If the generalized surface $S^*$ has a piece-wise constant gradient, then the
approximation $S^{*\prime}$ satisfying the requirements made in (11.1) can be made to fulfill
the additional condition that the finite sum $\sum a_m \cdot g(p_m, q_m)$ which constitutes the





average $M'(g)$ at the point $x, y$ shall have coefficients $a_m$ which are piece-wise
constant in $x, y$.

To see this, we divide any interval of constancy of the gradient of the track of
$S^*$ into similar intervals of diameter less than $\varepsilon'$ and denote by $\bar{a}_m$ the function
of $x, y$ which is the mean value of $a_m$ in the interval of the division which
contains the point $x, y$. As $\varepsilon'$ tends to 0, the $\bar{a}_m$ tend boundedly to the $a_m$ almost
everywhere and the convergence is uniform in a subset whose measure is as
close as we please to that of the set of definition $A$. From this it follows that
the generalized surface derived from $S^{*\prime}$ by replacing the $a_m$ by $\bar{a}_m$ fulfills all
our requirements.


12. Reduction to a plane track 

The principal part of this section does not concern generalized surfaces. We 
prove the following theorem. 

(12.1) The track of a Lipschitzian surface with constant less than $N$ is expressible
as the limit of a certain track $z_\varepsilon(x, y)$ in such a manner that its gradient is almost
everywhere the limit of that of $z_\varepsilon(x, y)$, where the track defined by $z_\varepsilon(x, y)$ satisfies
the following additional conditions:

(i) its Lipschitz constant is less than $N$,

(ii) outside measure $\varepsilon$ its gradient is piecewise constant.

Moreover when these conditions are satisfied, any $S^*$ with a Lipschitz constant less
than $N$ and with the original track has a corresponding $\varepsilon$-approximation with
Lipschitz constant less than $N$ whose track is $z_\varepsilon(x, y)$. Finally the theorem
remains true if we restrict throughout the track $z_\varepsilon(x, y)$ to have the same boundary as
the original track.

The final remark concerning the boundary is an immediate corollary of the 
rest of the theorem in view of (9.1), if we apply it in the first place to a domain 
interior to A whose distance from the boundary of A is positive, and then 
continue the track obtained suitably to agree with the original track on the 
boundary of A. We may therefore ignore this part of the assertion, and since 
we can continue the given track by (9.1) into a square containing A we may 
suppose that A is such a square. 

This being so, let $z(x, y)$ define the original track in $A$ and let $\varphi(x, y)$ be the
vector function consisting of the gradient of $z$, which exists almost everywhere
in $A$. There exists a vector function $\Phi(x, y)$ piecewise constant in $A$; we shall
suppose it constant in the squares $R$ of side $a$, such that

(12.2) $|\varphi - \Phi| < \varepsilon^{10}$ except possibly in measure $< \varepsilon^{10}$.

Let us denote by $B$ the set of those squares $R$, if any, for which a subset of $R$
of measure not less than $a^2 \cdot \varepsilon^8$ is occupied by points for which the relation
$|\varphi - \Phi| < \varepsilon^{10}$ is false. Clearly this relation is then false in a subset of
$B$ of measure not less than $\varepsilon^8 |B|$, so that by (12.2) we must have $|B| < \varepsilon^2$.

Consider now any square R not in B . We have then 





(12.3) $|\varphi - \Phi| < \varepsilon^{10}$ except possibly in measure $< a^2 \cdot \varepsilon^8$ of $R$.

We denote by $E$ the linear set contained in the projection of $R$ on the $y$-axis
which consists of the values of $y$ for each of which the relation $|\varphi - \Phi| < \varepsilon^{10}$
is false for a corresponding set of $x$ of linear measure not less than $\varepsilon^4 a$. It
follows from (12.3) that the linear measure of $E$ is less than $\varepsilon^4 a$. The same being
true when the axes of $x$ and $y$ are interchanged, it follows that we can determine
a series of parallels to the two axes, forming a grating over $R$, in such a way
that $R$ is divided into rectangles of sides less than $\varepsilon^4 a$ by this grating and that
on any segment of this grating in $R$ we have

(12.4) $|\varphi - \Phi| < \varepsilon^{10}$ except possibly in linear measure $< \varepsilon^4 a$.

Now any point $x, y$ of $R$ can be joined to the center of $R$ by a path $C$ consisting
of two portions of segments of the grating and two small segments, the lengths
of the latter being less than $\tfrac{1}{2}\varepsilon^4 a$. It follows from (12.4) that on the path $C$ we
have

(12.5) $|\varphi - \Phi| < \varepsilon^{10}$ except possibly in linear measure $< 3\varepsilon^4 a$.

This being so, let $Z(x, y)$ denote the linear function defined in the interior of
$R$ to agree with $z(x, y)$ at the center of $R$ and to have the constant gradient $\Phi$.
The difference of the functions $z$ and $Z$ at the point $x, y$ cannot exceed in
magnitude the integral on $C$ of that of their gradients. Since these gradients are
clearly less than $N$ (if $\varepsilon$ is sufficiently small), it follows from (12.5) that

(12.6) $|z(x, y) - Z(x, y)| \le \varepsilon^3 \cdot a,$
provided that $\varepsilon$ is sufficiently small.

For every $R$ not in $B$, we now denote by $R'$ the concentric square of side
$(1 - 2\varepsilon^2) \cdot a$; we denote further by $B'$ the set of the points of the squares $R'$,
and by $B''$ the set of the points of the boundaries of the corresponding squares
$R$. In the set $B + B' + B''$ we define the function $z_\varepsilon(x, y)$ by stipulating that

(12.7) $z_\varepsilon(x, y) = z(x, y)$ in $B + B''$, $z_\varepsilon(x, y) = Z(x, y)$ in $B'$.

This function satisfies in $B + B' + B''$ the Lipschitz condition (9.2) with a
constant $K$ less than $N$; for we need only verify this when the two points $x', y'$
and $x'', y''$ lie respectively in $B'$ and in $B + B''$. But since the distance $\rho$ of
the points then exceeds $\varepsilon^2 \cdot a$, we have

$|z_\varepsilon(x', y') - z_\varepsilon(x'', y'')| = |Z(x', y') - z(x'', y'')|$

$\le \varepsilon^3 \cdot a + |z(x', y') - z(x'', y'')|$

$\le \varepsilon \cdot \rho + K \cdot \rho < N \cdot \rho,$

which is the required verification.

This being so, we can continue the function $z_\varepsilon(x, y)$ by (9.1) so that it still
satisfies the same Lipschitz condition. Hence, observing that any point of $A$





outside $B'$ is distant at most $\varepsilon^2 \cdot a$ from one of coincidence of the functions $z_\varepsilon(x, y)$
and $z(x, y)$, we see that the two functions differ by at most $2N \cdot \varepsilon^2 a < \varepsilon \cdot a$ in
$A - B'$ and so by (12.6) throughout $A$.

Now in $B'$, except possibly in a subset of measure $\varepsilon^{10}$, the gradient of $z_\varepsilon(x, y)$
is piecewise constant and differs by less than $\varepsilon^{10}$ from the gradient of $z(x, y)$.
To show that $z_\varepsilon(x, y)$ fulfils the requirements of our theorem, it is sufficient to
verify that the complement of $B'$ in $A$ has measure less than $\varepsilon - \varepsilon^{10}$. This is
the case since

$|A - B'| = |B| + \{1 - (1 - 2\varepsilon^2)^2\} \cdot |A - B| \le \varepsilon^2 + 4\varepsilon^2 |A|.$

Now finally let $z_\varepsilon(x, y)$ denote any track fulfilling the requirements of the
theorem, and let $S^*$ denote a generalized surface with the original track.

We denote by $\eta$ the lower bound of the numbers such that outside measure
$\eta$ at most

(12.8) the difference of the functions $z(x, y)$, $z_\varepsilon(x, y)$ and that of their gradients
are both of magnitude at most $\eta$.

From our hypotheses, it follows that $\eta$ tends to 0 with $\varepsilon$. We write further $A'$
for the set of $x, y$ in which (12.8) holds, and $p', q'$ for the result of translating $p, q$
by the difference of these gradients. Denoting by $M$ the average defining $S^*$
at any point, we define, for any continuous $g(p, q)$, the average

$M'(g) = M(g')$, where $g'(p, q) = g(p', q')$

to be at any point of the set $A'$ the average defining $S^{*\prime}$, and we complete the
latter by making it coincide with its track in $A - A'$.

The surface $S^{*\prime}$ with the track $z_\varepsilon(x, y)$ and the average thus defined is easily
verified to be defined consistently and to furnish the required $\varepsilon$-approximation.

13. Completion of the proof of (8.1) 

The various reductions made possible by the results of the preceding sections
show that it is now sufficient to establish (8.1) in the case of a generalized surface
$S^*$ defined for the points $x, y$ of a square $A$, possessing a Lipschitz constant less
than $N$, possessing in view of (12.1) (where we may now choose $A$ to be a square
of constancy of the gradient) a plane track; and finally possessing an average
at $x, y$ restricted to at most $n$ vectors $p_m, q_m$, this average being of the form
$\sum a_m\, g(p_m, q_m)$ where the coefficients $a_m$ may be supposed piece-wise constant,
and so, by subdivision, absolutely constant, in view of (11.2). With these
additional hypotheses, which now cause no real loss of generality, we shall
prove our theorem by an induction with respect to the number $n$.

It will be sufficient to prove that $S^*$ has an $\varepsilon$-approximation consisting of a
generalized surface $S^{*\prime}$ formed of a finite number of portions, in each of which the
corresponding average $M'(g)$ is restricted to at most $n - 1$ of the $p_m, q_m$.

Moreover we may clearly suppose that $A$ is a triangle instead of a square.
Further, we need not insist that the boundary of $S^{*\prime}$ is to coincide with that of
$S^*$, since this can always be arranged by means of a subsequent modification of





$S^{*\prime}$ near its boundary, a modification which is made possible by (9.1) and which
is, by now, sufficiently clear.

This being so, we may suppose by trivial transformations that the track of
$S^*$ is in the $x, y$ plane, and that for a particular value of $m$, the value $m = 1$
say, we have $q_m = 0$ and $p_m > 0$. We suppose also that the coefficients $a_m$
in the average $M$ are all different from 0, for if this were not the case there
would be nothing to prove.

We write $k$ for an index to be summed from $k = 1$ to $k = n - 1$, and we
define numbers $a_0$ and $p_0$ by the equations

$a_0 = \sum a_{k+1}, \qquad a_0 p_0 = \sum a_{k+1}\, p_{k+1}.$

These numbers then satisfy the equations

$a_0 + a_1 = 1, \quad a_0 p_0 + a_1 p_1 = 0, \quad a_0 > 0, \quad a_1 > 0.$

We write further $b_k = a_{k+1}/a_0$ and observe that

$\sum b_k\, p_{k+1} = p_0, \qquad \sum b_k\, q_{k+1} = 0.$

We now divide the plane into strips parallel to the $y$-axis whose widths are
alternately $\varepsilon a_0$ and $\varepsilon a_1$, and we define a function of $x$ only, $z'(x)$, linear in each
strip and taking on the edges separating the strips alternately the values 0 and

$\delta = \varepsilon a_0 p_0 = -\varepsilon a_1 p_1,$

from which it follows that the gradient of this function in the strips is alternately
$p_0$ and $p_1$.

We define the generalized surface $S^{*\prime}$ to have in the triangle $A$ the track
$z'(x, y) = z'(x)$ and the average $M'$ given by

$M'(g) = \sum b_k\, g(p_{k+1}, q_{k+1})$

in the strips where the gradient is $p_0$, while at the remaining points the surface
coincides with its track, i.e. $M'(g) = g(p_1, 0)$. Evidently these conditions are
compatible with the definition of our generalized surfaces, since we find that
the functions $g = p$ and $g = q$ have the averages $p_0$ and 0 in the strips where
the gradient is $p_0$. Since the average $M'$ is restricted in each strip to less than
$n$ vectors $p_m, q_m$ it only remains to show that $S^{*\prime}$ is an $\varepsilon$-approximation to $S^*$.
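The bookkeeping of this induction step can be checked numerically. The sketch below is only an illustration under assumed data: the weights and vectors are hypothetical, chosen so that the weighted mean gradient is zero (a track in the $x, y$ plane) with $q_1 = 0$ and $p_1 > 0$. List index 0 in the code stands for the index $m = 1$ of the text, and exact rational arithmetic is used so that the identities hold without rounding.

```python
from fractions import Fraction as F

def split_off_first(a, p):
    """Return a_0, p_0 and the b_k; a[0], p[0] play the role of a_1, p_1."""
    a0 = sum(a[1:])
    p0 = sum(ak * pk for ak, pk in zip(a[1:], p[1:])) / a0
    b = [ak / a0 for ak in a[1:]]
    return a0, p0, b

def zigzag(x, eps, a0, p0, a1, p1):
    """z'(x): slope p0 on strips of width eps*a0, then slope p1 on strips of width eps*a1."""
    period = eps * (a0 + a1)
    t = x % period
    if t <= eps * a0:
        return p0 * t
    return p0 * eps * a0 + p1 * (t - eps * a0)

# Hypothetical average restricted to n = 3 vectors with zero mean gradient.
a = [F(1, 2), F(1, 4), F(1, 4)]
p = [F(1), F(-1), F(-1)]
q = [F(0), F(1), F(-1)]
eps = F(1, 10)

a0, p0, b = split_off_first(a, p)
delta = eps * a0 * p0  # value of z' on every second strip edge
```

With these data $a_0 = 1/2$, $p_0 = -1$ and $b_1 = b_2 = 1/2$, and the function `zigzag` returns to 0 after each pair of strips because $a_0 p_0 + a_1 p_1 = 0$.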

We observe that on the segment parallel to the $x$-axis intercepted by a pair
of consecutive strips at the height $y$, the integrals of the two averages $M(f)$ and
$M'(f)$ are, for any continuous $f(x, y, z, p, q)$, products of the length of this
segment by the expressions of the form

$\sum a_m f(x, y, 0, p_m, q_m)$ and $a_1 f(x', y, z', p_1, q_1) + a_0 \sum b_k f(x'', y, z'', p_{k+1}, q_{k+1}),$

where $x, x', x''$ are intermediate values of $x$, on account of the mean value theorem
of the integral calculus, and where $z', z''$ are values of $z$ which are small with $\varepsilon$.
Since $a_m = a_0 b_{m-1}$, these expressions differ from one another by at most
$\sum a_m \cdot \omega(f; \eta)$, i.e. by $\omega(f; \eta)$ where $\eta$ tends to 0 with $\varepsilon$.





From this it easily follows that the corresponding double integrals over the
triangle $A$ differ by at most $K \cdot \omega(f; \eta)$ and so, that $S^{*\prime}$ is an $\varepsilon$-approximation of
$S^*$, as required. This completes the proof.

14. Existence of an attained minimum. Equivalence of generalized and ordinary problem

Let us suppose that we are given the problem of the minimum of one of our 
functions F(S ) in the class of Lipschitzian ordinary surfaces with assigned 
boundaries. Beside this “ordinary” problem let us consider the “generalized 
problem” of the minimum of the function F(S*) in the corresponding class of 
generalized surfaces S*. 

There exists a sequence of ordinary surfaces $\{S_n\}$ of the class considered, for
which the numbers $F(S_n)$ tend to the minimum of the first problem; but, by
(8.1), this is also the case in the second problem. The two minima are thus
identical, i.e.

(14.1) The ordinary problem is equivalent to the generalized problem.

We have established this result in the case in which the surfaces considered 
are Lipschitzian with unrestricted constants. The argument holds equally 
when their Lipschitz constants are restricted to be less than or at most equal 
to some fixed number K. In the latter case we can assert more. 

(14.2) In the class of surfaces with an assigned boundary whose Lipschitz constants 
do not exceed a fixed number K, the minimum of F(S) is the same for generalized 
surfaces as for ordinary surfaces and there exists a generalized surface in the class 
for which this minimum is attained. 

For by (6.1) the minimizing sequence $\{S_n\}$ is compact. Denoting by $S^*$ the
limit of a subsequence we evidently have $F(S^*) = \operatorname{Lim} F(S_n)$ and this completes
the proof.

Let us remark that in the case of a regular problem, it is easily seen that the 
solution S* must coincide with its track, i.e. reduces to an ordinary surface. 
This does not however include the whole of what is known of the existence theory 
for the regular case, since Tonelli 16 obtained a semi-continuity theorem inde¬ 
pendent of the Lipschitz condition that we have had to assume here. Neverthe¬ 
less, we shall attempt to show in a subsequent note that even for regular prob¬ 
lems the present methods have distinct advantages. 

Capetown, South Africa 


References

[1] Banach, S. Théorie des opérations linéaires, Monografje Matematyczne (Warszawa, 1932).

[2] Hilbert, D. Über das Dirichletsche Prinzip, Jahresber. der Deutschen Math. Vereinigung, Vol. VIII (1900) pp. 184-188.


16 Tonelli [9] p. 342. 





[3] Lebesgue, H. Sur le problème de Dirichlet, Rendiconti del Circolo Matematico di Palermo, Vol. 24 (1907) pp. 371-402.

[4] McShane, E. J. Generalized curves, Duke Math. Jour., Vol. 6 (1940) pp. 513-536.

[5] McShane, E. J. Necessary conditions in the generalized-curve problem of the Calculus of Variations, Ibidem, Vol. 7 (1940) pp. 1-27.

[6] McShane, E. J. Existence Theorems for Bolza problems in the Calculus of Variations, Ibidem, Vol. 7 (1940) pp. 28-61.

[7] McShane, E. J. Extension of range of functions, Bull. of the Am. Math. Soc., Vol. 40 (1934) pp. 837-842.

[8] Saks, S. Theory of the Integral, Monografje Matematyczne, Vol. VII (Warszawa, 1937).

[9] Tonelli, L. Sur la semi-continuité des intégrales doubles du Calcul des Variations, Acta Math., Vol. 53 (1929) pp. 325-346.

[10] Weyl, H. Die Idee der Riemannschen Fläche, Mathematische Vorlesungen an der Universität Göttingen, Vol. V (Teubner, Leipzig 1913).

[11] Weyl, H. Über die neue Grundlagenkrise der Mathematik, Math. Zeits., Vol. 10 (1921) pp. 39-79.

[12] Young, L. C. On approximation by polygons in the Calculus of Variations, Proc. of the Royal Soc. (A), Vol. 141 (1933) pp. 325-341.

[13] Young, L. C. Generalized Curves and the existence of an attained absolute minimum in the Calculus of Variations, Comptes Rendus de la Société des Sciences et des Lettres de Varsovie, Classe III, Vol. 30 (1937) pp. 212-234.

[14] Young, L. C. Necessary conditions in the Calculus of Variations, Acta Math., Vol. 69 (1938), pp. 239-258.



Annals of Mathematics
Vol. 43, No. 1, January, 1942 


FREE LATTICES II 

By Philip M. Whitman 
(Received July 15, 1941) 

In the previous paper under this title [14] the author discussed the basic 
internal structure of free lattices. We assume here a knowledge of the notation 
used in that paper. 1 In the present paper the previous results are slightly 
extended and then applied to obtain further properties of free lattices. We 
study the automorphisms of free lattices (§2), their sublattices (§3), and their 
order topology (§4), where we find that in the terminology of Birkhoff [3] free 
lattices are continuous (perhaps vacuously) but not complete. 

The author is indebted to Prof. Garrett Birkhoff for inspiration and advice 
in the preparation of this paper. 

1. Introduction 

The main definition of [14] will bear repetition: the free lattice generated by
$x_1, x_2, \dots, x_n$ is a lattice generated by them in which there are no laws of equality
except those derivable from the postulates for a lattice. In [14] this lattice was
denoted by $F_n$, since to within isomorphism it depends only on $n$ and not on the
symbols. It now seems advisable to replace $F_n$ by $FL(n)$, and also in some
cases to specify the generators: $FL(x_1, x_2, \dots, x_n)$, or $FL(X)$ where $X =
\{x_1, x_2, \dots, x_n\}$. One may also consider free modular or distributive lattices,
denoted $FM(n)$ and $FD(n)$ and defined as are free lattices except that the
modular or distributive law ([3] pp. 34, 74) is adjoined to the lattice postulates;
cf. Dedekind [7]; Skolem [12]; Birkhoff [1] pp. 450-2, 463-4, [3] pp. 4, 49, 84-85,
[4] p. 451; Church [5]. The existence of free algebras in general is shown in
Birkhoff [2] p. 441.

The main results of the previous paper [14] may be restated (with slight exten¬ 
sions) as follows, starting with the corollary of §2 and the theorem of §3. 

Theorem 1: In $FL(n)$, $x_i \le x_j$ if and only if $i = j$; recursively,

(A) $\prod a_i \le x_j$ if and only if some $a_i \le x_j$;

(B) $x_i \le \sum b_j$ if and only if $x_i \le$ some $b_j$;

(C) $\sum a_i \le b$ if and only if every $a_i \le b$;

(D) $a \le \prod b_j$ if and only if $a \le$ every $b_j$;

(E) $\prod a_i \le \sum b_j$ if and only if $\prod a_i \le$ some $b_j$ or some $a_i \le \sum b_j$.
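Read as a recursive procedure, Theorem 1 decides whether $a \le b$ for any two lattice polynomials. The sketch below is one way to write it, not the author's formulation: generators are represented as strings and joins and meets as tagged tuples, which are representation choices made here for illustration. Rules (C) and (D) are applied first, since they hold whatever the form of the other argument.

```python
def join(*terms):
    """Formal join (the sum of the text) of one or more terms."""
    return ("join", terms)

def meet(*terms):
    """Formal meet (the product of the text) of one or more terms."""
    return ("meet", terms)

def leq(a, b):
    """Decide a <= b in the free lattice by the rules of Theorem 1."""
    if isinstance(a, tuple) and a[0] == "join":   # (C): every a_i <= b
        return all(leq(ai, b) for ai in a[1])
    if isinstance(b, tuple) and b[0] == "meet":   # (D): a <= every b_j
        return all(leq(a, bj) for bj in b[1])
    if isinstance(a, str) and isinstance(b, str): # two generators
        return a == b
    if isinstance(a, str):                        # (B): b is a join
        return any(leq(a, bj) for bj in b[1])
    if isinstance(b, str):                        # (A): a is a meet
        return any(leq(ai, b) for ai in a[1])
    # (E): a is a meet and b is a join
    return any(leq(a, bj) for bj in b[1]) or any(leq(ai, b) for ai in a[1])

def equal(a, b):
    """Equality in the free lattice: a <= b and b <= a."""
    return leq(a, b) and leq(b, a)

# The two sides of the distributive law for generators x, y, z.
lhs = meet("x", join("y", "z"))
rhs = join(meet("x", "y"), meet("x", "z"))
```

Here `leq(rhs, lhs)` holds while `leq(lhs, rhs)` does not, so the distributive law fails in the free lattice on three generators, as expected.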

Theorem 2: Of all the polynomials equal in $FL(n)$ to a given polynomial, there
is one of shortest length, unique to within commutativity and associativity.

This is taken as the canonical form. We write $a \equiv b$ for polynomials $a$ and $b$

1 Or in [3], plus the notation $L(a)$ = length of $a$ for the number of generators appearing
in the polynomial $a$, counting repetitions.




if one is obtainable from the other by commutativity and associativity. As a 
criterion for canonicity, we can weaken the conditions of Corollary (18) of [14] 
slightly: 

Corollary 1.1: If $\sum a_i$ is

$(\prod_k a_{1k}) \cup (\prod_k a_{2k}) \cup \cdots \cup (\prod_k a_{mk}) \cup x_{i_{m+1}} \cup \cdots \cup x_{i_u}$

then $\sum a_i$ is canonical if and only if (A) for all $j$ and $p$, $a_{jp} \not\le \sum a_i$, and (B)
$a_j \le a_k$ implies $j = k$, and (C) every $a_i$ is canonical. Dually for $\prod a_i$. A
generator by itself is canonical.

Proof: To prove this equivalent to the corollary cited, we need only to show
that the failure of (b) in that corollary implies the failure of one of the conditions
of Corollary 1.1. Thus suppose that in $FL(n)$, $a_i \le \sum_{k \ne i} a_k$.

Case 1: $a_i \equiv x_j$. Then by Theorem 1 B, $x_j \le a_k$ for some $k$ contrary to (B).

Case 2: $a_i \equiv \prod_j a_{ij}$. Then by Theorem 1 E, either $\prod_j a_{ij} \le a_k$ for some $k$
contrary to (B), or

$a_{ij} \le \sum_{k \ne i} a_k \le \sum_{\text{all } k} a_k$

for some $j$, contrary to (A).

Corollary 1.2: If $\sum a_i = \prod b_j$ in $FL(n)$ and $\sum a_i$ is canonical, then
$\prod b_j = b_k$ for some $k$.

Proof: This is a companion to Corollary (19) of [14] and is proved similarly. 

Lemma 1.1: In $FL(n)$ for $n$ finite, given the polynomial $a$ and index $p$ then
either $x_p \le a$ or $a \le \sum_{i \ne p} x_i$, the two being mutually exclusive.

Proof: This is obvious for $L(a) = 1$; we proceed by induction on $L(a)$.

Case 1: $a \equiv \prod a_j$. If $a \not\le \sum_{i \ne p} x_i$, then for all $j$, $a_j \not\le \sum_{i \ne p} x_i$, so by
induction $x_p \le a_j$ for all $j$; thus $x_p \le \prod a_j = a$ as desired.

Case 2: $a \equiv \sum a_j$. If $a \not\le \sum_{i \ne p} x_i$, then for some $k$, $a_k \not\le \sum_{i \ne p} x_i$, so by
induction $x_p \le a_k \le a$ and thus $x_p \le a$. Thus at least one of the alternatives
holds; they are mutually exclusive since $x_p \le a \le \sum_{i \ne p} x_i$ would imply
$x_p \le \sum_{i \ne p} x_i$ and then by Theorem 1, $x_p \le x_i$ for some $i$ not $p$ which is
impossible.

This is a special case of a “splitting” of a lattice [13], a subject which will be
discussed in another paper.² An immediate consequence of this lemma is
Corollary 3 of §4 of [14], for suppose that for all $p$, $a \not\le \sum_{i \ne p} x_i$; then by Lemma
1.1 we have $x_p \le a$ for all $p$, so $a \ge \sum_{i=1}^{n} x_i$; since the latter is the unit element
of $FL(n)$, we have $a = \sum_{i=1}^{n} x_i$, as asserted. The dual of this corollary says
that the “points” or “atoms” ([3], p. 10) of $FL(n)$ are the $\prod_{i \ne p} x_i$. We can
now strengthen Theorem 3 of [14], not only by extending it farther down in the
lattice (which seems relatively uninteresting) but also in the content of its
statements about the relation of other elements of the lattice to certain special
elements.

Theorem 3: In $FL(n)$ for $n$ finite, if $a$ is given then either $a = \sum_{i=1}^{n} x_i$ or

2 Soon to be submitted for publication.





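$a = \sum_{i \ne p} x_i$ for some p, or else for some distinct p and q, $a = b_{pq}$ or $a \le c_{pq}$,

where

$b_{pq} = \Big(\sum_{i \ne p} x_i\Big) \cap \Big(\sum_{i \ne q} x_i\Big)$

$c_{pq} = \Big(\sum_{i \ne p,q} x_i\Big) \cup \sum_{r \ne p,q} \Big[\Big(\sum_{i \ne p} x_i\Big) \cap \Big(\sum_{j \ne q} x_j\Big) \cap \Big(\sum_{k \ne r} x_k\Big)\Big].$

Moreover, if $a < \sum_{i \ne p} x_i$ then $a \le b_{pq}$ for the same p and some q, and likewise if $a < b_{pq}$ then $a \le c_{pq}$.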

Proof: In view of Corollary 3 of §4 of [14], discussed above, it suffices to prove the last sentence of the theorem; the rest will then follow. Hence we suppose $a < \sum_{i \ne p} x_i$. By Lemma 1.1, if q is given then either $a \le \sum_{i \ne q} x_i$ or $x_q \le a$. If the latter held for all q different from p then $\sum_{q \ne p} x_q \le a \le \sum_{i \ne p} x_i$ and so $a = \sum_{i \ne p} x_i$, contrary to hypothesis. Hence for some q, $a \le \sum_{i \ne q} x_i$ as well as $a \le \sum_{i \ne p} x_i$, and so $a \le b_{pq}$ for some q other than p, as desired. That $a < b_{pq}$ implies $a \le c_{pq}$ is proved by similar but more elaborate methods; since we do not use this result we omit the details.

For reference we note certain immediate consequences of Theorem 1; n is not 
required finite. 

Corollary 1.3: In FL(n), (A) $\prod a_i \le \sum y_j$ if and only if for some i, $a_i \le \sum y_j$, and dually, where the $y_j$ are a non-void subset of the $x_i$; (B) if $\sum a_i = x_j$ then $a_k = x_j$ for some k.

Proof: (A) If $a_i \le \sum y_j$ for no i, then by Theorem 1 E we would have $\prod a_i \le y_j$ for some j, and so for some i, $a_i \le y_j \le \sum y_j$. (B) By Theorem 1 B, $x_j \le a_k$ for some k; $a_i \le x_j$ for all i, so $a_k = x_j$ by (2) of [14].

2. Automorphisms of FL(n) 

In some cases an algebra is the free algebra ([3] p. 4; [2] p. 441) for more than one set of generators;³ for instance the free group generated by x and y is the same as that generated by x and $x^{-1}y$. We assert that this is not the case with free lattices. First however we must clarify the meaning of “$\{w_i\}$ is a set of generators.” In [14] we said that a set $\{w_i\}$ generates a lattice L if the polynomials in the $w_i$ exhaust L. Now if L consists say of all polynomials in some elements $x_i$, do we demand that every polynomial a in the $x_i$ should be formally identical with some polynomial in the $w_i$ after substituting the values of the $w_i$ in terms of the $x_i$? No, this seems excessive; rather we demand that some polynomial in the $w_i$ should equal a. (As a matter of fact the next theorem holds whichever interpretation is used.) Likewise, as regards automorphisms we consider only their effect on classes of equal elements, not how they may permute polynomials within such a class.

In this section we consider two sets of generators to be the same if their 
elements are equal by pairs. 

Definition 2.1: A set of generators of a lattice L is redundant if some member 
of the set can be omitted without affecting the property that the set generates L. 

³ The author is not aware of any precise reference in print, but this is well known. It is mentioned by J. Dyer-Bennet, thesis, A free algebra with three operations, unpublished, Harvard University, 1940.



FREE LATTICES II 




Theorem 4: FL(n) is the free lattice on a unique set X of generators; any set of generators must contain X; and X is the only irredundant set of generators.

Proof: First we observe that FL(n) could not be the free lattice on a redundant set of generators, for if it were then by Definition 2.1 one of the generators would equal a polynomial in the others, which is not true in a free lattice. Now let FL(n) be $FL(x_1, \dots, x_n)$; that is, it consists of all lattice polynomials in the $x_i$, subject to the laws stated in Theorem 1; and suppose that it is also a lattice generated by $w_1, \dots, w_m$. We assert that given k, then $w_i = x_k$ for some i. For suppose not: for some k and all i, $w_i \ne x_k$; then we may proceed by induction on L(a) where a is any polynomial in the $w_i$. Suppose $a \ne x_k$ for all a with L(a) < h. Then suppose $a = x_k$ and L(a) = h; $a \equiv \sum a_i$ or dually. By Corollary 1.3 B, $a_i = x_k$ for some i, since a is indirectly a polynomial in the $x_i$, contrary to hypothesis of induction. Thus by induction we would have that no polynomial in the $w_i$ equals $x_k$, so the $w_i$ could not be a set of generators. Hence $w_i = x_k$ for some i, given k, proving the second clause of the theorem. If moreover $\{w_i\}$ is irredundant then $\{w_i\}$ must coincide with $\{x_i\}$ to within equality, which proves the last clause. Then the first part of the proof proves the first clause. Q.E.D.

This states only that a different irredundant set of generators cannot generate all of FL(n); as we shall see in the next section, they may generate a sublattice isomorphic to it. Another consequence is this: any automorphism of a lattice is by definition determined by the images of the generators, so by Theorem 4 an automorphism of FL(n) must merely permute the $x_i$ among themselves. On the other hand, by the symmetry of the postulates any such permutation determines an automorphism. Hence

Corollary 2.1: The group of automorphisms of FL(n) is the symmetric group 
on n symbols. 

3. Sublattices of FL(n)

We observe first that in free lattices the relations between polynomials in $x_1, \dots, x_k$ are independent of the presence or absence of other generators:

Lemma 3.1: If $f(x_1, \dots, x_k) \le g(x_1, \dots, x_k)$ for polynomials f and g in FL(n), then this relation is also true in FL(k), and conversely.

Proof: By Theorem 1 the conditions in the two cases are the same.

Lemma 3.2: If $\{x_1, \dots, x_r\}$, $\{y_1, \dots, y_s\}$, and $\{z_1, \dots, z_t\}$ are disjoint subsets of the generators of FL(n), and

$f(x_1, \dots, x_r, y_1, \dots, y_s) \le g(x_1, \dots, x_r, z_1, \dots, z_t)$

for polynomials f and g in FL(n), then also

$f(x_1, \dots, x_r, 1, \dots, 1) \le g(x_1, \dots, x_r, 0, \dots, 0)$

in FL(n).⁴

⁴ 0 and 1 stand for the zero and unit elements of the lattice. If not present, they may for the purposes of this lemma be temporarily adjoined; cf. MacNeille [9] p. 443. They will nearly always be absorbed in reducing the new polynomials to simpler form.





Proof: This is trivial for L(f) + L(g) = 2. Denoting the results of the substitutions by bars, we make an induction on L(f) + L(g). If $g \equiv \prod g_i$ then by Theorem 1 $f \le g_i$ for all i, so by induction $\bar f \le \bar g_i$ for all i, and hence $\bar f \le \prod \bar g_i = \bar g$. If $g \equiv \sum g_i$ and $f \equiv \prod f_i$, then $f \le g_i$ for some i, or $f_i \le g$ for some i, so $\bar f \le \bar g_i$ or $\bar f_i \le \bar g$, and thus $\bar f \le \bar g$. The other cases are similar. Q.E.D.

Let us now consider the sublattices of $FL(x_1, \dots, x_n)$ for n fixed. In particular, we inquire which of these sublattices are again free lattices on suitable generators. Since a sublattice is often most easily designated by listing its generators, we shall find it convenient to make the following

Definition 3.1: The set $\{u_i\}$ is free if the sublattice generated by it is the free lattice with these generators.

The question then becomes: what conditions on the $u_i$ will insure that they form a free set? In Theorem 5 it appears that the answer is simply that with respect to their joins and meets they shall behave as do the $x_i$. The situation in free lattices is thus unlike that in free groups, where every subgroup is again free.⁵

Theorem 5: A subset $U = \{u_i\}$ of the elements of FL(n) is free if and only if $u_j \le \sum_{i \in S} u_i$ and its dual (where S is a finite set of indices) each imply $j \in S$.

Proof: We wish to show these conditions necessary and sufficient that in the given lattice $FL(n) \equiv FL(X)$, the sublattice generated by U is FL(U), or equivalently that this sublattice is isomorphic to $FL(Y) \equiv FL(y_1, \dots, y_k)$ where k is the cardinal power of the set U. Thus we wish to show that the conditions of the theorem are necessary and sufficient in order that for all polynomials f and g,

$f(u_{i_1}, \dots, u_{i_j}) \le g(u_{i'_1}, \dots, u_{i'_h})$

in FL(X) if and only if

$f(y_{i_1}, \dots, y_{i_j}) \le g(y_{i'_1}, \dots, y_{i'_h})$

in FL(Y), or (more briefly) in order that $f(U) \le g(U)$ in FL(X) if and only if $f(Y) \le g(Y)$ in FL(Y). The necessity of the given conditions is apparent from the fact that by Theorem 1 the $y_i$ behave in this manner. For sufficiency we assume the conditions of the theorem and prove the isomorphism by induction on L(f(Y)) + L(g(Y)). If L(f(Y)) + L(g(Y)) = 2 then L(f(Y)) = L(g(Y)) = 1. By hypothesis $u_i \le u_j$ if and only if i = j, which is true if and only if $y_i \le y_j$, as desired. Proceeding by induction,

Case 1: $f(Y) \equiv y_j$. Now $y_j \le g(Y)$ implies $u_j \le g(U)$ because the lattice generated by U is a homomorphic image of FL(Y) by [2], Theorem 9 on p. 441. Conversely if the latter holds then the former does, for if not then by Lemmas 1.1 and 3.1 applied to the finite set R consisting of the $y_i$ which appear in either f(Y) or g(Y),

$g(Y) \le \sum_{i \in R - j} y_i$

in $FL(y_i;\ i \in R)$ and so also in FL(Y). But then

$u_j \le g(U) \le \sum_{i \in R - j} u_i$

contrary to hypothesis.

⁵ Nielsen [10], Schreier [11]; Levi [8] gives a new proof. The falsity for free lattices is shown by taking any two-element sublattice.

Case 2: $g(Y) \equiv y_j$. Dual of Case 1.

Case 3: $f(Y) \equiv \sum f_i(Y)$. Then $\sum f_i(U) \le g(U)$ implies by Theorem 1 that $f_i(U) \le g(U)$ for all i, so by induction $f_i(Y) \le g(Y)$ for all i, whence $f(Y) \le g(Y)$, while the converse follows as at the beginning of Case 1.

Case 4: $g(Y) \equiv \prod g_i(Y)$. Dual of Case 3.

Case 5: $f(Y) \equiv \prod f_i(Y)$ and $g(Y) \equiv \sum g_i(Y)$. Then $f(U) \le g(U)$ implies by Theorem 1 that $f_i(U) \le g(U)$ for some i or else $f(U) \le g_i(U)$ for some i, and by induction we again get $f(Y) \le g(Y)$, and conversely as above. Q.E.D.

In particular we can verify that in FL(3) these conditions are satisfied by $x_1 \cup (x_2 \cap x_3)$, $x_2 \cup (x_1 \cap x_3)$, and $x_3 \cup (x_1 \cap x_2)$, or still better,

Lemma 3.3: In FL(3) the following are a free set:

$u_1 = [x_1 \cap (x_2 \cup x_3)] \cup [x_2 \cap (x_1 \cup x_3)]$

$u_2 = [x_1 \cap (x_2 \cup x_3)] \cup [x_3 \cap (x_1 \cup x_2)]$

$u_3 = [x_1 \cup (x_2 \cap x_3)] \cap [x_2 \cup (x_1 \cap x_3)]$

$u_4 = [x_1 \cup (x_2 \cap x_3)] \cap [x_3 \cup (x_1 \cap x_2)].$

The proof is a matter of straightforward verification of the conditions of Theorem 5, using Theorem 1. We give one example: $u_4 \nleq u_1 \cup u_2 \cup u_3$. For otherwise one of the following must hold: (i) $u_4 \le u_1$; (ii) $u_4 \le u_2$; (iii) $u_4 \le u_3$; (iv) $x_1 \cup (x_2 \cap x_3) \le u_1 \cup u_2 \cup u_3$; (v) $x_3 \cup (x_1 \cap x_2) \le u_1 \cup u_2 \cup u_3$, by Theorem 1 E. But if (i) held, then either $u_4 \le x_1 \cap (x_2 \cup x_3)$ or $u_4 \le x_2 \cap (x_1 \cup x_3)$ or $x_1 \cup (x_2 \cap x_3) \le u_1$ or $x_3 \cup (x_1 \cap x_2) \le u_1$, which are contradicted respectively by the facts that $u_4 \nleq x_1$, $u_4 \nleq x_2$, $x_1 \nleq u_1$, and $x_3 \nleq u_1$, so (i) is impossible. Similarly so is (ii). If (iii) held, then $u_4 \le x_2 \cup (x_1 \cap x_3)$; thus $u_4 \le x_2$ or $u_4 \le x_1 \cap x_3$ or $x_1 \cup (x_2 \cap x_3) \le x_2 \cup (x_1 \cap x_3)$ or $x_3 \cup (x_1 \cap x_2) \le x_2 \cup (x_1 \cap x_3)$. These are respectively contradicted by the facts that $u_4 \nleq x_2$, $u_4 \nleq x_1 \cap x_3$, $x_1 \nleq x_2 \cup (x_1 \cap x_3)$, and $x_3 \nleq x_2 \cup (x_1 \cap x_3)$, so (iii) is false. (iv) and (v) are false since $x_1 \nleq u_1 \cup u_2 \cup u_3$ and $x_3 \nleq u_1 \cup u_2 \cup u_3$. Thus the typical condition chosen holds; the others are similar. Q.E.D.
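The verification left to the reader in Lemma 3.3 is mechanical, so it can be sketched in code. The block below is a hedged illustration, assuming that Theorem 1 of [14] reduces to the recursive comparison implemented in `leq`; the tuple encoding and the helper names `gen`, `join`, `meet`, `leq` and `is_free` are hypothetical. For a finite set it is enough to test each element against the join (and, dually, the meet) of all the others, since a smaller index set S gives a smaller join.

```python
from functools import lru_cache

# Encoding (illustrative): ("x", i) generator, ("j", a, b) join, ("m", a, b) meet.
def gen(i):
    return ("x", i)

def join(first, *rest):
    """Left-nested join of one or more terms."""
    out = first
    for t in rest:
        out = ("j", out, t)
    return out

def meet(first, *rest):
    """Left-nested meet of one or more terms."""
    out = first
    for t in rest:
        out = ("m", out, t)
    return out

@lru_cache(maxsize=None)
def leq(a, b):
    """Decide a <= b in a free lattice by recursion on the two polynomials."""
    if a[0] == "j":
        return leq(a[1], b) and leq(a[2], b)
    if b[0] == "m":
        return leq(a, b[1]) and leq(a, b[2])
    if a[0] == "x" and b[0] == "x":
        return a[1] == b[1]
    if a[0] == "x":
        return leq(a, b[1]) or leq(a, b[2])
    if b[0] == "x":
        return leq(a[1], b) or leq(a[2], b)
    return leq(a[1], b) or leq(a[2], b) or leq(a, b[1]) or leq(a, b[2])

def is_free(us):
    """Conditions of Theorem 5 for a finite list of at least two elements."""
    for j, u in enumerate(us):
        others = us[:j] + us[j + 1:]
        # u must not lie below the join of the others, nor above their meet
        if leq(u, join(*others)) or leq(meet(*others), u):
            return False
    return True

x1, x2, x3 = gen(1), gen(2), gen(3)
u1 = join(meet(x1, join(x2, x3)), meet(x2, join(x1, x3)))
u2 = join(meet(x1, join(x2, x3)), meet(x3, join(x1, x2)))
u3 = meet(join(x1, meet(x2, x3)), join(x2, meet(x1, x3)))
u4 = meet(join(x1, meet(x2, x3)), join(x3, meet(x1, x2)))
```

With these definitions `is_free([u1, u2, u3, u4])` evaluates the eight conditions of the lemma, and `leq(u4, join(u1, u2, u3))` is the single comparison worked out in the text.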

Theorem 6: FL(3) has FL(n) as a sublattice, for any finite or countable n.

Proof: By Lemma 3.3, FL(3) has a free set $\{u_1, u_2, u_3, u_4\}$. We proceed by induction on the number of elements in a free set $\{u_i\}$. Suppose $\{u_1, u_2, \dots, u_k\}$ is a free set of elements of FL(3). By Lemma 3.1, $\{u_{k-2}, u_{k-1}, u_k\}$ is free, so by Lemma 3.3 there is a free set $\{v_1, v_2, v_3, v_4\}$ of polynomials in $u_{k-2}$, $u_{k-1}$, and $u_k$. Then

$\{u_1, u_2, \dots, u_{k-3}, v_1, v_2, v_3, v_4\}$

is free, for we may readily verify the conditions of Theorem 5:

Condition 1: $u_1 \nleq u_2 \cup u_3 \cup \dots \cup u_{k-3} \cup v_1 \cup v_2 \cup v_3 \cup v_4$.

For otherwise

$u_1 \le u_2 \cup \dots \cup u_{k-3} \cup v_1 \cup v_2 \cup v_3 \cup v_4 \le u_2 \cup \dots \cup u_k$

contrary to hypothesis that $\{u_1, \dots, u_k\}$ was free.

Condition 2: $v_4 \nleq u_1 \cup u_2 \cup \dots \cup u_{k-3} \cup v_1 \cup v_2 \cup v_3$.

For otherwise (since $v_i$ is a polynomial in $u_{k-2}$, $u_{k-1}$, and $u_k$ alone) by Lemma 3.2, $v_4 \le v_1 \cup v_2 \cup v_3$ contrary to the choice of the $v_i$.

The other conditions follow by symmetry and duality. Thus the conditions for induction are fulfilled and we obtain a countable set of $u_i$. This whole set together is free, for the conditions of Theorem 5 involve only finite sets and so hold by construction. Q.E.D.

Corollary 3.1: For $n \ge 3$ and any finite or countable k, FL(n) has FL(k) as a sublattice.

Theorem 7: FL(n) has no sublattice isomorphic to the free modular or distributive lattice on more than two generators.

Proof: Otherwise FL(n) would have FM(3) or FD(3) as a sublattice, for if, say, it had FM(4) as a sublattice, then the sublattice generated by the first three of these four generators would satisfy the definition of free modular lattice (this argument assumes well-ordering in the infinite case); a similar argument would have sufficed for Lemma 3.1. But it is known ([7] p. 246; [3] pp. 49, 84) that the generators of FM(3) and FD(3) satisfy the conditions of Theorem 5, so the sublattice they generate would be FL(3) rather than FM(3) or FD(3).

4. Intrinsic topology of FL(n) 

In a lattice each pair of elements, a and b, must have a least upper and greatest lower bound ([3] p. 16, [14] p. 325). By the associative law this must also hold for any finite number of elements, but it may or may not hold for an infinite number (e.g., the set of all integers is a lattice under the customary linear ordering, but there is no greatest; on the other hand any class of subclasses of a fixed class does have a least upper bound under set inclusion). In case the least upper bound or greatest lower bound of a set $\{a_i\}$ exists we may denote it $\sup \{a_i\}$ or $\inf \{a_i\}$ respectively ([3] p. 16).

Definition 4.1 ([3] p. 27): In a lattice, $\{a_i\}$ is said to (o)-converge to a if sequences $\{u_i\}$ and $\{v_i\}$ exist such that

$u_i \le u_{i+1} \le a_{i+1} \le v_{i+1} \le v_i$

for all i, and $\sup \{u_i\} = \inf \{v_i\} = a$. Then we write $a_i \to a$, $u_i \uparrow a$, and $v_i \downarrow a$.

In terms of this order convergence, one may study the topology of a lattice: the intrinsic topology, as Birkhoff calls it, since it is defined in terms of the inclusion relation without external considerations. For instance, the lattice operations are said to be continuous⁶ if $a_i \to a$ and $b_i \to b$ imply $a_i \cup b_i \to a \cup b$ and dually. For this, it is sufficient ([3] p. 30) that $a_i \downarrow a$ imply $a_i \cup b \downarrow a \cup b$ and dually. If $\sup \{a_i\}$ and $\inf \{a_i\}$ exist for every set $\{a_i\}$ then the lattice is said to be complete.

We have already (§4 of [14]) obtained a simple topological result for FL(n), in finding that in certain cases one element covers another; thus FL(n) is in a sense not everywhere dense. Moreover, in Theorem 9 below we find that FL(n) is not complete: we exhibit sets without suprema. This leaves open the question of the existence of an infinite ascending chain with a supremum; if there is none then Theorem 8 is vacuous.

Theorem 8: FL(n) is continuous. 

Proof: Suppose $a_i \downarrow a$. By [3] (p. 30) it suffices to show $a_i \cup b \downarrow a \cup b$. But $a \cup b$ is obviously a lower bound to the $a_i \cup b$; it remains to be shown greatest, that is, that if c is any lower bound to the $a_i \cup b$ then $c \le a \cup b$. But if this should fail (if $c \nleq a \cup b$) then $c \cup a \cup b > a \cup b$ and $c \cup a \cup b$ is again a lower bound, so it suffices to show by induction on L(c) that it is impossible to have $c > a \cup b$ and c a lower bound to the $a_i \cup b$.

Case 1: $c \equiv \sum_{j=1}^{m} c_j$, where we may suppose that no $c_j$ is a join. Then $c \le a_i \cup b$ for all i, so $c_j \le a_i \cup b$ for all i and j. Hence by Theorem 1, given i and j, either (1) $c_j \le a_i$, or (2) $c_j \le b$, or (3) $c_j \equiv \prod_k c_{jk}$ and for some k, $c_{jk} \le a_i \cup b$, the possibility (3) being excluded if $c_j$ is a generator. If (3) holds for some j and infinitely many i, it holds for that j and infinitely many i and some fixed k, since m is finite. By monotonicity of the $a_i$, $c_{jk} \cup \sum_{t \ne j} c_t$ is a shorter lower bound than c and contains $a \cup b$, contrary to hypothesis of induction. Thus, given j, it must be that (1) holds for infinitely many i or (2) does; perhaps both. In the former case $c_j \le a$, in the latter $c_j \le b$, so $c = \sum c_j \le a \cup b$ contrary to hypothesis that $c > a \cup b$.

Case 2: $c \equiv \prod_{j=1}^{m} c_j$. Then $\prod c_j \le a_i \cup b$ for all i, so given i either (1) $c \le a_i$, or (2) $c \le b$, or (3) for some j, $c_j \le a_i \cup b$. If (3) holds for infinitely many i, then (since m is finite) it holds for some fixed j and infinitely many i, and hence for that j and all i, and $c_j$ is a shorter lower bound, $c_j > a \cup b$, contrary to induction. Otherwise (1) or (2) holds for infinitely many i, so $c \le a$ or $c \le b$, contrary to $c > a \cup b$.

Case 3: c is a generator. This is included in Case 2, with (3) impossible. This starts the induction. Thus the theorem holds.

This leaves open the question of whether there are any infinite sequences which have limits. We proceed to show that at any rate in FL(3) not every infinite sequence has a limit. For our horrible example we consider the elements which Birkhoff used⁷ to show that FL(3) has an infinite number of distinct elements.

⁶ Not to be confused with the use by some authors of “continuous” for what we call “complete,” nor with the use of “continuous” in connection with a metric in a lattice ([3], p. 43).

Let $t_1 \equiv x_1$, $t_{6i+2k} \equiv t_{6i+2k-1} \cap x_{2k}$, $t_{6i+2k+1} \equiv t_{6i+2k} \cup x_{2k+1}$ for i = 0, 1, 2, ..., where the subscripts on x are taken modulo 3. Thus for instance

$t_7 \equiv (((((x_1 \cap x_2) \cup x_3) \cap x_1) \cup x_2) \cap x_3) \cup x_1.$

By Theorem 1, $t_1 \le t_7$, and by induction $t_{6h+k} \le t_{6i+k}$ in FL(3), if h < i. Birkhoff's example just cited shows that equality cannot hold, as could also be shown by Theorem 1 and induction, so $t_{6h+k} < t_{6i+k}$ in FL(3) if h < i. We have thus six infinite strictly monotone increasing chains: $t_1, t_7, t_{13}, \dots$; $t_2, t_8, t_{14}, \dots$, etc.
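The first few links of these chains can be checked directly. The sketch below assumes, as before, that Theorem 1 of [14] is the recursive comparison coded in `leq`; the encoding and the names `gen`, `leq`, `chain` and `strictly_below` are illustrative only. It builds the polynomials by alternating meets (even index) and joins (odd index) with the generator whose subscript is the index taken modulo 3.

```python
from functools import lru_cache

# Encoding (illustrative): ("x", i) generator, ("j", a, b) join, ("m", a, b) meet.
def gen(i):
    return ("x", i)

@lru_cache(maxsize=None)
def leq(a, b):
    """Decide a <= b in a free lattice by recursion on the two polynomials."""
    if a[0] == "j":
        return leq(a[1], b) and leq(a[2], b)
    if b[0] == "m":
        return leq(a, b[1]) and leq(a, b[2])
    if a[0] == "x" and b[0] == "x":
        return a[1] == b[1]
    if a[0] == "x":
        return leq(a, b[1]) or leq(a, b[2])
    if b[0] == "x":
        return leq(a[1], b) or leq(a[2], b)
    return leq(a[1], b) or leq(a[2], b) or leq(a, b[1]) or leq(a, b[2])

def chain(m):
    """Return [t_1, ..., t_m]: t_1 = x_1, then alternate meet and join."""
    ts = [gen(1)]
    for n in range(2, m + 1):
        xn = gen((n - 1) % 3 + 1)           # subscript on x taken modulo 3
        op = "m" if n % 2 == 0 else "j"     # even index: meet, odd index: join
        ts.append((op, ts[-1], xn))
    return ts

def strictly_below(a, b):
    """True when a < b, i.e. a <= b holds but b <= a fails."""
    return leq(a, b) and not leq(b, a)

ts = chain(13)   # ts[k - 1] is t_k
```

For example `strictly_below(ts[0], ts[6])` compares $t_1$ with $t_7$. A finite computation of this kind says nothing about suprema, which is the question taken up next.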

Lemma 4.1: If one of the infinite chains $t_{6i+k}$ (k fixed) has a least upper bound, then so does each, and if $\sup_i \{t_{6i+k}\} = s_k$, then $s_{2k} = s_{2k-1} \cap x_{2k}$ and $s_{2k+1} = s_{2k} \cup x_{2k+1}$.

Note: The subscripts on s may be taken modulo 6, on x modulo 3. We do not as yet assert that $s_{2k-1} \cap x_{2k}$ gives the canonical form of $s_{2k}$.

Proof: By Theorem 8, if $s_{2k-1}$ exists then

$s_{2k} \equiv \sup_i \{t_{6i+2k}\} = \sup_i \{t_{6i+2k-1} \cap x_{2k}\}$

$= \sup_i \{t_{6i+2k-1}\} \cap \sup_i x_{2k}$

$= s_{2k-1} \cap x_{2k}$

so that $s_{2k}$ exists and equals $s_{2k-1} \cap x_{2k}$. Likewise $s_{2k+1} = s_{2k} \cup x_{2k+1}$ and so if any s exists, then by repetition all do. Q.E.D.

Lemma 4.2: If the $s_k$ are in canonical form, then the equalities in Lemma 4.1 become identities.

Proof: Part (I): $s_{2k+1} \equiv s_{2k} \cup x_{2k+1}$.

Case 1: $s_{2k+1} \equiv \prod r_i$. Then either $s_{2k+1} = s_{2k}$ or $s_{2k+1} = x_{2k+1}$, by Corollary 1.2. But the second is contradicted by $x_{2k+1} \le t_{6i+2k+1} < s_{2k+1}$, and if the former held, then by Lemma 4.1,

$x_{2k+1} < s_{2k+1} = s_{2k} \le x_{2k}$

whereas $x_{2k+1} \nleq x_{2k}$ in FL(3). Thus Case 1 is impossible.

Case 2: $s_{2k+1} \equiv \sum r_j$. By Corollary (19) of [14] and by (I), we have

(1) given j, either $r_j \le s_{2k}$ or $r_j \le x_{2k+1}$.

Now $x_{2k+1} \le s_{2k+1}$, so by Theorem 1 B, $x_{2k+1} \le r_j$ for some j; say

(2) $x_{2k+1} \le r_1$.

But as in Case 1,

(3) $x_{2k+1} \nleq s_{2k}$.

⁷ [2] pp. 451-452. The following typographical changes are needed on p. 452: line 5, subscript should be t + 1; lines 7-8, interchange meet and join symbols.





By (2) and (3), $r_1 \nleq s_{2k}$, so by (1) and (2), $r_1 = x_{2k+1}$. By canonicity and (1), we have

(4) $r_1 \equiv x_{2k+1}$ and $r_j \le s_{2k}$ for j > 1.

By Lemma 4.1,

$s_{2k-1} \cap x_{2k} = s_{2k} \le s_{2k+1} \equiv \sum r_j$,

so by Theorem 1, either

(5) $s_{2k} \le r_i$ for some i,

or

(6) $s_{2k-1} \le s_{2k+1}$

or

(7) $x_{2k} \le s_{2k+1}$.

If (7) holds, then $x_{2k} \cup x_{2k+1} \le s_{2k+1}$, which is impossible in FL(3) by the covering results stated in Theorem 3 since $s_{2k+1}$ is the limit of a strictly monotone increasing sequence. If (6) holds then a similar contradiction is reached: $x_{2k-1} \cup x_{2k+1} \le s_{2k+1}$. Hence (5) holds. But $s_{2k} \nleq x_{2k+1}$, for

$x_{2k-1} \cap x_{2k} \le (s_{2k-2} \cup x_{2k-1}) \cap x_{2k} = s_{2k-1} \cap x_{2k} = s_{2k}$

whereas $x_{2k-1} \cap x_{2k} \nleq x_{2k+1}$. Hence in (5) i > 1 and by (4), $r_i = s_{2k}$. By canonicity,

$s_{2k+1} \equiv r_1 \cup r_2 \equiv x_{2k+1} \cup s_{2k}$.

Case 3: $s_{2k+1} \equiv x_i$. Like Case 1.

Part (II): $s_{2k} \equiv s_{2k-1} \cap x_{2k}$.

Case 1: $s_{2k} \equiv \sum r_j$. By Corollary 1.2, either

(1) $s_{2k} = s_{2k-1}$ or

(2) $s_{2k} = x_{2k}$.

If (1) holds, then

$x_{2k-1} \le s_{2k-1} = s_{2k} \le x_{2k}$

whereas $x_{2k-1} \nleq x_{2k}$. If (2) holds, then

$x_{2k} \cup x_{2k+1} = s_{2k} \cup x_{2k+1} = s_{2k+1}$

contrary to Theorem 3.

Case 2: $s_{2k} \equiv \prod r_j$. By Corollary (19) of [14], given j, either

(1) $s_{2k-1} \le r_j$ or

(2) $x_{2k} \le r_j$.

Also $s_{2k} \le x_{2k}$, so for some j, $r_j \le x_{2k}$; say j = 1:

(3) $r_1 \le x_{2k}$.





Then $s_{2k-1} \nleq r_1$ (otherwise $x_{2k-1} \le s_{2k-1} \le r_1 \le x_{2k}$, whereas $x_{2k-1} \nleq x_{2k}$) so by (2), $x_{2k} \le r_1$, and by (3) and canonicity we have

(4) $x_{2k} \equiv r_1$ and $s_{2k-1} \le r_j$ for j > 1.

Also

$\prod r_j \equiv s_{2k} = s_{2k-1} \cap x_{2k} \le s_{2k-1} = s_{2k-2} \cup x_{2k-1}$,

so either

(5) $s_{2k} \le s_{2k-2}$ or

(6) $s_{2k} \le x_{2k-1}$ or

(7) $r_i \le s_{2k-1}$ for some i.

If (5) holds, then $s_{2k} \le x_{2k-2} \cap x_{2k}$, and if (6) holds then $s_{2k} \le x_{2k-1} \cap x_{2k}$, each contrary to Theorem 3. Hence (7) holds, and by (4) and canonicity,

$s_{2k} \equiv x_{2k} \cap s_{2k-1}$.

Case 3: $s_{2k} \equiv x_i$. Like Case 1.

Thus Lemma 4.2 is proven.

Lemma 4.3: FL(3) is not complete.

Proof: The assumption that every set has a least upper bound leads via Lemmas 4.1 and 4.2 (applied six times) to the conclusion that

$s_1 \equiv (((((s_1 \cap x_2) \cup x_3) \cap x_1) \cup x_2) \cap x_3) \cup x_1$

and that both sides of this equation are in canonical form, which is impossible since they are not of the same length.

Theorem 9: FL(n) is not complete,⁸ for $n \ge 3$.

Proof: FL(n) has the infinite chain $\{t_{6i+1}\}$, of which each element is a polynomial in $x_1$, $x_2$, $x_3$ alone, not involving $x_4, \dots, x_n$. By⁹ Lemma 3.2, the least upper bound of this chain (if it exists) is a polynomial in $x_1$, $x_2$, $x_3$ alone, and so by Lemma 3.1 must be the least upper bound in FL(3), contrary to the proof of Lemma 4.3.

5. FM(4)

Birkhoff has shown ([1] pp. 463-464) that the free modular lattice FM(4) has infinitely many distinct elements. A slightly stronger result, again due to Birkhoff,¹⁰ is this:

Theorem 10: FM(4) has an infinite chain of distinct elements.

Proof: Let A be the free Abelian group generated by $x_1, x_2, \dots, x_n, \dots$. For k = 0, 1, 2, ... let $S_1 = [x_{2k}]$, the subgroup of A generated by the $x_{2k}$, $S_2 = [x_{2k} + x_{2k+1}]$, $S_3 = [x_{2k+1}]$, $S_4 = [x_{2k+1} + x_{2k+2}]$. If we set $B = [S_1, S_2, S_3, S_4]$, then the set of all subgroups of B is a modular lattice ([3] p. 35). Let $T_1 = S_1$, $T_{4n+1} = T_{4n} \cap S_1$, $T_{4n+2} = T_{4n+1} \cup S_2$, $T_{4n+3} = T_{4n+2} \cap S_3$, $T_{4n+4} = T_{4n+3} \cup S_4$. Then

$T_{4n} = [x_{2n+2k-1},\ x_{2n+2k},\ x_{2k+1} + x_{2k+2}]$

$T_{4n+1} = [x_{2n+2k}]$

$T_{4n+2} = [x_{2n+2k},\ x_{2n+2k+1},\ x_{2k} + x_{2k+1}]$

$T_{4n+3} = [x_{2n+2k+1}]$,

whence $T_4 > T_8 > T_{12} > \cdots$. Thus one modular lattice with four generators has an infinite chain, hence so does FM(4). Q.E.D.

⁸ In fact, not even conditionally sigma-complete ([3] p. 29); we have exhibited a bounded countable set without least upper bound. Note: While n was required finite in Theorem 3, the method of applying that theorem here avoids such a restriction in the present theorem.

⁹ The adjunction of 1 and 0 possibly required for that lemma causes no trouble, for (using Lemma 3.1) we may apply Lemma 3.2 to the lattice generated by the first three x's and those involved in an alleged least upper bound; since this set is finite, 1 already exists for this sublattice.

¹⁰ Who communicated this section to the author by letter, March 7, 1940.

6. Unsolved problems 

As far as the author is aware, the following problems suggested by or related to the present paper remain open. (1) Solve the word problem for free modular lattices ([3] p. 146). This seems to be difficult. (2) Extend Theorem 3, and [14] §4. Perhaps not difficult. (3) Does some infinite set in FL(n) order-converge (§4)? We may call a set of elements $\{a_1, a_2, \dots\}$ a fence below a if c < a implies $c \le a_i$ for some i. As we saw in Theorem 3 the $\sum_{i \ne p} x_i$ form a fence below the unit in FL(n); etc. Is it perhaps true that in FL(n) every element has a finite upper and lower fence? This would settle the first part of (3). On the other hand if some suprema exist, are there real cases like the vacuous one of Lemma 4.2 where canonical forms carry over? Do there exist infinite connected chains in FL(n), that is, ones in which no further elements can be interpolated? (4) Study the free complete lattice. Study the completion of FL(n) by cuts (MacNeille [9] p. 443; [3] p. 27). (5) Determine the finite sublattices of FL(n); cf. Theorem 7.

Harvard University and the University of Pennsylvania 

REFERENCES 

G. Birkhoff. [1]: On the combination of subalgebras, Proc. Camb. Phil. Soc. 29 (1933) pp. 441-464. [2]: On the structure of abstract algebras, ibid., 31 (1935) pp. 433-454. [3]: Lattice Theory, Amer. Math. Soc. Colloq. Pub. XXV, New York 1940. [4]: Rings of Sets, Duke J. 3 (1937) pp. 443-454.

R. Church. [5]: Numerical analysis of certain free distributive structures, Duke J. 6 (1940) pp. 732-734.

R. Dedekind. [6]: Über Zerlegungen von Zahlen durch ihre größten gemeinsamen Teiler, Festschrift Techn. Hoch. Braunschweig (1897), Gesammelte Werke II pp. 103-148. [7]: Über die von drei Moduln erzeugte Dualgruppe, Math. Annalen 53 (1900) pp. 371-403, Ges. Werke II pp. 236-271.

F. Levi. [8]: Über die Untergruppen freier Gruppen, Math. Zeit. 32 (1930) p. 315ff.

H. M. MacNeille. [9]: Partially ordered sets, Trans. Am. Math. Soc. 42 (1937) pp. 416-460.

J. Nielsen. [10]: Om regning med ikke-kommutative faktorer, Matematisk Tidsskrift B, 1921, pp. 77-94.

O. Schreier. [11]: Die Untergruppen der freien Gruppen, Abh. aus dem Math. Seminar Hamburg 5 (1927) pp. 161-183.

T. Skolem. [12]: Über gewisse Verbände oder Lattices, Avhandlinger utgitt av det Norske Videnskaps Akademi (Mat.-Naturv. Klasse) 1936 no. 7.

P. M. Whitman. [13]: Abstract, Bull. Am. Math. Soc. 46 (1940) p. 760. [14]: Free Lattices, Annals of Math. (2) 42 (1941) pp. 325-330.



Annals of Mathematics
Vol. 43, No. 1, January, 1942


TOPOLOGICAL METHODS FOR THE CONSTRUCTION OF TENSOR 

FUNCTIONS 

By Norman E. Steenrod 
(Received July 7, 1941) 

1. Introduction 

It is a well-known theorem that an orientable manifold M admits a continuous field of non-zero tangent vectors if and only if the Euler number of M is zero. Stiefel [5]¹ has generalized this result to the case of any finite number of fields independent at each point, obtaining necessary conditions in the form of the vanishing of certain cohomology classes of M.

From these results, there is every reason to expect a general theory connecting the existence of tensor functions of various types over a manifold M with the topological structure of M. It is our purpose in this paper to give the foundations and first theorems of such a theory.

Briefly, it is shown that the set M' of point tensors (of a fixed order and weight) over the differentiable manifold M is a differentiable manifold forming, in a natural way, a fibre bundle over M in the sense of Whitney [6]. A tensor function attaches to each point of M a point of the fibre in M' over it. The nature of a tensor function is restricted by specifying that its values lie in a submanifold M'' of M'. If M'' is likewise a fibre bundle, then there is a characteristic cohomology class in M attached to M''. The vanishing of this class proves to be a necessary (and sometimes sufficient) condition for the existence of the prescribed tensor function. As an application, a direct and simple proof is given that any separable manifold admits a Riemann metric.²

It is worth noting here that the characteristic cohomology class belongs to a 
new type of cohomology group: one based on local coefficient groups connected 
by local isomorphisms (see section 10). It reduces to the usual group if M is 
simply connected. 

It will be seen in the proofs that the problem we are considering is a generalization of the problem of extending continuous mappings. We shall both generalize and use extensively certain theorems of Eilenberg in this connection [2].

In an appendix, the existence and properties of the characteristic cohomology 
class are established for the more general type of fibre space introduced by 
Hurewicz and the author [3]. Any lengthy proof that would interrupt the trend 
of ideas is postponed to a later section. 


1 Numbers in square brackets refer to the bibliography. 

2 Whitney [7] has proved this by showing that any $C^r$-manifold is $C^r$-homeomorphic to an analytic manifold in a euclidean space.







2. Spaces of point tensors 

In the following, M will denote the differentiable manifold³ of dimension n over which tensors are to be defined. The class r of M is assumed to be $\ge 2$. It is not assumed that M is compact. It is assumed that M is separable so that a countable set of coordinate neighborhoods can be found to cover it. It follows from results of Cairns [1] that M can be triangulated. We may therefore assume that M is homeomorphic with a definite countable simplicial complex. The symbol $M^q$ will denote the subcomplex of M composed of its simplexes of dimensions $\le q$.

In order to be explicit and yet not have too cumbersome a notation, we shall define only the manifold M' of point tensors over M having one contravariant and two covariant indices. If P is a point of M, C(x) a coordinate neighborhood of P, and $a^i_{jk}$ (i, j, k = 1, ..., n) an ordered set of $n^3$ real numbers, the combination $(P, C(x), a^i_{jk})$ is called a representation of a point tensor. Two such $(P, C(x), a^i_{jk})$, $(Q, C(\bar x), \bar a^i_{jk})$ are said to be equivalent if P = Q and

(A)  $\bar a^i_{jk} = a^u_{vw}\, \dfrac{\partial \bar x^i}{\partial x^u}\, \dfrac{\partial x^v}{\partial \bar x^j}\, \dfrac{\partial x^w}{\partial \bar x^k}$

(The derivatives are evaluated at P.) As is well known, this is a proper equivalence relation, so that the combinations $(P, C(x), a^i_{jk})$ fall into mutually exclusive equivalence classes. An equivalence class is called a point tensor. The adjective point is used for the obvious reason that a point tensor is attached to that point of M occurring in any one of its representations.

The family of all point tensors of the above type at all points of M form a set M' which we now proceed to show is a differentiable manifold in a natural way. To do this we must define 1-1 maps of subsets of M' into suitable subsets of a euclidean space. Let C(x) be an admissible coordinate neighborhood in M. Let U be the family of all point tensors having representations $(P, C(x), a^i_{jk})$ (where P varies in C(x), and the a's are arbitrary). Then U is in 1-1 correspondence with this set of representations in C(x). Attach to $(P, C(x), a^i_{jk})$ the $n + n^3$ real numbers $x^1, \dots, x^n, a^i_{jk}$ where the x's are the coordinates of P in C(x). The two correspondences define a 1-1 map of U into the open subset of $(n + n^3)$-euclidean space of coordinates (x, a) where x is in C(x) and the a's are arbitrary. Thus for each C(x) in M a coordinate neighborhood C(x, a) in M' is determined. If a point tensor is in both C(x, a) and $C(\bar x, \bar a)$ then, if $\bar x^i = f^i(x)$ is the coordinate transformation from C(x) to $C(\bar x)$, these equations together with the equations (A) define the coordinate transformation from C(x, a) to $C(\bar x, \bar a)$. Thus, the $\bar x$'s are functions of the x's alone while the $\bar a$'s are functions of both the x's and the a's and are linear in the latter. The determinant of this transformation is easily seen to be a power of the determinant of the transformation from C(x) to $C(\bar x)$. Since the derivatives in (A) are of class r − 1, it follows that M' is a differentiable manifold of class r − 1. Its dimension is $n + n^3$.

³ For the concept of differentiable manifold see [7].

It is clear that the relative point tensors of a given weight likewise form a 
differentiable manifold. The same procedure likewise carries through for the 
space of point affine connections. It is only necessary that the transformation 
law in question shall define a proper equivalence relation among the representations.
For example, ordered pairs of covariant vectors at points of M have
representations of the form $(P, C(x), a_i, b_i)$ with coordinate transformations
of the form

$$\bar x = f(x), \qquad \bar a_i = a_u \frac{\partial x^u}{\partial \bar x^i}, \qquad \bar b_i = b_u \frac{\partial x^u}{\partial \bar x^i}.$$

The determinant is again a power of the determinant of $x \to \bar x$.

In the following pages we shall frequently speak of "a tensor manifold M'
over M." It is to be understood that M' is the space of point tensors over M
of a fixed order and weight, or M' is the space of point affine connections over
M, or M' is the space of ordered sets of vectors (or tensors of given order and
weight) at points of M.

3. The projection of the tensor manifold onto M 

Let the function π assign to each point tensor in M' the point of M to which
the tensor is attached. This natural mapping of M' onto M we refer to as the
projection. If C(x) is a coordinate neighborhood in M, and C(x, a) the corresponding
one in M', then π has the form of a projection in these coordinates.
The Jacobian of the projection has rank n at each point of M'.

The inverse image of a point P of M in M' (i.e. the point tensors at P) forms
a linear space which we shall refer to as the fibre $F_P$ over P. It follows readily
that M' is a fibre bundle over M in the sense of Whitney [6]. The fibres are
euclidean spaces with linear transformations as the admissible group, and the
fibres over a particular coordinate neighborhood form a product space.

4. Tensor functions over M 

A tensor function over M is a map f of M in a tensor manifold M' over M
having the property that πf is the identity. The image of M in M' under f
is called the graph of f.

The usual definition of tensor function has the disadvantage that graphs exist
only locally and then vary with the coordinate system. The advantage of the
present method is that, by identifying the equivalent functional values first,
the function itself falls into that class of objects (usually referred to as functions)
which are point to point correspondences.

If f is a map of the set A in the set B, the graph of f is usually defined to be
the set of pairs (a, f(a)) in the product space A × B. Therefore the foregoing definition
of graph of a tensor function needs a few words of justification. Suppose M' is
the space of point scalars over M. It is not hard to see that M' is the product
space M × L of M with a coordinate line L. Any map f of M in L (i.e. a scalar
function in the usual sense) has a graph in M × L which defines a scalar function
in the sense described above. Therefore, at least in this case, the term is
justified. An additional reason is that for any neighborhood C(x) the part of
the graph over it is the graph of the functions a(x) in the product space C(x, a).

In general, as examples given later will show, M' is not the product space of
M with a fibre. Consequently we cannot expect that tensor functions as usually
defined will have graphs in the usual sense. We have, however, with the present
definition, most of the important features of a graph. The graph is in a space
which is locally a product space over M, and we have the projection π which,
when applied to the graph, results in the identity map of M. The only missing
feature is the resolution of M' into a product space of a second space with M.
This, however, is inherent in the nature of a tensor.

In case M' can be resolved into a product space, it is of little interest unless
the resolution satisfies several strong conditions. We shall say that M' is
properly resolved into a product space M × E, where E is a euclidean space,
provided there is a homeomorphic map of class r - 1 of M × E onto M' with the
property that, for each P ∈ M, the section P × E is mapped linearly on the fibre
over P in M'. Then we have

Theorem 1. If M' is a tensor space over M of dimension n + m, it can be 
properly resolved into a product space if and only if there exist m tensor functions 
over M of class r — 1 which are linearly independent at each point of M. 

Suppose M' is properly resolved, and φ is its homeomorphism with M × E.
Let $a_1, \dots, a_m$ be m linearly independent points of E. Then $f_i(P) = \phi(P, a_i)$
(i = 1, ..., m) are m tensor functions of class r - 1 which are linearly
independent at each point of M.

Conversely, let $\{f_i(P)\}$ be m independent tensor functions of class r - 1.
Define $\phi(P, a_1, \dots, a_m) = \sum_i a_i f_i(P)$. Then φ is a proper map of M × E
onto M'.

5. Statement of problem

The most general type of problem for which we shall give a method of attack 
may be formulated as follows: 

Let M' be a tensor manifold over M, and let M'' be a differentiable sub-manifold
of M' such that the projection π maps M'' onto M, has a Jacobian of
rank n at every point of M'', and M'' is locally a product space over M. (This
last means that each point P of M has a neighborhood C such that the product
C × F'', where F'' is the fibre over P, is homeomorphic with the part of M'' over
C, and, for any point Q of C, Q × F'' is mapped on the fibre over Q.) The problem
is to define a map f of M in M'' such that πf is the identity map of M, and f has
a specified class (up to the class of M'').

The following are examples of problems of this type.

a. If M'' is all of M', the solution is immediate, for the zero tensor at every
point is an f of the required kind.





b. Let M' be the space of contravariant point vectors over M and M'' the
family of non-zero vectors. This is the problem of defining a non-singular
vector field over M.

c. Let M' be the covariant point tensors of order 2, and M'' the symmetric
positive definite ones. A solution f determines a Riemann metric in M.

d. Let M' be as in c, and let M'' be the symmetric semi-definite tensors of a
fixed rank k.

e. Let M' be as in c, and let M'' be the symmetric non-singular tensors of a
fixed index h.

f. Let M' be the set of ordered sets of k contravariant vectors (k ≤ n), and
let M'' be the subspace of independent sets. This is the problem considered
by Stiefel [5].

g. Consider the problem raised by Theorem 1. Let M'' be the tensor space
of ordered sets of m tensors each of the same order as in M'. Let M''' be the
subspace of independent sets. Then the existence of a map f of M into M'''
such that πf = identity is a necessary and sufficient condition that M' can be
properly resolved into a product space.

The methods to be given seem to be of little value in handling a problem 
involving differential conditions such as defining a flat affine connection or a 
Riemann metric of constant or constant mean curvature. 

The following theorem reduces the general problem to the simpler one of 
finding a continuous admissible tensor defined on M. 

Theorem 2. If f is a continuous map of M in M'' such that πf = identity,
then there exists a map f' of class = that of M'' such that πf' = identity. If a
metric is given in M'', f' may be chosen to approximate f as closely as desired. (If
M'' is analytic, we can only assert the existence of an f' of class ∞.)

6. Construction of a Riemann metric 

In order to illustrate the basic method of attack on our problem, we shall 
show that one can define over M a symmetric, positive definite, tensor function 
g of covariant order 2. This result has already been established by Whitney 
[7]. His procedure is to construct a regular imbedding of M in a euclidean space 
and then to take over into M the metric of the euclidean space. Our procedure 
is a direct construction of the metric tensor and is somewhat simpler. However, 
it fails to give the imbedding theorem. 

Let M' be the manifold of covariant point tensors of order 2 over M. Let
M'' be the subset of symmetric, positive definite ones. Let F' be the fibre in
M' over a point P in M, and let F'' be its part in M''. The important observation
at this point is that F'' is a convex linear cell of dimension n(n + 1)/2.
This is proved by noting that a linear combination with positive coefficients of
two positive definite quadratic forms is positive definite. Since a symmetric
matrix is determined by n(n + 1)/2 of its elements, and since any symmetric
matrix sufficiently near a positive definite one is likewise positive definite, it
follows that the dimension is n(n + 1)/2.
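The convexity observation can be illustrated in the smallest non-trivial case. The sketch below assumes n = 2 and tests positive definiteness by the leading principal minors (Sylvester's criterion); the matrices `G1`, `G2` and the function names are illustrative choices, not taken from the paper.

```python
def is_positive_definite_2x2(m):
    """Sylvester's criterion for a symmetric 2x2 matrix: both leading minors positive."""
    (a, b), (c, d) = m
    return b == c and a > 0 and a * d - b * c > 0


def combine(s, g1, t, g2):
    """Entrywise linear combination s*g1 + t*g2 of two 2x2 matrices."""
    return [[s * g1[i][j] + t * g2[i][j] for j in range(2)] for i in range(2)]


# Two illustrative positive definite forms (assumed data).
G1 = [[2, 1], [1, 2]]
G2 = [[1, -1], [-1, 3]]


def segment_stays_positive_definite(g1=G1, g2=G2, steps=10):
    """Check the combinations (steps - k)*g1 + k*g2 for k = 0, ..., steps.

    Integer weights are used so that the arithmetic is exact; up to the
    positive factor `steps` these are points of the segment joining g1 to g2.
    """
    return all(
        is_positive_definite_2x2(combine(steps - k, g1, k, g2))
        for k in range(steps + 1)
    )
```

A finite check of this kind is only an illustration; the general statement follows because the value of s·g1 + t·g2 on a non-zero vector is a sum of non-negative multiples of positive numbers, not all zero.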





We suppose now that M is subdivided into a simplicial complex so fine that
each simplex is contained wholly within some one coordinate neighborhood.
Since the number of simplexes is countable, we may order them in a sequence
$\sigma_1, \sigma_2, \dots$ in such a way that a simplex is preceded by any one of its faces.
Then $\sigma_1$ is a vertex P of M. We choose a point in the fibre F'' over P and denote
it by g(P). Suppose inductively that g has been defined over all simplexes
preceding $\sigma_k$ so as to be continuous on their point set sum and πg = identity.
Then g is defined on $\partial\sigma_k$ (the boundary of $\sigma_k$). Let C(x) be a coordinate
neighborhood containing $\sigma_k$. Then the functions $g_{ij}(x)$ are defined and continuous
over $\partial\sigma_k$. These functions alone map $\partial\sigma_k$ into the fibre F'' over a point P in
C(x). Since F'' is a cell, these functions can be extended over the interior of
$\sigma_k$ so as to give a continuous map of the closed simplex $\sigma_k$ in F''. The desired
map g of $\sigma_k$ in $C(x, g_{ij})$ is now obtained by setting $g(x) = (x, g_{ij}(x))$ for $x \in \sigma_k$.
Then g is continuous on the sum of the first k simplexes, for if a function is
continuous on each of two compact sets, it is continuous on their sum. This
inductive construction leads to a continuous map g of M in M''. The differentiable
approximation is given by Theorem 2.
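The ordering used in the induction can be sketched for a finite complex. The code below is a hedged illustration only: it assumes a finite complex given by its top-dimensional simplexes as vertex tuples, and orders all faces by dimension, which is one simple way (not necessarily the author's) of ensuring that every simplex is preceded by its faces. The names `closure`, `face_first_order` and `faces_precede` are hypothetical.

```python
from itertools import combinations


def closure(top_simplexes):
    """All non-empty faces of the given simplexes, as sorted vertex tuples."""
    faces = set()
    for s in top_simplexes:
        for r in range(1, len(s) + 1):
            faces.update(combinations(sorted(s), r))
    return faces


def face_first_order(top_simplexes):
    """Order the simplexes by dimension, so each proper face precedes its simplex."""
    return sorted(closure(top_simplexes), key=lambda s: (len(s), s))


def faces_precede(order):
    """Verify that every proper face of a simplex occurs earlier in the sequence."""
    position = {s: i for i, s in enumerate(order)}
    return all(
        position[f] < position[s]
        for s in order
        for r in range(1, len(s))
        for f in combinations(s, r)
    )


# Two triangles sharing the edge (1, 2): an illustrative complex.
ORDER = face_first_order([(0, 1, 2), (1, 2, 3)])
```

With such an ordering, when the induction reaches a simplex the map is already defined on its whole boundary, which is what the extension step requires. For an infinite (countable) complex, ordering by dimension alone does not give a sequence, and a different enumeration is needed.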

7. Observations on the construction 

It is clear that the construction just given applies in any problem where F''
is a cell. This condition may be weakened to the extent of requiring that every
continuous map of a (k - 1)-sphere in F'' for k ≤ n is contractible to a point
in F''. This latter condition is equivalent to the requirement that all homotopy
groups of F'' of dimensions < n shall vanish. This in turn is equivalent to the
same requirement on the homology groups of F'' of dimensions < n [4].

As an application, suppose it is required to define a tensor function over M 
of a prescribed order >1 which is not zero at any point. In this case the fibre 
F" is a euclidean space with its origin deleted. Since the order is >1, the 
dimension m of the euclidean space is >n. Deleting a single point from a 
euclidean m-space does not alter the fact that a map of a sphere of dimension 
<m — 1 in the space is contractible to a point. The required function therefore 
exists. 

We formulate our results as follows: 

Theorem 3. If all the homotopy groups or equally well all the homology groups 
of the fibre F" vanish for each dimension <n, then there exists a tensor defined 
over M of the specified type. 

8. Construction of the characteristic cocycle 

If some sphere in F" of dimension <n is not contractible, we can expect 
essential difficulties. In order to discuss this case we make the 

Assumptions. Let h be the smallest integer such that the homotopy group
[4] $\pi_h(F'') \neq 0$. We suppose h < n. By $\pi_0(F'')$ is meant the 0th homology
group of F'' where cycles are taken with integer coefficients, and the sum of the
coefficients of a cycle is zero. If h = 1, we shall suppose that $\pi_1(F'')$ is abelian.





Since M" is locally a product space, any two nearby fibres are homeomorphic. 
As M is connected we can pass from one fibre to any other through a succession 
of fibres such that successive pairs are homeomorphic. Thus, the above assump¬ 
tions need only have been made of one fibre. 

It is a consequence of our assumptions that any map of an oriented h-sphere
in F'' determines a unique element of $\pi_h(F'')$ (i.e. it is not necessary that a fixed
reference point of the sphere be mapped on a fixed reference point of F'').

We shall suppose that M is subdivided so fine that the star of a simplex lies
wholly within a single admissible neighborhood of M (one such that the part
of M'' over it is a product C × F''). Then, by the argument of section 7, one
may construct a map f of the h-dimensional skeleton $M^h$ in M'' such that πf =
identity.

To any map f of $M^h$ in M'' such that πf = identity, we attach a chain $c^{h+1}(f)$
of M as follows. If σ is an oriented (h + 1)-simplex, C an admissible neighborhood
containing σ, and C × F'' a representation of $\pi^{-1}(C)$ in M'', then f maps
∂σ into C × F''. Let λ map C × F'' into F'' by attaching to each point its F''
coordinate. Then λf maps ∂σ into F'', determining thereby an element of
$\pi_h(F'')$ which we denote by c(f, σ). Then (see footnote 4)

$$c^{h+1}(f) = \sum c(f, \sigma^{h+1})\,\sigma^{h+1}.$$

The sum extends over all (h + 1)-simplexes of M.

An object is a chain if it is a function from simplexes to a group. Here F''
varies from one neighborhood to another, so the definition of chain is not satisfied
by $c^{h+1}(f)$. The definition will be satisfied when we have established a fixed
reference $F''_0$, and fixed isomorphisms connecting $\pi_h(F'')$ for each F'' to $\pi_h(F''_0)$.
We know such isomorphisms exist; but arbitrarily chosen ones will not serve
our purpose. We consider this matter in the following two sections.

9. The orientable case 

Let A be a curve joining two points P, Q of M defined by a function φ(t),
a ≤ t ≤ b, φ(a) = P, φ(b) = Q. Suppose a continuous function h(y, t) is given
with values in M'', defined for y in $F''_P$ and a ≤ t ≤ b, such that πh(y, t) = φ(t)
and h(y, a) = y. Then we shall say that h deforms $F''_P$ along A into $F''_Q$. Two
such deformations of $F''_P$ are said to be equivalent if the two resulting maps of
$F''_P$ in $F''_Q$ are homotopic in $F''_Q$.

If the curve A lies in a neighborhood C and the part of M'' over C is represented
as a product C × F'', then h(y, t) can be defined as the point (φ(t), y) of
the product C × F''. Now any curve A can be expressed as a sum of curves
$A_1, \dots, A_k$ each of which lies in a single neighborhood. By piecing together
homotopies along each, one obtains a homotopy along A.

4 The chain $c^{h+1}(f)$ is the "characteristic cocycle" to be found in the work of Stiefel [5]
and Whitney [6].

A homotopy of $F''_P$ along A into $F''_Q$ induces an isomorphism between $\pi_h(F''_P)$
and $\pi_h(F''_Q)$. This isomorphism appears to depend on several factors. However,
we have

Lemma 1. If $A_1$, $A_2$ are two curves from P to Q which are homotopic leaving
their end points fixed, then a deformation of $F''_P$ along $A_1$ into $F''_Q$ is equivalent to
one along $A_2$.

The virtue of the lemma is that it assures us that the isomorphism set up
between $\pi_h(F''_P)$ and $\pi_h(F''_Q)$ by deforming $F''_P$ along a curve into $F''_Q$ depends only
on the homotopy class of the curve. Consider now the case P = Q. Then
every element of $\pi_1(M)$ defines an automorphism of $\pi_h(F''_P)$. It is not hard to
see that the attached automorphism gives a homomorphism of $\pi_1(M)$ into the
group of automorphisms of $\pi_h(F''_P)$.

Definition. We say that M'' is orientable over M relative to $\pi_h(F'')$ if, for
each point P of M and each element of $\pi_1(M)$, the resulting automorphism of
$\pi_h(F''_P)$ is the identity.

The use of the word orientable is explained by the fact that M itself is orientable
if and only if the space M'' of non-zero, contravariant, point vectors is orientable
over M relative to $\pi_{n-1}(F'')$.

The next remark is that in the definition of orientability we do not need to
require of each point P that any element of $\pi_1(M)$ shall induce the identity
automorphism. It is sufficient to demand this of just one point $P_0$; it then
follows for every point P. This follows from the well-known fact that any
closed curve beginning and ending at P is homotopic to a path which first describes
a curve A from P to $P_0$, then a closed curve from $P_0$ to $P_0$, then finally
describes $A^{-1}$.

The virtue of the orientable case is that any two deformations of $F''_P$ along
curves into $F''_Q$ induce the same isomorphism between $\pi_h(F''_P)$ and $\pi_h(F''_Q)$. For,
if A, B are two curves from P to Q, and $\phi_1$, $\phi_2$ are the two corresponding
isomorphisms, then deforming $F''_Q$ around the closed curve $A^{-1}B$ gives the identity
isomorphism of $\pi_h(F''_Q)$. This automorphism applied to $\phi_1$ gives $\phi_2$, since the
path $AA^{-1}B$ is homotopic to B.

Assuming now the orientable case, we choose a fixed fibre $F''_0$, and for any
fibre F'' we set up the unique isomorphism between $\pi_h(F'')$ and $\pi_h(F''_0)$ obtained
by deforming F'' along a curve into $F''_0$. In this way, each element of $\pi_h(F'')$
for any F'' represents a unique element of $\pi_h(F''_0)$. Consequently, in the orientable
case, the object $c^{h+1}(f)$ defined above is a chain (in the proper sense of the
word) with coefficients in the group $\pi_h(F''_0)$.

10. The non-orientable case 

Here we cannot proceed as before since no unique isomorphisms in the large
exist. However, for each simplex σ, unique isomorphisms connecting the
$\pi_h(F''_P)$ for P ∈ σ can be set up using curves in σ; for a simplex is simply connected.
Thus each simplex σ of M has a coefficient group G(σ). A q-chain is now defined
to be a function f attaching to each q-simplex σ an element f(σ) ∈ G(σ). The
q-chains form a group in the obvious way. Suppose σ' is a face of σ (σ' < σ),
then by using a curve in the closure of σ, joining a point of σ to a point of σ',
a unique isomorphism $h_{\sigma\sigma'}$ of G(σ) onto G(σ') can be defined. If [σ : σ'] is the
incidence number, we can now define the boundary of a q-chain f to be the
(q - 1)-chain ∂f having on σ' the value

$$\partial f(\sigma') = \sum_{\sigma} [\sigma : \sigma']\, h_{\sigma\sigma'}(f(\sigma))$$

which is an element of G(σ'). Since the closure of σ is simply connected,
σ'' < σ' < σ implies $h_{\sigma'\sigma''} h_{\sigma\sigma'} = h_{\sigma\sigma''}$. From this it follows that ∂∂f = 0.
Coboundary is defined by

$$\delta f(\sigma) = \sum_{\sigma'} [\sigma : \sigma']\, h_{\sigma\sigma'}^{-1}(f(\sigma'))$$

and, again, δδf = 0. Thus cycles, cocycles, homology and cohomology can be
defined as usual (see footnote 5). Thus, in the non-orientable case, we are dealing with the local
coefficient groups $\pi_h(F''_P)$ connected by the local isomorphisms gotten by deforming
the $F''_P$ along curves (see footnote 6).
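The relation ∂∂f = 0 can be seen concretely in the simplest situation. The following sketch assumes the special case in which every coefficient group is the integers and every connecting isomorphism is the identity (ordinary constant coefficients), with incidence numbers given by the usual alternating signs; it does not model genuinely local coefficients. The representation of chains as dictionaries is an illustrative choice.

```python
def boundary(chain):
    """Boundary of a chain given as {simplex: integer coefficient}.

    A simplex is a tuple of vertices in increasing order; the face obtained by
    omitting the i-th vertex has incidence number (-1)**i. Vertices are given
    empty boundary.
    """
    result = {}
    for simplex, coeff in chain.items():
        if len(simplex) == 1:
            continue
        for i in range(len(simplex)):
            face = simplex[:i] + simplex[i + 1:]
            result[face] = result.get(face, 0) + (-1) ** i * coeff
    # Drop faces whose coefficients cancelled.
    return {s: c for s, c in result.items() if c != 0}


# An arbitrary 2-chain on the faces of a tetrahedron (assumed data).
TETRAHEDRON_FACES = {(0, 1, 2): 3, (0, 1, 3): -1, (0, 2, 3): 4, (1, 2, 3): 2}
```

In the local-coefficient setting each term would additionally be transported by the isomorphism attached to the pair of simplexes, and the compatibility of these isomorphisms under composition plays the role that the sign cancellation plays here.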

11. Properties of the characteristic cocycle 

Having agreed on the sense in which $c^{h+1}(f)$ can be regarded as a chain in M,
we are prepared to state its principal properties.

Theorem 4. Under the assumptions of section 8, there exist maps f of the
h-dimensional skeleton $M^h$ of M in M'' such that πf = identity. Any such f
defines a chain $c^{h+1}(f)$ with the following properties:

(a). $c^{h+1}(f) = 0$ is a necessary and sufficient condition that f can be extended
continuously to $M^{h+1}$, preserving πf = identity.

(b). $c^{h+1}(f)$ is a cocycle.

(c). If f' is any other map of $M^h$ in M'' such that πf' = identity, then
$c^{h+1}(f') \sim c^{h+1}(f)$.

(d). If $c^{h+1}$ is a cocycle cohomologous to $c^{h+1}(f)$, there exists a map f' of $M^h$ in
M'' such that πf' = identity, and $c^{h+1} = c^{h+1}(f')$.

(e). A necessary and sufficient condition that there be a map f of $M^{h+1}$ in M''
such that πf = identity is that, for any such map f' of $M^h$, we have $c^{h+1}(f') \sim 0$.


5 The resulting groups may be handled much as the ordinary ones. Care must be taken
that simplicial transformations are accompanied by local coefficient group isomorphisms.
The usual proof of topological invariance carries through with just these modifications.
These new groups may also be defined for a general space. Here each point P has its
coefficient group G(P), and a homotopy class of curves from P to Q determines an
isomorphism which is transitive under addition. It has come to the author's attention that
"local coefficients" and Reidemeister's "Überdeckungen" (Topologie der Polyeder, Leipzig,
1938) are equivalent concepts. A discussion of the connection and an analysis of this
homology theory will be found in a forthcoming paper.

6 The procedure of Whitney [6], in the case of non-orientable sphere-bundles, is to reverse
the sign of certain incidence numbers in M. His method can apply in our case only when
the sole automorphism on G(σ) is a reversal of sign.





(f). The cohomology class of $c^{h+1}(f)$ is independent of the subdivision of M which
is used and of the map f of $M^h$; it is therefore a topological invariant of the pair of
spaces M, M'' and the map π.


12. Proof of Theorem 2 (see footnote 6a)


In order to construct the differentiable approximation, we shall prove some 
preliminary results. 

(A). Let D, D' be two rectangular domains in n-space $E^n$ defined by
$a_i < x_i < b_i$, $a'_i < x_i < b'_i$, respectively, and such that D' contains the closure
$\bar D$ of D. Then there exists a real-valued function g defined in $E^n$ of class ∞,
and such that

$$0 \le g \le 1, \qquad g = \begin{cases} 1 & \text{in } D, \\ 0 & \text{in } E^n - D'. \end{cases}$$

Let (c, d) be an interval and let

$$\psi_{cd}(x) = \begin{cases} \text{[expression illegible in the source]} & \text{in } (c, d), \\ 0 & \text{outside } (c, d). \end{cases}$$

Then ψ is of class ∞, and ψ ≥ 0. Let

$$\phi_{cd}(x) = \int_c^x \psi_{cd}(t)\,dt \Big/ \int_c^d \psi_{cd}(t)\,dt.$$

Then φ is of class ∞, 0 ≤ φ ≤ 1, φ = 0 for x ≤ c, and φ = 1 for x ≥ d. If
(a, b), (a', b') are two intervals and a' < a, b < b', then, by piecing together
two such functions as φ, we obtain a function

$$g(x) = \begin{cases} \phi_{a'a}(x) & \text{for } x \le b, \\ 1 - \phi_{bb'}(x) & \text{for } x > b \end{cases}$$

of class ∞, 0 ≤ g ≤ 1, g = 1 in (a, b), g = 0 outside (a', b'). Let $g_i(x_i)$ be such
a function for the pair $(a_i, b_i)$, $(a'_i, b'_i)$. Then the product $g(x_1, \dots, x_n) =
g_1(x_1) \cdots g_n(x_n)$ has the properties asserted in (A).
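The one-dimensional construction in (A) can be sketched numerically. The code below is an illustration under stated assumptions: the bump `psi` uses the standard choice exp(-1/((x - c)(d - x))) on (c, d), which is an assumption and not necessarily the function used in the paper, and the integrals are approximated by a midpoint rule. The parameter names `a_outer` and `b_outer` stand for a' and b'.

```python
import math


def psi(x, c, d):
    """Assumed smooth bump: positive inside (c, d), zero outside."""
    if c < x < d:
        return math.exp(-1.0 / ((x - c) * (d - x)))
    return 0.0


def integral(f, lo, hi, steps=2000):
    """Midpoint-rule approximation of the integral of f over [lo, hi]."""
    width = (hi - lo) / steps
    return sum(f(lo + (k + 0.5) * width) for k in range(steps)) * width


def phi(x, c, d):
    """Smooth step: 0 for x <= c, 1 for x >= d, increasing in between."""
    if x <= c:
        return 0.0
    if x >= d:
        return 1.0
    bump = lambda t: psi(t, c, d)
    return integral(bump, c, x) / integral(bump, c, d)


def g(x, a_outer, a, b, b_outer):
    """Piece two steps together: g = 1 on (a, b) and g = 0 outside (a_outer, b_outer)."""
    if x <= b:
        return phi(x, a_outer, a)
    return 1.0 - phi(x, b, b_outer)
```

For instance, with a' = -1, a = 0, b = 1, b' = 2 the function rises from 0 to 1 on (-1, 0), stays at 1 on (0, 1), and falls back to 0 on (1, 2). The n-dimensional function is the product of such factors, one per coordinate.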

6a A shorter proof can be based on a result of H. Whitney, Analytic extensions of
differentiable functions defined in closed sets, Trans. Amer. Math. Soc., 36 (1934), pp. 63-89,
Th. 3.

(B). Let U be an open set in $E^n$ with compact closure $\bar U$, and let U' be an
open set containing $\bar U$. Then there exists a real-valued function g defined in
$E^n$ of class ∞, and such that

$$0 \le g \le 1, \qquad g = \begin{cases} 1 & \text{on } U, \\ 0 & \text{outside } U'. \end{cases}$$

As $\bar U$ is compact, we can choose a finite number $D_1, \dots, D_m$ of rectangular
domains covering $\bar U$ such that the closure of each is in U'. Let $D'_i$ be a rectangular
domain containing $\bar D_i$ and contained in U'. Then there is a function
$g_i$ attached to the pair $D_i$, $D'_i$ as in (A). Define the function g in $E^n$ by

$$1 - g = (1 - g_1)(1 - g_2) \cdots (1 - g_m).$$

Then g is of class ∞, 0 ≤ g ≤ 1, some $g_i = 1$ implies g = 1, every $g_i = 0$ implies
g = 0. Thus g = 1 in $\sum D_i$, and g = 0 outside $\sum D'_i$.
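The product formula of (B) can be tried out in one dimension. In the sketch below the functions `ramp(...)` are hypothetical piecewise-linear stand-ins for the $g_i$ (continuous but not of class ∞), chosen only so that the arithmetic is exact; the point illustrated is the combination rule 1 - g = (1 - g_1)...(1 - g_m), not the smoothness.

```python
from functools import reduce


def combine(cutoffs):
    """Return g with 1 - g equal to the product of (1 - g_i) over the given g_i."""
    def g(x):
        return 1.0 - reduce(lambda acc, gi: acc * (1.0 - gi(x)), cutoffs, 1.0)
    return g


def ramp(lo_outer, lo, hi, hi_outer):
    """Hypothetical stand-in for a g_i: 1 on [lo, hi], 0 outside (lo_outer, hi_outer)."""
    def gi(x):
        if lo <= x <= hi:
            return 1.0
        if x <= lo_outer or x >= hi_outer:
            return 0.0
        if x < lo:
            return (x - lo_outer) / (lo - lo_outer)
        return (hi_outer - x) / (hi_outer - hi)
    return gi


# Two overlapping pieces covering [0, 3], with supports inside (-1, 4).
G = combine([ramp(-1.0, 0.0, 1.0, 2.0), ramp(0.5, 1.5, 3.0, 4.0)])
```

Wherever some factor g_i equals 1 the product vanishes and g = 1; wherever all g_i vanish the product is 1 and g = 0; in between, g stays in [0, 1] because each factor 1 - g_i does.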

(C). If Q is a point of M'', and C(x) a coordinate neighborhood of π(Q), then
a coordinate neighborhood C''(y) of Q in M'' may be chosen such that π has the
form $x^i = \pi^i(y) \equiv y^i$ (i = 1, ..., n).

Let $C(\bar y)$ be a neighborhood of Q, and let π be given by the functions $x^i = \pi^i(\bar y)$.
Since π has rank n at Q, the equations $y^i = \pi^i(\bar y)$ can be solved for n of the $\bar y$'s,
say $\bar y^1, \dots, \bar y^n$, as functions of the y's and the other $\bar y$'s in some neighborhood
of Q. If we let $y^i = \bar y^i$ (i = n + 1, ..., m), then, in this neighborhood, the
y's form an admissible coordinate system.

Returning now to the proof proper of Theorem 2, let f be a map of M in M''
such that πf = identity. Let P ∈ M and f(P) = Q ∈ M''. Corresponding to
a neighborhood C of P, let C'' be the neighborhood of Q given by (C). We
can then choose two neighborhoods D, E of P such that $C \supset \bar D$, $D \supset \bar E$, and
$f(\bar D) \subset C''$. As M is locally compact and separable, we may choose a sequence
$\{C_i, C''_i, D_i, E_i\}$ of such sets of neighborhoods such that M is covered by the
neighborhoods $E_i$, and each $E_i$ meets only a finite number of other $E_j$. This
latter property can also be arranged for the $D_i$ by reducing them in size without
losing the property $D_i \supset \bar E_i$.

In the pair of neighborhoods $D_1$, $C''_1$, f is given by m functions $y^i = f^i(x)$.
Due to the choice of coordinates in $C''_1$ and πf = identity, we have $f^i(x) = x^i$
(i = 1, ..., n). Choose a pair of neighborhoods U', U such that $D_1 \supset \bar U'$,
$U' \supset \bar U$ and $U \supset \bar E_1$. If ε > 0 is given, we can choose functions $\phi^j(x)$
(j = n + 1, ..., m) in $D_1$ of class ∞ and such that $|\phi^j - f^j| < \epsilon$. Let g(x)
be the function given by (B) for the pair U', U. Define

$$\psi^j = g\phi^j + (1 - g)f^j \quad \text{in } D_1.$$

Then $\psi^j$ is of class ∞ in U, and $\psi^j = f^j$ outside U'. If ε is sufficiently small,
the functions $f^1, \dots, f^n, \psi^{n+1}, \dots, \psi^m$ define a map $f_1$ of $D_1$ in $C''_1$. If $f_1 = f$
outside $D_1$, then $f_1$ is a map of M in M'' such that $\pi f_1$ = identity, and $f_1$ is
differentiable to the class of M'' in an open set about $\bar E_1$. Since only a finite
number of $D_i$ meet $D_1$, by restricting ε, we can insure that $f_1$ maps $\bar D_i$ in $C''_i$
for each i.

Suppose a map $f_k$ of M in M'' is given such that $\pi f_k$ = identity, $f_k$ maps each
$\bar D_i$ in $C''_i$, and $f_k$ is differentiable to the class of M'' in an open set W about
$\sum_{i=1}^{k} \bar E_i$. Choose a pair U', U of open sets such that $U \supset \bar E_{k+1} - W$,
$U' \supset \bar U$, $D_{k+1} \supset \bar U'$ and $\bar U' \cdot \sum_{i=1}^{k} \bar E_i = 0$. Apply now the construction of the
preceding paragraph to each of the last m - n components of $f_k$ in the neighborhood
$D_{k+1}$. The components $\psi^j_k$ will not only be differentiable in U but likewise
wherever the $f^j_k$ are differentiable, namely, in $W \cdot D_{k+1}$. This leads to a new
map $f_{k+1}$ which is differentiable in an open set about $\sum_{i=1}^{k+1} \bar E_i$.





Since $f_{k+1} = f_k$ on $\sum_{i=1}^{k} \bar E_i$, the sequence of functions $\{f_k\}$ so constructed
converges to a map f' of M in M'' such that $f' = f_k$ on $E_k$. Therefore f' is
differentiable to the class of M'' everywhere and πf' = identity.

13. Proof of Lemma 1 

Consider the closed curve $A_1^{-1}A_2$ beginning and ending at Q. By assumption
it is contractible into Q leaving its end points fixed. The two maps of $F''_P$ in
$F''_Q$ are homotopic along this curve. We must cover the contraction of $A_1^{-1}A_2$
into Q by a contraction of the homotopy into $F''_Q$. Let E be a 2-cell, and ψ a
map of E in M so that its boundary ∂E is mapped into $A_1^{-1}A_2$. Subdivide E
into simplexes so fine that the image of each lies wholly within a coordinate
neighborhood of M. Now $A_1^{-1}A_2$ can be contracted into Q over ψ(E) in a finite
number of steps, each consisting of deforming an arc of the curve over a simplex
of E. Thus, each step occurs in a single coordinate neighborhood. Suppose
then φ(t, θ) (a ≤ t ≤ b, 0 ≤ θ ≤ 1) is a homotopy of the arc $\phi_0(t)$ into $\phi_1(t)$
leaving the end points fixed and occurring in a neighborhood C(x). Let h(y, t)
be a homotopy of $F''_P$ along $\phi_0(t)$. Now the part of M'' over C(x) is a product
space C × F''. Hence, h(y, t) is given by a pair of functions $h'(y, t) = \phi_0(t)$ in
C and h''(y, t) in F''. Define h'(y, t, θ) = φ(t, θ) and h''(y, t, θ) = h''(y, t).
The second pair h', h'' define in C × F'' a deformation of h(y, t) into a homotopy
$h_1(y, t)$ along $\phi_1(t)$ without altering h(y, a) and h(y, b). This proves the lemma.

14. Proof of Theorem 4 

The proofs of (a) to (e) parallel closely proofs to be found in a paper by
Eilenberg [2]. The modifications are all of the same type. We shall be dealing
with a simplex σ of M; C will be a neighborhood of σ, C × F'' will be a representation
of the part of M'' over C. In each case our problem will be to construct,
extend, or deform homotopically a map f of σ or ∂σ. The function f has, locally,
two components πf = identity in C, and λf in F''. It will be seen that Eilenberg's
arguments apply in each case to the component λf.

(a). Suppose f is defined on $M^{h+1}$. Then for any $\sigma^{h+1}$, $\lambda f(\sigma^{h+1})$ is a cell in F''
whose boundary is $\lambda f(\partial\sigma^{h+1})$. Therefore $c(f, \sigma^{h+1}) = 0$. Conversely, $c(f, \sigma^{h+1}) = 0$
means that λf can be extended to a map of $\sigma^{h+1}$ in F''. If we extend πf over
$\sigma^{h+1}$ to be the identity, we obtain the extension of f.

(b). Let $\sigma^{h+2}$ be arbitrary, and $K = |\sigma^{h+2}|$. Now λf maps $K^h$ in F''. If F''
is taken to be the space Y of Eilenberg, then the chain $c^{h+1}(\lambda f)$ of Eilenberg is
identical with $c^{h+1}(f)$ over K. As the former is known to be a cocycle on K, it
follows that $\sigma^{h+2}$ enters $\delta c^{h+1}(f)$ with coefficient zero. As $\sigma^{h+2}$ is arbitrary, the
proof is complete.

(c). We need the following:

Lemma 2. There exists a homotopy f(x, t) of f(x) for $x \in M^h$, 0 ≤ t ≤ 1, such
that πf(x, t) = x for $x \in M^h$ and all t, f(x, 0) = f(x) for $x \in M^h$, and f(x, 1) = f'(x)
for $x \in M^{h-1}$.

Define f(x, 0) = f(x) for $x \in M^h$, f(x, 1) = f'(x) for $x \in M^{h-1}$ and πf(x, t) = x
for $x \in M^h$, 0 ≤ t ≤ 1. We must extend the local components λf(x, t) over the
rest of $M^h \times I$. Order the simplexes of $M^{h-1}$ in a sequence such that each simplex
is preceded by its faces. As the first is a vertex P, and F'' is connected
(if h > 0), we can join f(P) to f'(P) by a curve f(P, t) in F''. Suppose f(x, t)
is defined over all prisms σ × I for simplexes σ preceding $\sigma_i$. Then f(x, t) is
defined on $\sigma_i \times 0$, $\sigma_i \times 1$ and $\partial\sigma_i \times I$, and therefore λf(x, t) maps $\partial(\sigma_i \times I)$ in F''.
As the dimension of this sphere is < h, λf can be extended over $\sigma_i \times I$. Having
thus defined f(x, t) over $M^{h-1} \times I$, we have only to extend it over each $\sigma^h \times I$.

As λf(x, t) is defined over $\sigma^h \times 0 + \partial\sigma^h \times I$, and this set is a retract of $\sigma^h \times I$,
λf(x, t) admits a continuous extension over $\sigma^h \times I$. This proves the lemma.

Let f''(x) = f(x, 1). Since for any $\sigma^{h+1}$, $\lambda f(\partial\sigma^{h+1})$ is homotopic in F'' to
$\lambda f''(\partial\sigma^{h+1})$, we have $c^{h+1}(f) = c^{h+1}(f'')$. Corresponding to the maps f', f'' which
agree on $M^{h-1}$, we define (following Eilenberg) an h-chain $d^h(f', f'')$. Let $\sigma^h$ be
any h-simplex. Write the oriented h-sphere as a sum $S^h = E' + E''$ of two
hemispheres. Let g', g'' map E', E'' respectively on $\sigma^h$ topologically with degrees
1, -1 so that they agree on the common boundary. Then define φ = f'g'
on E', φ = f''g'' on E''. As f', f'' agree on $\partial\sigma^h$, λφ is a continuous map of $S^h$ in
F'' determining thereby an element of $\pi_h(F'')$ which is denoted by $d(f', f'', \sigma^h)$.
Let $d^h(f', f'') = \sum d(f', f'', \sigma^h)\,\sigma^h$. Then

$$\delta d^h(f', f'') = c^{h+1}(f') - c^{h+1}(f''). \tag{A}$$

If, for any $\sigma^{h+1}$, we consider the functions λf', λf'' defined on $|\partial\sigma^{h+1}|$ and the
chains $d^h(\lambda f', \lambda f'')$, $c^{h+1}(\lambda f')$, $c^{h+1}(\lambda f'')$ as defined by Eilenberg for the complex
$|\sigma^{h+1}|$, then the relation analogous to (A) holds. However these latter chains
agree with the former where both are defined. The proof of (c) is therefore
complete.

(d). Let $\delta d^h = c^{h+1} - c^{h+1}(f)$. Define f' = f on $M^{h-1}$. Given any $\sigma^h$, define
πf' = identity in $\sigma^h$. Then define λf' on $\sigma^h$ so that $d(f', f, \sigma^h)$ (see proof of (c))
is the coefficient of $\sigma^h$ in $d^h$. Then $d^h(f', f) = d^h$. As shown in (c), $\delta d^h(f', f) =
c^{h+1}(f') - c^{h+1}(f)$. Therefore, $c^{h+1} = c^{h+1}(f')$.

(e). If a map f of $M^{h+1}$ exists, then, by (a), $c^{h+1}(f) = 0$. By (c), for any f'
defined on $M^h$, $c^{h+1}(f') \sim c^{h+1}(f) = 0$. Suppose $c^{h+1}(f') \sim 0$ for some map f' of $M^h$.
Then, if we choose $c^{h+1} = 0$ in (d), we obtain a new map f'' of $M^h$ such that
$c^{h+1}(f'') = 0$. It follows from (a) that f'' can be extended over $M^{h+1}$.

(f). Let $M_1$, $M_2$ be two subdivisions of $M$ into complexes. We suppose $M_2$ is so fine that there is a simplicial map $\tau$ of $M_2$ into $M_1$ such that, for each point $x$, $\tau(x)$ lies on the closure of the simplex of $M_1$ containing $x$. There is, therefore, a homotopy $\tau(x, t)$ connecting the identity to $\tau(x)$ such that $\tau(x, t)$ lies on the closure of the simplex of $M_1$ containing $x$ for $0 \le t \le 1$. We now suppose $M_1$ is so fine that the map $\tau$ of any prism $\zeta \times I$, where $\zeta$ is a simplex of $M_2$, lies in a neighborhood of $M$. (This will be the case if the closure of the star of each simplex of $M_1$ lies in a neighborhood.) Now let $f$ be a map of $M_1^h$ in $M''$. Construct a map $f'$ of $M_2^{h-1}$ in $M''$ (always so that $\pi f' = $ identity). We must extend



CONSTRUCTION OF TENSOR FUNCTIONS 




$f'$ over $M_2^h$ so that $c_2^{h+1}(f') \sim c_1^{h+1}(f)$. To this end we shall define a map $f'(x, t)$ of $M_2^h \times I$ in $M''$. Let $f'(x, 1) = f\tau(x)$ for $x \in M_2^h$, and $f'(x, 0) = f'(x)$ for $x \in M_2^{h-1}$. Order the simplexes of $M_2^h$ in a sequence so that each is preceded by its faces. If $P$ is the first (a vertex), we define $\pi f'(P, t) = \tau(P, t)$ and $\lambda f'(P, t)$ to be a path in $F''$ joining $\lambda f'(P, 0)$ to $\lambda f'(P, 1)$. Suppose now $f'(x, t)$ is defined for $x$ in any simplex preceding $\zeta$, and is such that $\pi f'(x, t) = \tau(x, t)$. Then, if dimension $\zeta < h$, $f'(x, t)$ is defined on the boundary of $\zeta \times I$, and therefore $\lambda f'(x, t)$ admits an extension over this prism. We extend $\pi f'(x, t)$ so as to be $\tau(x, t)$ over $\zeta \times I$. If, however, dimension $\zeta = h$, $f'(x, t)$ is defined only on $\partial\zeta \times I + \zeta \times 1$. As this set is a retract of $\zeta \times I$, $\lambda f'(x, t)$ admits an extension mapping $\zeta \times I$ in $F''$. Then we set $\pi f'(x, t) = \tau(x, t)$. Now let $f'(x) = f'(x, 0)$ for $x \in M_2^h$. Then, for any $\zeta^{h+1}$ of $M_2$, $\lambda f'(\partial\zeta^{h+1})$ is homotopic in $F''$ to $\lambda f(\partial\tau\zeta^{h+1})$. Therefore $c_2(f', \zeta^{h+1}) = c_1(f, \tau\zeta^{h+1})$. Therefore, the cocycle $c_2^{h+1}(f')$ is the image of the cocycle $c_1^{h+1}(f)$ under the simplicial map $\tau$. This completes the proof of (f).

Appendix I. Extensions of method 

In some problems a tensor may already be defined on a submanifold $L$ of $M$, and it may be required to extend it over $M$. Such a problem could arise in connection with a manifold $M$ with regular boundary $L$. If $M$ is subdivided so that $L$ is a subcomplex, then as before $f$ extends to $L + M^h$. The chain $c^{h+1}(f)$ lies in $M - L$ and is a cocycle in the open complex $M - L$. Then a necessary and sufficient condition that $f$ on $L$ can be extended to $L + M^{h+1}$ is that $c^{h+1}(f) \sim 0$ in $M - L$.

In those problems for which the characteristic cohomology class is zero and maps $f$ of $M^{h+1}$ can be defined, we are faced with the difficulty of extending such an $f$ to $M^{h+2}$. We can, of course, proceed as before and define a chain $c^{h+2}(f)$ with coefficients in $\pi_{h+1}(F'')$. For this, it is necessary to know that $F''$ is $(h+1)$-simple in the sense of Eilenberg,$^7$ so that $\lambda f(\partial\sigma^{h+2})$ determines a unique element of $\pi_{h+1}(F'')$. It will follow as before that $c^{h+2}(f)$ is a cocycle; and $c^{h+2}(f) \sim 0$ is a sufficient condition for an $f'$ defined on $M^{h+2}$ to exist. However, it is not necessary; for the cohomology class of $c^{h+2}(f)$ may well vary from one $f$ to another. Just how this cohomology class varies is not known. Some deeper analysis is required to resolve the difficulty.

Appendix II. Application to fibre spaces 

In a paper by Hurewicz and the author [3], the notion of a fibre space was introduced as a generalization of the Whitney notion of fibre bundle. In those cases for which we know that $F''$ is compact, we can prove that $M''$ is a fibre space over $M$ relative to $\pi$. This suggests that any fibre space $X$ over a base space $B$ relative to a mapping $\pi(X) = B$ determines in some sense a characteristic cohomology class in $B$. We propose to show that this is the case.

Assuming $B$ to be arcwise connected, the fibres $F$ may be deformed along

$^7$ If $h > 1$, then $\pi_1(F'') = 0$, and $F''$ is $i$-simple for every $i$.





NORMAN E. STEENROD 


curves into others and these deformations induce isomorphisms between the $\pi_h(F)$ (where $h$ is the smallest integer such that $\pi_h(F) \neq 0$). Therefore, the coefficient groups $\pi_h(F)$ are at hand as before.

A fibre space $X$ need not be locally a product space over its base space $B$. Consequently, the $\pi f$, $\lambda f$ method must be replaced by a new mechanism. Such a one is provided by the following lemma.

Lemma 3. Let $X$ be a fibre space over $B$ relative to $\pi$. Let $K$ be a complex, $L$ a subcomplex, $\psi$ a map of $K$ in $B$, and $\psi'$ a map of $L$ in $X$ such that $\pi\psi' = \psi$. Suppose $\psi_t$ is a homotopy of $\psi(K)$ and $\psi'_t$ is a homotopy of $\psi'(L)$ such that $\pi\psi'_t = \psi_t$, $0 \le t \le 1$. Then, if $\psi'_1$ admits an extension to $K$ such that $\pi\psi'_1 = \psi_1$, this is also true of $\psi'$.

The complex $U = L \times I + K \times 1$ is a deformation retract of $K \times I$. Let $h_\tau$ $(0 \le \tau \le 1)$ be such a retraction. Since $\psi'_t$ gives a map of $U$ in $X$ such that $\pi\psi'_t = \psi_t$ on $U$, the function $\psi'_t h_1$ maps $K \times I$ in $X$ such that $\pi\psi'_t h_1 = \psi_t h_1$. Since $\psi_t h_1$ is homotopic to $\psi_t$ leaving $U$ pointwise fixed, the covering homotopy [3] deforms $\psi'_t h_1$ leaving $U$ fixed into an extension of $\psi'_t$ to $K \times I$ such that $\pi\psi'_t = \psi_t$. Then $\psi'_0$ is the required extension of $\psi'$.

Returning now to the problem of defining a characteristic cocycle, let $K$ be a complex and $\psi$ a map of $K$ in $B$. We propose to show there is a map $\psi'$ of $K^h$ in $X$ such that $\pi\psi' = \psi$. There is no difficulty in defining $\psi'$ on $K^0$. Suppose it has been properly defined on $K^i$ and $\sigma$ is an $(i+1)$-simplex. Then $\psi$ is defined on $\sigma$, and $\psi'$ on $\partial\sigma$. Let $h_t$ contract $\sigma$ on itself to a point. Let $\psi_t = \psi h_t$, and let $\psi'_t$ cover $\psi_t$ on $\partial\sigma$. If $i < h$, the map $\psi'_1$ of $\partial\sigma$ in the fibre $F$ over $\psi_1(\sigma)$ can be extended to a map of $\sigma$ in $F$. Applying Lemma 3 gives an extension of $\psi'$ over $\sigma$. Thus, $\psi'$ can be defined over $K^h$.

If $\psi'$ is a map of $K^h$ in $X$ such that $\pi\psi' = \psi$, we can define a chain $c^{h+1}(\psi')$ in $K$ as follows. Let $\sigma$ be an $(h+1)$-simplex, and let $h_t$ contract $\partial\sigma$ over $\sigma$ to a point $P_0$. Then the homotopy $\psi_t = \psi h_t$ of $\partial\sigma$ has a covering homotopy $\psi'_t$ of $\psi'$. As $\psi'_1$ maps $\partial\sigma$ into the fibre $F$ over $\psi(P_0)$, it defines an element of $\pi_h(F)$ which we denote by $c(\psi', \sigma^{h+1})$. Define $c^{h+1}(\psi') = \sum c(\psi', \sigma^{h+1})\,\sigma^{h+1}$. Then we have:

Theorem 5. If $X$ is a fibre space over $B$ relative to $\pi$, $K$ a complex, and $\psi$ a map of $K$ in $B$, then there exist maps $\psi'$ of $K^h$ in $X$ such that $\pi\psi' = \psi$, where $h$ is the smallest integer for which $\pi_h(F) \neq 0$. Any such map determines a cocycle $c^{h+1}(\psi')$ in $K$ whose cohomology class $c^{h+1}$ is independent of $\psi'$. It may be chosen arbitrarily in its class by an appropriate choice of $\psi'$. Therefore, in order that there be a map $\psi'$ of $K^{h+1}$ in $X$ such that $\pi\psi' = \psi$, it is necessary and sufficient that the cohomology class $c^{h+1} = 0$. Furthermore, the class $c^{h+1}$ is independent of the subdivision used in $K$.

The proofs are like those of Theorem 4. They differ only in that covering 
homotopies and Lemma 3 are used in place of the CXF" construction. 

A continuous finite cycle in $B$ is composed of three things: a complex $K$, a map $\psi$ of $K$ in $B$, and a finite cycle $Z$ in $K$. Let us consider $(h+1)$-cycles with local coefficients in the character groups of the $\pi_h(F)$. Then $c^{h+1} \cdot Z^{h+1}$ is a real





number mod 1. If $(K_1, \psi_1, Z_1)$ and $(K_2, \psi_2, Z_2)$ are homologous continuous cycles, then by definition, there is a complex $K_3$ containing $K_1$, $K_2$ as subcomplexes, a map $\psi_3$ of $K_3$ in $B$ agreeing with $\psi_1$, $\psi_2$ on $K_1$, $K_2$, and a chain in $K_3$ whose boundary is $Z_1 - Z_2$. If a map $\psi'_3$ of $K_3^h$ in $X$ is given such that $\pi\psi'_3 = \psi_3$, then $c_1^{h+1}(\psi'_3)$, $c_2^{h+1}(\psi'_3)$ and $c_3^{h+1}(\psi'_3)$ are simultaneously defined, $c_1^{h+1}(\psi'_3)$ is the part of $c_3^{h+1}(\psi'_3)$ on $K_1$, and $c_2^{h+1}(\psi'_3)$ the part on $K_2$. Therefore, $c_1^{h+1} \cdot Z_1 = c_3^{h+1} \cdot Z_1$, $c_2^{h+1} \cdot Z_2 = c_3^{h+1} \cdot Z_2$. Since $Z_1 \sim Z_2$ in $K_3$, $c_3^{h+1} \cdot Z_1 = c_3^{h+1} \cdot Z_2$. Thus, the real number mod 1, $c^{h+1} \cdot Z$, is independent of the representation of the homology class of $Z$. Thus, a map of the homology group $H^{h+1}(B)$ in the reals mod 1 is at hand. This map is homomorphic; for the sum of two cycles is represented by the abstract sum $K_1 + K_2$, a map $\psi = \psi_1$ on $K_1$, $= \psi_2$ on $K_2$, and the sum $Z_1 + Z_2$. The characteristic cocycle on $K_1 + K_2$ is the sum of the cocycles of $K_1$ and of $K_2$. Therefore, $c^{h+1} \cdot (Z_1 + Z_2) = c_1^{h+1} \cdot Z_1 + c_2^{h+1} \cdot Z_2$. Summarizing, we have:

Theorem 6. Corresponding to the fibre space $X$ over $B$, there is a characteristic cohomology class $c^{h+1}$ in $B$, where $h$ is the smallest integer such that $\pi_h(F) \neq 0$. $c^{h+1}$ belongs to the cohomology group of characters of the homology group $H^{h+1}(B)$ based on continuous finite cycles with local coefficients in the character groups of the $\pi_h(F)$. If $K$ is a complex, and $\psi$ a map of $K$ in $B$, then the image of $c^{h+1}$ in $K$ under $\psi$ is the characteristic cohomology class of $K$ relative to $\psi$ (see Theorem 5).

The University of Chicago 


BIBLIOGRAPHY 

1. S. S. Cairns, Triangulation of the manifold of class one, Bull. Amer. Math. Soc., 41 (1935), pp. 549-552.

2. S. Eilenberg, Cohomology and continuous mappings, Annals of Math., 41 (1940), pp. 231-251.

3. W. Hurewicz and N. E. Steenrod, Homotopy relations in fibre spaces, Proc. Nat. Acad. Sci., 27 (1941), pp. 61-64.

4. W. Hurewicz, Beiträge zur Topologie der Deformationen I-IV, Proc. Amsterdam Acad., 38 (1935), pp. 112 and 521, also 39 (1936), pp. 117 and 215.

5. E. Stiefel, Richtungsfelder und Fernparallelismus in n-dimensionalen Mannigfaltigkeiten, Comm. Math. Helv., 8 (1936), pp. 3-51.

6. H. Whitney, On the theory of sphere-bundles, Proc. Nat. Acad. Sci., 26 (1940), pp. 148-153.

7. H. Whitney, Differentiable manifolds, Annals of Math., 37 (1936), pp. 645-680.



Annals of Mathematics
Vol. 43, No. 1, January, 1942 


HOMOTOPY PROPERTIES OF THE REAL ORTHOGONAL GROUPS 

By George W. Whitehead 
(Received September 25, 1941) 

1. Introduction 

In this paper we propose to investigate the topological structure of the rotation group $R_n$ of the $n$-sphere, with special emphasis on its homotopy properties. Of particular interest are the homotopy groups $\pi_i$ of $R_n$. These groups, one for each dimension $i$, were first defined for a general space by Hurewicz [1].$^1$ Like the homology groups, they are topological invariants of a space; unlike the homology groups, however, no general method for computing them is known. Each space thus presents a problem in itself.

The computation of the groups $\pi_i(R_n)$ for $i \le 5$ and all $n$ will be carried out by the method of fibre mappings and covering homotopies developed by Hurewicz and Steenrod [2]. We shall make extensive use of the results of Freudenthal [3], Hopf [4], and Pontrjagin [5] on the homotopy groups of spheres.

The groups $\pi_i(R_n)$ are useful not only in the study of the homotopy properties of spheres, but also are used by Whitney in his theory of sphere-bundles [6, 7], where they appear as coefficient groups for certain cocycle invariants.

Another application of our results appears in the theory of continuous vector fields over spheres. It is well known that no continuous field of unit vectors can be defined over the spheres of even dimension. Over the odd-dimensional spheres, however, one such vector field can always be defined; and if $n \equiv 3 \pmod 4$ or $n \equiv 7 \pmod 8$ it is possible to define three or seven independent vector fields, respectively, over the $n$-sphere $S^n$. These can be readily constructed by the use of the multiplication matrices for quaternions and Cayley numbers. For a general odd $n$, however, there is no known result on the maximum number of independent vector fields which can be defined over $S^n$.

In this paper the case $n \equiv 1 \pmod 4$ is resolved as follows: Any two vector fields over $S^{4m+1}$ $(m = 0, 1, 2, \cdots)$ are somewhere dependent. As a corollary to this result it is observed that the tangent sphere-bundle of $S^n$ is not simple if $n > 1$ and $n \equiv 1 \pmod 4$.

This investigation was carried out under the direction of Prof. N. E. Steenrod, 
to whom the author wishes to acknowledge his indebtedness for many valuable 
suggestions and criticisms. 

2. Table of groups $\pi_1$ to $\pi_5$ of $R_n$

In this section the results of our computation of the homotopy groups $\pi_i(R_n)$ are exhibited, and a set of generators for these groups is given. Proofs will be deferred until Section 8.


1 Numbers in square brackets refer to the bibliography at the end of the paper. 



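Let $\infty$ denote the free cyclic group, $2$ the cyclic group of order two. If $A$ and $B$ are two abelian groups, $A + B$ denotes their direct sum. In terms of these notations, the groups $\pi_i(R_n)$ may be tabulated as follows:

$$
\begin{array}{c|ccccccccc}
 & R_1 & R_2 & R_3 & R_4 & R_5 & R_6 & \cdots & R_n & \cdots \\ \hline
\pi_1 & \infty & 2 & 2 & 2 & 2 & 2 & \cdots & 2 & \cdots \\
\pi_2 & 0 & 0 & 0 & 0 & 0 & 0 & \cdots & 0 & \cdots \\
\pi_3 & 0 & \infty & \infty + \infty & \infty & \infty & \infty & \cdots & \infty & \cdots \\
\pi_4 & 0 & 2 & 2 + 2 & 2 & 0 & 0 & \cdots & 0 & \cdots \\
\pi_5 & 0 & 0 & 0 & 0 & \infty & 0 & \cdots & 0 & \cdots
\end{array}
$$

The results of the first two rows were first obtained by Cartan [8].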

A generator of $\pi_1(R_n)$ is given by the map of the circle $x_1^2 + x_2^2 = 1$ defined by

$$
x \to \begin{pmatrix} x_1 & -x_2 & 0 \\ x_2 & x_1 & 0 \\ 0 & 0 & I_{n-1} \end{pmatrix},
$$

where $I_{n-1}$ is the $(n-1)$-rowed identity matrix.

A generator of $\pi_3(R_n)$ $(n \ge 3)$ is given by the map of the 3-sphere $\sum_{i=1}^{4} x_i^2 = 1$

$$
x \to \begin{pmatrix}
x_1 & -x_2 & -x_3 & -x_4 & 0 \\
x_2 & x_1 & -x_4 & x_3 & 0 \\
x_3 & x_4 & x_1 & -x_2 & 0 \\
x_4 & -x_3 & x_2 & x_1 & 0 \\
0 & 0 & 0 & 0 & I_{n-3}
\end{pmatrix}.
$$

The corner matrix is the linear transformation of Euclidean 4-space defined by multiplying every quaternion on the left by $x_1 + ix_2 + jx_3 + kx_4$.
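
The description of the corner matrix can be checked numerically. The following is a minimal sketch, not part of the original text: the helper names `left_mult_matrix`, `quat_mul`, `mat_vec` and `gram` are hypothetical, and the coordinate order $(1, i, j, k)$ is assumed. It compares the matrix with an explicit quaternion product and confirms that the matrix is orthogonal for a unit quaternion.

```python
def left_mult_matrix(x):
    """Corner matrix for q = x1 + i*x2 + j*x3 + k*x4 (assumed basis order 1, i, j, k)."""
    x1, x2, x3, x4 = x
    return [
        [x1, -x2, -x3, -x4],
        [x2,  x1, -x4,  x3],
        [x3,  x4,  x1, -x2],
        [x4, -x3,  x2,  x1],
    ]

def quat_mul(p, q):
    """Hamilton product p*q of two quaternions given as 4-tuples."""
    a1, b1, c1, d1 = p
    a2, b2, c2, d2 = q
    return (
        a1 * a2 - b1 * b2 - c1 * c2 - d1 * d2,
        a1 * b2 + b1 * a2 + c1 * d2 - d1 * c2,
        a1 * c2 - b1 * d2 + c1 * a2 + d1 * b2,
        a1 * d2 + b1 * c2 - c1 * b2 + d1 * a2,
    )

def mat_vec(m, v):
    """Matrix times column vector."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]

def gram(m):
    """Return m^T m; this is the identity exactly when m is orthogonal."""
    n = len(m)
    return [[sum(m[k][i] * m[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

x = (0.5, 0.5, 0.5, 0.5)      # a unit quaternion
y = (0.1, -0.7, 0.2, 0.4)     # an arbitrary test vector
M = left_mult_matrix(x)
```

Since the first column of the matrix is $x$ itself, the map $x \to$ (corner matrix) is one-to-one on the 3-sphere.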

The generator of $\pi_3(R_2)$ is the well-known double covering of $R_2$ by $S^3$. Bordering this matrix with a 1 in the lower right hand corner and zeros elsewhere, we obtain the extra generator of $\pi_3(R_3)$.

To obtain the generator of $\pi_4(R_2)$, we map $S^4$ on $S^3$ essentially, and then map $S^3$ into $R_2$ by means of the generator of $\pi_3(R_2)$ given above. Such an essential map was constructed by Freudenthal [3]. In a similar manner we obtain generators for $\pi_4(R_3)$ and $\pi_4(R_4)$.

Finally, the generator of $\pi_5(R_5)$ is determined by the map of $S^5$ into $R_5$ given by

$$
x \to \| \delta_{ij} - 2x_i x_j \| \cdot \begin{pmatrix} I_5 & 0 \\ 0 & -1 \end{pmatrix}.
$$





3. Preliminary notions 

In this section we introduce notations and concepts which will occur through¬ 
out the paper. The relative and absolute homotopy groups of a space are 
introduced and two homomorphisms relating these groups are discussed. 

Let points $x$ in Euclidean $(n+1)$-space be referred to coordinates $(x_1, x_2, \cdots, x_{n+1})$. The unit sphere $\sum x_i^2 = 1$ we denote by $S^n$. The equatorial plane $x_{n+1} = 0$ divides $S^n$ into two hemispheres $V^n_+$ and $V^n_-$ defined by the inequalities $x_{n+1} \ge 0$ and $x_{n+1} \le 0$ respectively. If $x = (x_1, x_2, \cdots, x_{n+1}) \in S^n$, the antipodal point $(-x_1, -x_2, \cdots, -x_{n+1})$ is denoted by $\bar{x}$. We shall refer to the point $x^0 = (0, 0, \cdots, 1)$ as the north pole, and to its antipode $\bar{x}^0$ as the south pole.

The group $R_n$ of all rotations of $S^n$ may be represented as the group of all real square orthogonal matrices of order $n+1$ with determinant $+1$. The subgroup of $R_n$ consisting of all those rotations of $S^n$ which leave the north pole fixed is isomorphic with the group $R_{n-1}$, and we shall denote the former group also by the symbol $R_{n-1}$.

Let $Y$ be a topological space, $F$ a closed subset of $Y$, and $y_0$ a fixed point of $F$. Let $X$ denote the space of all maps of $V^n_+$ into $Y$ which carry the boundary $\partial V^n_+ = S^{n-1}$ into $F$ and the north pole $x^0$ of $S^{n-1}$ into $y_0$. We introduce an equivalence relation in $X$ as follows: two maps $f_1, f_2 \in X$ are said to be equivalent if they are homotopic, and during the homotopy $S^{n-1}$ remains in $F$ and $x^0$ remains at $y_0$. In other words, two points of $X$ are equivalent if they can be joined by an arc in $X$. The relation of equivalence is easily seen to be reflexive, symmetric, and transitive, and thus divides $X$ into classes of equivalent maps, called homotopy classes. The homotopy class determined by a map $f$ we denote by $\{f\}$; the set of all such homotopy classes by $\pi_n(Y, F)$. Hurewicz [1] has introduced an operation, called addition, in $\pi_n(Y, F)$, by means of which it becomes a group, the $n$th relative homotopy group of $Y$ modulo $F$. If the closed set $F$ is specialized to consist only of the point $y_0$, the group $\pi_n(Y, y_0)$ so obtained is called the absolute homotopy group $\pi_n(Y)$. The latter group may also be defined by means of mappings of spheres into $Y$ and we shall frequently find it convenient to use this definition.

We now introduce a homomorphism $\omega$ of $\pi_n(Y, F)$ into $\pi_{n-1}(F)$. This homomorphism is defined as follows: if $\alpha = \{f\} \in \pi_n(Y, F)$, then $\omega(\alpha)$ denotes the homotopy class of $\pi_{n-1}(F)$ determined by the map $f(S^{n-1}) \subset F$. Evidently $\{f\} = \{g\}$ implies $\omega(\{f\}) = \omega(\{g\})$. Thus $\omega$ maps $\pi_n(Y, F)$ into $\pi_{n-1}(F)$, and it follows from the definition of addition that $\omega$ is a homomorphism. Let $\pi_{n,0}(Y, F)$ denote the kernel of this homomorphism, $\pi_{n-1,0}(F)$ the image of $\pi_n(Y, F)$ under $\omega$. Evidently $\pi_{n,0}(Y, F)$ consists of those homotopy classes determined by those relative $n$-cells in $Y$ modulo $F$ which are contractible into $F$, while $\pi_{n-1,0}(F)$ consists of the classes determined by those $(n-1)$-spheres in $F$ which are homotopic to points in $Y$.

Since each element of $\pi_n(Y)$, considered as a set, is a subset of an element of $\pi_n(Y, F)$, we have a natural mapping $\psi$ of $\pi_n(Y)$ into $\pi_n(Y, F)$, which is a homomorphism. It is not hard to show that $\psi(\{f\}) = 0$ if and only if some $f'$ in the





class of $f$ maps $S^n$ into $F$. If $\{f_1\} = \{f_2\}$ and $\psi(\{f_1\}) = \psi(\{f_2\}) = 0$, then $f'_1$ is homotopic in $Y$ to $f'_2$. Thus $f'_1$ and $f'_2$ determine the same element of $\pi_n(F)/\pi_{n,0}(F)$. Conversely, if $f(S^n) \subset F$, then $\psi(\{f\}) = 0$. Hence the kernel of the homomorphism $\psi$ is isomorphic to $\pi_n(F)/\pi_{n,0}(F)$. The image of $\pi_n(Y)$ under $\psi$ is evidently the group $\pi_{n,0}(Y, F)$. We summarize these results in

Theorem 1. The natural homomorphic maps $\omega[\pi_n(Y, F)] \subset \pi_{n-1}(F)$ and $\psi[\pi_n(Y)] \subset \pi_n(Y, F)$ are related as follows: the kernel of the homomorphism $\psi$ is isomorphic to $\pi_n(F)/\pi_{n,0}(F)$, while the image of $\pi_n(Y)$ under $\psi$ is the group $\pi_{n,0}(Y, F)$.

4. Homotopy relations in compact Lie groups 

Let $G$ be a topological group, $H$ a closed subgroup of $G$, and $B = G/H$ the space of left (or right) cosets of $H$ in $G$. Then there is a natural mapping $\pi$ of $G$ onto $B$ defined as follows: for every $g \in G$, $\pi(g)$ is the coset of $B$ containing $g$.

If $G$ is a compact Lie group, then $\pi$ is a fibre map in the sense of Hurewicz and Steenrod [2]. A slicing function can be defined as follows: a plane of maximum dimension independent of the tangent plane to $H$ at the identity $1$ meets each coset $b$ in a sufficiently small neighborhood $U$ of $b_0 = \pi(1)$ just once. We denote this point by $\phi(1, b)$. Then if $g^{-1}b \in U$, let $\phi(g, b) = g\,\phi(1, g^{-1}b)$. Evidently $\phi$ has all the required properties of a slicing function.

The following theorem will be useful in our discussion of the rotation groups: 

Theorem 2. If $G$ is a topological group and $B$ is the space of left (or right) cosets of a closed subgroup $H$ of $G$, and if there exists a map $f(B) \subset G$ such that $\pi f(b) = b$, then $G$ is homeomorphic with the product space $H \times B$.

We may suppose $B$ is a space of left cosets. Let $f'(b) = f(b) \cdot [f(b_0)]^{-1}$; then $\pi f'(b) = \pi f(b) = b$, and $f'(b_0) = 1$. We then set up the homeomorphism by means of two maps $p(G) = H \times B$ and $q(H \times B) = G$ defined as follows:

$$
\begin{aligned}
p(g) &= [\,g^{-1} \cdot f'(\pi g),\ \pi g\,] && (g \in G), \\
q(h, b) &= f'(b) \cdot h^{-1} && (h \in H,\ b \in B).
\end{aligned}
$$

Then

$$
\begin{aligned}
p[q(h, b)] &= \{ h\,[f'(b)]^{-1} f'(b),\ b \} = (h, b), \\
q[p(g)] &= f'(\pi g) \cdot [f'(\pi g)]^{-1} \cdot g = g,
\end{aligned}
$$

and both maps are continuous. Hence $G$ and $H \times B$ are homeomorphic.

5. Slicing functions for $R_n \to R_n/R_{n-1}$

In this section the results of Section 4 are applied to the special case $G = R_n$, $H = R_{n-1}$, and an explicit slicing function is constructed. The special cases $n = 1, 3, 7$ are treated separately, and for these values of $n$ Theorem 2 is applied.

Let us consider the mapping $\pi(R_n) = S^n$ defined by $\pi(r) = r(x^0)$ $(r \in R_n)$. As shown by Hurewicz and Steenrod [2], $\pi$ is a fibre map of $R_n$ into $S^n$, the fibres being the left cosets of $R_{n-1}$ in $R_n$. In terms of the matrix representation of $R_n$, $\pi(r)$ is the last column of the matrix $r$.





We now define the slicing function promised above: if $x \neq \bar{x}^0$, let $\phi(I, x)$ be the rotation carrying $x^0$ along a great circle into $x$, and leaving the $(n-2)$-sphere orthogonal to this great circle fixed. In terms of matrices

$$
(1) \qquad \phi(I, x) = A_n(x) = \left\| \delta_{ij} - \frac{(x_i + \delta_{i,n+1})(x_j + \delta_{j,n+1})}{x_{n+1} + 1} \right\| \cdot \begin{pmatrix} I_n & 0 \\ 0 & -1 \end{pmatrix} \qquad (i, j = 1, \cdots, n+1),
$$

where $I_n$ denotes the $n$-rowed identity matrix. Then $\phi(r, x)$ is defined as in Section 4. Evidently it is impossible to extend $\phi(I, x)$ so as to be defined and continuous over all of $S^n$. We shall show later that there is no slicing function with this property for a general $n$.
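
Formula (1) lends itself to a direct numerical check. The sketch below is illustrative only: the function names `slicing_matrix`, `gram` and `det` are hypothetical, and the sample point is arbitrary. It builds $A_n(x)$ from (1) and confirms that it is a rotation whose last column, that is $\pi A_n(x)$, is $x$.

```python
import math

def slicing_matrix(x):
    """A_n(x) from formula (1); x is a unit vector of length n + 1, not the south pole."""
    m = len(x)
    denom = x[-1] + 1.0
    if abs(denom) < 1e-12:
        raise ValueError("A_n(x) is undefined at the south pole")
    v = list(x)
    v[-1] += 1.0                      # v_i = x_i + delta_{i, n+1}
    a = [[(1.0 if i == j else 0.0) - v[i] * v[j] / denom for j in range(m)]
         for i in range(m)]
    for row in a:                     # right-multiply by diag(I_n, -1)
        row[-1] = -row[-1]
    return a

def gram(a):
    """Return a^T a; the identity exactly when a is orthogonal."""
    m = len(a)
    return [[sum(a[k][i] * a[k][j] for k in range(m)) for j in range(m)]
            for i in range(m)]

def det(a):
    """Determinant by Gaussian elimination with partial pivoting."""
    a = [row[:] for row in a]
    m = len(a)
    d = 1.0
    for c in range(m):
        p = max(range(c, m), key=lambda r: abs(a[r][c]))
        if abs(a[p][c]) < 1e-15:
            return 0.0
        if p != c:
            a[c], a[p] = a[p], a[c]
            d = -d
        d *= a[c][c]
        for r in range(c + 1, m):
            f = a[r][c] / a[c][c]
            for k in range(c, m):
                a[r][k] -= f * a[c][k]
    return d

raw = [1.0, 2.0, 3.0, 4.0]
norm = math.sqrt(sum(c * c for c in raw))
x = [c / norm for c in raw]           # a point of S^3
A = slicing_matrix(x)
```

The division by $x_{n+1} + 1$ is where the construction breaks down at the south pole, in agreement with the remark that $\phi(I, x)$ cannot be extended over all of $S^n$ by this formula.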

For the dimensions 1, 3, and 7, however, it is possible to define such a slicing function. Let $\mathfrak{A}_{n+1}$ $(n = 1, 3, 7)$ denote the algebras of complex numbers, quaternions, and Cayley numbers, respectively, over the field of real numbers. By means of these algebras a multiplication $x \cdot y$ of points of Euclidean $(n+1)$-space is defined. This multiplication has the property that $\|x \cdot y\| = \|x\| \cdot \|y\|$, where $\|x\|^2 = \sum x_i^2$ is the square of the distance of the point $x$ from the origin. Hence $S^n$ is closed under multiplication and for $x, y \in S^n$ we have $x \cdot y = B_n(x) \cdot y$, where $B_n(x) \in R_n$, and, if the coordinate system is chosen so that $x^0$ is the unit of the algebra, $B_n(x^0) = I$, $\pi B_n(x) = x$. Since $B_n(x)$ is defined for all $x \in S^n$, we have

Theorem 3. For $n = 1, 3, 7$, $R_n$ is a product space $R_{n-1} \times S^n$.

Since the $i$th homotopy group of a product space $X \times Y$ is the direct sum of the $i$th homotopy groups of $X$ and $Y$, we have

Corollary. For $n = 1, 3, 7$, $\pi_i(R_n)$ is the direct sum $\pi_i(R_{n-1}) + \pi_i(S^n)$; in particular, $\pi_{n-1}(R_{n-1}) = \pi_{n-1}(R_n)$.
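
For $n = 3$ the product decomposition can be made concrete. The following is a minimal sketch under stated assumptions, not the paper's own computation: the quaternion unit is placed in the last coordinate, so that $x^0 = (0, 0, 0, 1)$ is the unit; `b3`, `givens`, `p` and `q` are hypothetical names; and the maps $p$, $q$ are those of Theorem 2 with $f' = B_3$.

```python
import math

def b3(x):
    """B_3(x): left multiplication by x4 + i*x1 + j*x2 + k*x3 (unit in last coordinate)."""
    x1, x2, x3, x4 = x
    return [
        [ x4, -x3,  x2, x1],
        [ x3,  x4, -x1, x2],
        [-x2,  x1,  x4, x3],
        [-x1, -x2, -x3, x4],
    ]

def mat_mul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def transpose(a):
    return [list(col) for col in zip(*a)]

def givens(angle):
    """Rotation of 4-space in the plane of the first two coordinates."""
    c, s = math.cos(angle), math.sin(angle)
    return [[c, -s, 0, 0], [s, c, 0, 0], [0, 0, 1, 0], [0, 0, 0, 1]]

def p(g):
    """p(g) = (g^{-1} f'(pi g), pi g); the inverse of a rotation is its transpose."""
    b = [row[3] for row in g]                 # pi(g) is the last column of g
    return mat_mul(transpose(g), b3(b)), b

def q(h, b):
    """q(h, b) = f'(b) h^{-1}."""
    return mat_mul(b3(b), transpose(h))

a = (0.5, -0.5, 0.5, 0.5)                     # a unit quaternion
g = mat_mul(b3(a), givens(0.7))               # a sample rotation in R_3
h, b = p(g)
g_back = q(h, b)
```

Here `h` leaves the north pole fixed, so it lies in the subgroup $R_2$, and `q(p(g))` recovers `g`.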


6. The canonical map of $S^n$ in $R_n$


We now introduce a mapping $C_n$ of $S^n$ into $R_n$ which plays an important role in the following discussion. We shall refer to it as the canonical map. It is proved that this map is contractible into $R_{n-1}$ if $n$ is even; while for $n$ odd it is not so contractible. The canonical map is then used to construct a generator for $\pi_n(R_n, R_{n-1})$.

In order to define the map $C_n$, let $\theta(x)$ $(x \in S^n)$ denote the angular distance from $x^0$ to $x$, and let $x'$ be the point in the great circle through $x^0$ and $x$ with $\theta(x') = 2\theta(x)$. Let $C_n(x)$ $(x \neq x^0)$ be the rotation which carries $x^0$ along a great circle into $x'$ and leaves the orthogonal $(n-2)$-sphere fixed; and let $C_n(x^0) = I$. Evidently

$$
(2) \qquad C_n(x) = [A_n(x)]^2 = \| \delta_{ij} - 2x_i x_j \| \cdot \begin{pmatrix} I_n & 0 \\ 0 & -1 \end{pmatrix} \qquad (i, j = 1, \cdots, n+1).
$$

We observe that $C_n$, unlike $A_n$, is defined and continuous over all of $S^n$. Furthermore, antipodal points have the same image, while distinct pairs of antipodal points have distinct images. Thus the image of $S^n$ in $R_n$ is a projective $n$-space.
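
The two assertions made about (2), namely that $C_n(x) = [A_n(x)]^2$ and that antipodal points have the same image, can be spot-checked numerically. The sketch below is illustrative only; `canonical_matrix`, `slicing_matrix`, `mat_mul` and `close` are hypothetical helper names, and the sample point is arbitrary.

```python
import math

def canonical_matrix(x):
    """C_n(x) = ||delta_ij - 2 x_i x_j|| * diag(I_n, -1), as in formula (2)."""
    m = len(x)
    c = [[(1.0 if i == j else 0.0) - 2.0 * x[i] * x[j] for j in range(m)]
         for i in range(m)]
    for row in c:
        row[-1] = -row[-1]
    return c

def slicing_matrix(x):
    """A_n(x) from formula (1); undefined at the south pole."""
    m = len(x)
    denom = x[-1] + 1.0
    v = list(x)
    v[-1] += 1.0
    a = [[(1.0 if i == j else 0.0) - v[i] * v[j] / denom for j in range(m)]
         for i in range(m)]
    for row in a:
        row[-1] = -row[-1]
    return a

def mat_mul(a, b):
    m = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(m)) for j in range(m)]
            for i in range(m)]

def close(a, b, tol=1e-12):
    """Entrywise comparison of two square matrices."""
    return all(abs(a[i][j] - b[i][j]) < tol
               for i in range(len(a)) for j in range(len(a)))

raw = [2.0, -1.0, 0.5, 1.5, 3.0]
norm = math.sqrt(sum(c * c for c in raw))
x = [c / norm for c in raw]                    # a point of S^4
C = canonical_matrix(x)
A = slicing_matrix(x)
minus_x = [-c for c in x]
```

The last column of `C` is the projection $\pi C_n(x)$, whose coordinates are $2x_i x_{n+1} - \delta_{i,n+1}$.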






Let $g_n(S^n) = S^n$ denote the projection $\pi C_n$ of the canonical map. If $g_{n,i}(x)$ is the $i$th coordinate of $g_n(x)$, we have $g_{n,i}(x) = 2x_i x_{n+1} - \delta_{i,n+1}$ $(i = 1, \cdots, n+1)$.

Theorem 4. If $n$ is even, $g_n$ has degree zero; if $n$ is odd, $g_n$ has degree two.

For $g_n$ maps the equator $S^{n-1}$ into $\bar{x}^0$ and maps $V^n_+ - S^{n-1}$ topologically on $S^n - \bar{x}^0$; in fact, $g_n$ can be obtained by a homotopy of $S^n$ on itself in which each point moves along the great circle joining it to the north pole. Thus $g_n$ maps $V^n_+$ on $S^n$ with degree $1$. Furthermore, $g_n$ maps $V^n_-$ on $S^n$ with degree $(-1)^{n+1}$; for $g_n(x) = g_n(\bar{x})$ $(x \in V^n_-)$ and the antipodal transformation $x \to \bar{x}$ has degree $(-1)^{n+1}$. Hence $g_n$ maps $S^n$ on itself with degree $1 + (-1)^{n+1}$.
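
In the lowest odd case $n = 1$ the degree is a winding number and can be computed directly. The following sketch is an illustration only: `winding_number` is a hypothetical helper that sums unwrapped angle increments around the circle, and `g1` is the map $g_1$ written out from $g_{n,i}(x) = 2x_i x_{n+1} - \delta_{i,n+1}$.

```python
import math

def g1(x):
    """g_1 on the circle: (x1, x2) -> (2 x1 x2, 2 x2^2 - 1)."""
    x1, x2 = x
    return (2.0 * x1 * x2, 2.0 * x2 * x2 - 1.0)

def antipode(x):
    """The antipodal transformation of the circle."""
    return (-x[0], -x[1])

def winding_number(f, samples=2000):
    """Degree of a map of the circle into itself, by accumulating angle increments."""
    total = 0.0
    prev = None
    for k in range(samples + 1):
        t = 2.0 * math.pi * k / samples
        u, v = f((math.cos(t), math.sin(t)))
        ang = math.atan2(v, u)
        if prev is not None:
            d = ang - prev
            while d > math.pi:        # unwrap jumps across the branch cut
                d -= 2.0 * math.pi
            while d <= -math.pi:
                d += 2.0 * math.pi
            total += d
        prev = ang
    return round(total / (2.0 * math.pi))

degree_g1 = winding_number(g1)
degree_antipode = winding_number(antipode)
```

Both values agree with the proof: the antipodal map of $S^1$ has degree $(-1)^{2} = 1$, and $g_1$ has degree $1 + 1 = 2$.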

If $n$ is even, $g_n$ has degree zero, and hence is homotopic to a point. We shall give a homotopy of $g_n$ which will be useful in a later section. This homotopy $g_n(x, t)$ is given by the equations

$$
(3) \qquad
\begin{aligned}
g_{n,2i-1}(x, t) &= 2\{(1 - t)\,x_{2i-1} x_{n+1} + [t(1 - t)]^{1/2} x_{2i}\}, \\
g_{n,2i}(x, t) &= 2\{(1 - t)\,x_{2i} x_{n+1} - [t(1 - t)]^{1/2} x_{2i-1}\} \qquad (i = 1, \cdots, n/2), \\
g_{n,n+1}(x, t) &= 1 - 2(1 - t)(1 - x_{n+1}^2).
\end{aligned}
$$

It is easy to verify that $g_n(x, t)$ contracts $g_n(S^n)$ over $S^n$ into $x^0$.
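
The verification can also be carried out numerically. The sketch below is illustrative: `g_n`, `contract_even` and `random_point` are hypothetical names, and the sample points are pseudo-random. It evaluates the homotopy (3) on $S^4$ and checks that the image stays on the unit sphere, starts at $g_n$, and ends at the north pole.

```python
import math
import random

def g_n(x):
    """g_n(x)_i = 2 x_i x_{n+1} - delta_{i, n+1}."""
    last = x[-1]
    out = [2.0 * xi * last for xi in x]
    out[-1] -= 1.0
    return out

def contract_even(x, t):
    """Homotopy (3) for even n; x has n + 1 coordinates."""
    n = len(x) - 1
    if n % 2 != 0:
        raise ValueError("formula (3) requires n even")
    last = x[-1]
    c = math.sqrt(t * (1.0 - t))
    out = []
    for i in range(0, n, 2):              # coordinate pairs (x_{2i-1}, x_{2i})
        a, b = x[i], x[i + 1]
        out.append(2.0 * ((1.0 - t) * a * last + c * b))
        out.append(2.0 * ((1.0 - t) * b * last - c * a))
    out.append(1.0 - 2.0 * (1.0 - t) * (1.0 - last * last))
    return out

def random_point(n, rng):
    """A pseudo-random point of S^n."""
    v = [rng.gauss(0.0, 1.0) for _ in range(n + 1)]
    r = math.sqrt(sum(c * c for c in v))
    return [c / r for c in v]

rng = random.Random(0)
points = [random_point(4, rng) for _ in range(20)]
times = [k / 10.0 for k in range(11)]
```

The cross terms in each coordinate pair cancel, which is why the sum of squares stays equal to one for every $t$.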

If $n$ is odd, $g_n$ has degree two. We shall give a deformation of $g_n$ into a second map $g'_n$ of degree two. The latter map is defined by the equations

$$
(4) \qquad
\begin{aligned}
g'_{n,i}(x) &= x_i \qquad (i = 1, \cdots, n - 1); \\
g'_{n,n}(x) &= \frac{2x_n x_{n+1}}{(x_n^2 + x_{n+1}^2)^{1/2}}, \qquad
g'_{n,n+1}(x) = \frac{x_{n+1}^2 - x_n^2}{(x_n^2 + x_{n+1}^2)^{1/2}} \qquad (x_n^2 + x_{n+1}^2 \neq 0), \\
g'_{n,n}(x) &= g'_{n,n+1}(x) = 0 \qquad (x_n = x_{n+1} = 0).
\end{aligned}
$$

It is easily seen that $g'_n$ is defined and continuous over all of $S^n$. The homotopy $g_n(x, t)$ of $g_n$ over $S^n$ into $g'_n$ is given by

$$
(5) \qquad
\begin{aligned}
g_{n,2i-1}(x, t) &= t\,x_{2i-1} + [t(1 - t)]^{1/2} x_{2i} + 2x_{n+1}\,\frac{(1 - t)\,x_{2i-1} - [t(1 - t)]^{1/2} x_{2i}}{\{1 - t[1 - (x_n^2 + x_{n+1}^2)]\}^{1/2}}, \\
g_{n,2i}(x, t) &= t\,x_{2i} - [t(1 - t)]^{1/2} x_{2i-1} + 2x_{n+1}\,\frac{(1 - t)\,x_{2i} + [t(1 - t)]^{1/2} x_{2i-1}}{\{1 - t[1 - (x_n^2 + x_{n+1}^2)]\}^{1/2}} \qquad (i = 1, \cdots, (n-1)/2), \\
g_{n,n}(x, t) &= \frac{2x_n x_{n+1}}{\{1 - t[1 - (x_n^2 + x_{n+1}^2)]\}^{1/2}}, \\
g_{n,n+1}(x, t) &= \frac{2x_{n+1}^2}{\{1 - t[1 - (x_n^2 + x_{n+1}^2)]\}^{1/2}} - \{1 - t[1 - (x_n^2 + x_{n+1}^2)]\}^{1/2}.
\end{aligned}
$$

Although $g_n(x, t)$ is not defined everywhere for $t = 1$, it is easily verified that $\lim_{t \to 1} g_n(x, t) = g'_n(x)$ uniformly in $x$.
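
As a sanity check on (4) and (5), the homotopy can be evaluated numerically for odd $n$. This is a minimal sketch under stated assumptions: the names `g_n`, `g_prime`, `deform_odd` and `random_point` are hypothetical, the index range in (5) is taken to be $i = 1, \cdots, (n-1)/2$, and the test points are pseudo-random (so that $x_n^2 + x_{n+1}^2 \neq 0$ and the value at $t = 1$ is defined).

```python
import math
import random

def g_n(x):
    """g_n(x)_i = 2 x_i x_{n+1} - delta_{i, n+1}."""
    last = x[-1]
    out = [2.0 * xi * last for xi in x]
    out[-1] -= 1.0
    return out

def g_prime(x):
    """The map g'_n of formula (4)."""
    xn, last = x[-2], x[-1]
    rho = math.sqrt(xn * xn + last * last)
    out = list(x[:-2])
    if rho == 0.0:
        out += [0.0, 0.0]
    else:
        out += [2.0 * xn * last / rho, (last * last - xn * xn) / rho]
    return out

def deform_odd(x, t):
    """Homotopy (5) for odd n; x has n + 1 coordinates."""
    n = len(x) - 1
    if n % 2 == 0:
        raise ValueError("formula (5) requires n odd")
    xn, last = x[-2], x[-1]
    d = math.sqrt(1.0 - t * (1.0 - (xn * xn + last * last)))
    if d == 0.0:
        raise ValueError("formula (5) is undefined at this point for t = 1")
    c = math.sqrt(t * (1.0 - t))
    u = 2.0 * last / d
    out = []
    for i in range(0, n - 1, 2):          # coordinate pairs among x_1, ..., x_{n-1}
        a, b = x[i], x[i + 1]
        out.append(t * a + c * b + u * ((1.0 - t) * a - c * b))
        out.append(t * b - c * a + u * ((1.0 - t) * b + c * a))
    out.append(2.0 * xn * last / d)
    out.append(2.0 * last * last / d - d)
    return out

def random_point(n, rng):
    """A pseudo-random point of S^n."""
    v = [rng.gauss(0.0, 1.0) for _ in range(n + 1)]
    r = math.sqrt(sum(c * c for c in v))
    return [c / r for c in v]

rng = random.Random(1)
points = [random_point(5, rng) for _ in range(20)]
times = [k / 10.0 for k in range(11)]
```

The only points at which the formula fails for $t = 1$ are those with $x_n = x_{n+1} = 0$, where (4) assigns the value zero to the last two coordinates.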

We now use the canonical map to construct a generator of $\pi_n(R_n, R_{n-1})$. We shall take as fixed reference points for this group the north pole of $S^{n-1}$ and the identity matrix $I \in R_{n-1}$. Since ([2], Theorem 2) $\pi_n(R_n, R_{n-1})$ is isomorphic to $\pi_n(S^n)$ under the map $\pi(R_n) = S^n$, a generator of the former group is represented by any relative $n$-cell in $R_n$ modulo $R_{n-1}$ which projects into $S^n$ with degree $\pm 1$. To define such a relative $n$-cell, we observe that $C_n$ maps $V^n_+$ into $R_n$ and $S^{n-1}$ into the coset $\bar{R}_{n-1}$ opposite to $R_{n-1}$ and projects into $S^n$ with degree $1$. Hence the map

$$
D_n(x) = C_n(x) \cdot \begin{pmatrix} I_{n-1} & 0 \\ 0 & -I_2 \end{pmatrix} \qquad (x \in V^n_+)
$$

defines a relative cell in $R_n$ modulo $R_{n-1}$, and it is easily verified that $D_n(x^0) = I$, while $d_n(x) = \pi D_n(x) = \overline{g_n(x)}$ has degree $(-1)^{n+1}$. Hence $D_n$ represents the required generator.

Since $\pi_{n-1}(R_n, R_{n-1}) = \pi_{n-1}(S^n) = 0$, it follows from the results of Section 3 that $\pi_{n-1}(R_n) = \pi_{n-1}(R_{n-1})/\pi_{n-1,0}(R_{n-1})$. Thus $\pi_{n-1}(R_n)$ is a factor group of $\pi_{n-1}(R_{n-1})$, the kernel of the homomorphism being the group $\pi_{n-1,0}(R_{n-1})$. But $\pi_{n-1,0}(R_{n-1}) = \omega[\pi_n(R_n, R_{n-1})]$; hence a generator of $\pi_{n-1,0}(R_{n-1})$ is given by the map $\omega D_n$, which is easily shown to be the canonical map $C_{n-1}$. Hence

Theorem 5. The kernel of the homomorphism $\pi_{n-1}(R_{n-1}) \to \pi_{n-1}(R_n)$ is the subgroup of the former group generated by the canonical map.
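
The identification of $\omega D_n$ with $C_{n-1}$ can be checked on matrices. The sketch below is illustrative only, with hypothetical helper names `canonical_matrix` and `d_matrix`: it takes $n = 3$, restricts $D_3$ to a point of the equator $S^2$ (last coordinate zero), and compares the result with $C_2$ bordered by a 1 in the lower right hand corner.

```python
import math

def canonical_matrix(x):
    """C_n(x) = ||delta_ij - 2 x_i x_j|| * diag(I_n, -1)."""
    m = len(x)
    c = [[(1.0 if i == j else 0.0) - 2.0 * x[i] * x[j] for j in range(m)]
         for i in range(m)]
    for row in c:
        row[-1] = -row[-1]
    return c

def d_matrix(x):
    """D_n(x) = C_n(x) * diag(I_{n-1}, -I_2): negate the last two columns of C_n(x)."""
    d = canonical_matrix(x)
    for row in d:
        row[-1] = -row[-1]
        row[-2] = -row[-2]
    return d

raw = [1.0, -2.0, 2.0]
norm = math.sqrt(sum(c * c for c in raw))
equator_point = [c / norm for c in raw] + [0.0]    # a point of S^2 inside S^3
D = d_matrix(equator_point)
C_lower = canonical_matrix(equator_point[:3])       # C_2 at the same point of S^2
D_at_pole = d_matrix([0.0, 0.0, 1.0, 0.0])          # north pole of the equator S^2
```

On the equator the matrix $D_n(x)$ fixes the north pole of $S^n$, so it lies in $R_{n-1}$, and its upper left block is $C_{n-1}(x)$; at the north pole of $S^{n-1}$ it reduces to the identity.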


7. On the possibility of sectioning the cosets of $R_{n-1}$ in $R_n$

In this section the following question is considered: Is there an $n$-sphere in $R_n$ which projects into $S^n$ with degree 1? It is shown that this question can be answered in the negative for certain values of $n$. For other dimensions the question remains open.

The first step toward the solution of this problem appears in 

Theorem 6. The following conditions are equivalent:

1) there is a map $F(S^n) \subset R_n$ such that $\pi F$ has degree one;

2) $R_n$ can be represented as a product space $R_{n-1} \times S^n$;

3) the homomorphism $\pi_{n-1}(R_{n-1}) \to \pi_{n-1}(R_n)$ is an isomorphism;

4) the canonical map of $S^{n-1}$ into $R_{n-1}$ is homotopic to a point in $R_{n-1}$.

The first condition implies the second. For let $F(S^n) \subset R_n$ be such that $\pi F$ has degree one. Since $\pi F$ is homotopic to the identity, it follows from the covering homotopy theorem ([2], Theorem 1) that $F$ is homotopic to a map $F'(S^n) \subset R_n$ such that $\pi F'(x) = x$. Then by Theorem 2, $R_n = R_{n-1} \times S^n$.

That the second condition implies the third we have observed in the proof of the Corollary to Theorem 3; that the third implies the fourth follows from Theorem 5.

The fourth implies the first; for during the homotopy of $C_{n-1}$ to a point a relative $n$-cell is swept out in $R_{n-1}$ whose boundary is the $(n-1)$-sphere defined by the canonical map. But the map $D_n$ defines a relative $n$-cell in $R_n$ with the same boundary as the first one. Joining these two cells by identifying corresponding points on the boundaries, we obtain a sphere in $R_n$ whose projection has the same degree $(-1)^{n+1}$ as that of $d_n$. The required map is then easily constructed.

Theorem 6 enables us to answer immediately the question posed above for the case when $n$ is even. For suppose that $n$ is even and suppose that such a map exists. Then by the fourth condition in Theorem 6, the canonical map $C_{n-1}$ is homotopic to a point in $R_{n-1}$. Hence $g_{n-1} = \pi C_{n-1}$ is inessential. But we have already shown that $g_{n-1}$ has degree two. This contradiction completes the proof of

Theorem 7. If $n$ is even, there is no map $F(S^n) \subset R_n$ such that $\pi F$ has degree one.

We now turn to the much more difficult case where n is odd. A partial answer 
to the question is obtained in 

Theorem 8. If $n > 1$ and $n \equiv 1 \pmod 4$, there is no map $F(S^n) \subset R_n$ such that $\pi F$ has degree one.

The proof may be outlined as follows:

1) Since for $n$ odd, $g_{n-1}$ is homotopic to a point, it follows that $C_{n-1}$ is homotopic to a map $G_{n-1}(S^{n-1}) \subset R_{n-2}$. Such a homotopy is exhibited, and it is proved that $k_{n-1} = \pi G_{n-1}$ is essential if $n \equiv 1 \pmod 4$, and inessential if $n \equiv 3 \pmod 4$.

2) A generator $P_n(V^n_+) \subset R_{n-1}$ of the group $\pi_n(R_{n-1}, R_{n-2})$ is constructed for all $n \ge 5$, and the projection $p_{n-1}$ of the map $\omega P_n(S^{n-1}) \subset R_{n-2}$ is shown to be inessential.

When these steps have been established, the proof of the theorem may be completed as follows: Suppose that $n > 1$ and $n \equiv 1 \pmod 4$ and suppose that the theorem is not true. Then $C_{n-1}$, and consequently $G_{n-1}$, is homotopic to a point in $R_{n-1}$. Since $G_{n-1}$ is not homotopic to a point in $R_{n-2}$, the deformation of $G_{n-1}$ defines a relative $n$-cell in $R_{n-1}$ modulo $R_{n-2}$, and hence an essential element of $\pi_n(R_{n-1}, R_{n-2})$. This group has been shown by Freudenthal [3] to be the cyclic group of period 2 if $n \ge 4$. Hence $\omega P_n$ and $G_{n-1}$ are homotopic in $R_{n-2}$. But $p_{n-1}$ and $k_{n-1}$ are not homotopic, a contradiction.

Lemma. If $n$ is odd, the canonical map $C_{n-1}$ is homotopic in $R_{n-1}$ to a map $G_{n-1}(S^{n-1}) \subset R_{n-2}$, whose projection into $S^{n-2}$ is essential if $n \equiv 1 \pmod 4$ and inessential if $n \equiv 3 \pmod 4$.

Let $E^n$ denote the closed $n$-cell bounded by the unit sphere $S^{n-1}$ in Euclidean $n$-space. $H^n$ denotes the upper half $x_n \ge 0$ of $E^n$. Let points $y \in H^n$ be represented by coordinates $(x, r)$, where $x \in V^{n-1}_+$ is the central projection of the point $y$ on $V^{n-1}_+$, and $r$ is the distance of the point $y$ from the origin. Let

$$
H(r) = \begin{pmatrix}
c & s & 0 & 0 & \cdots & 0 \\
-s & c & 0 & 0 & \cdots & 0 \\
0 & 0 & c & s & \cdots & 0 \\
0 & 0 & -s & c & \cdots & 0 \\
\vdots & \vdots & \vdots & \vdots & \ddots & \vdots \\
0 & 0 & 0 & 0 & \cdots & 1
\end{pmatrix},
$$

where $c = 1 - 2r^2$, $s = 2r(1 - r^2)^{1/2}$ $(0 \le r \le 1)$. Let


G(y) = G(x, r) 

( 6 ) 





(« « 7i; 0 ^ r ^ l)s 


map $H^n$ into $R_{n-1}$. Evidently $G(x) = C_{n-1}[g_{n-1}(x)]$ for $x \in V^{n-1}_+$, while $G$ maps the equatorial plane into $R_{n-2}$ and $S^{n-2}$ into a single point. Deform $V^{n-1}_+$ over $H^n$ into $E^{n-1}$ by letting each point move along the perpendicular joining it to the plane $x_n = 0$. The image of $G$ under this homotopy gives a homotopy of $C_{n-1}[g_{n-1}(x)]$ to a map $K(V^{n-1}_+) \subset R_{n-2}$. Since under the homotopy $S^{n-2}$ remains fixed, $K$ maps $S^{n-2}$ into a single point. Evidently $K(x)$ can be represented in the form $K(x) = G_{n-1}[g_{n-1}(x)]$, where $G_{n-1}(S^{n-1}) \subset R_{n-2}$ is defined and continuous over all of $S^{n-1}$, and $G_{n-1}$ is homotopic in $R_{n-1}$ to $C_{n-1}$.

By multiplying out the matrices in (6) and computing the homotopy, we find 
that the map &„_i = 7r(r„_i of S n ~ x into S n ~ 2 is represented by the equations 


2/2i-1 = 


2 (x 2 ,_i X n -2 + X2iXn-l) 

(1 - 4 )* 


( 7 ) 


Da = 

Vn—2 = 


2(x«:r„_2 — X2i-iXn-i) 

d’- 4 ) 1 

2(xl_2 + x*_i) _ n __ 

1 ( n) ’ 


2/n—l 



(1 - arl ^ 0); 


= 0 (i = 1, • • ■, n — 2), 2 /«-i = x‘n (1 ~ x 2 = 0). 

It follows easily that $k_{n-1}$ is defined and continuous over all of $S^{n-1}$ and maps
$S^{n-1}$ on $S^{n-2}$. In order to investigate this map, let us consider the map
$m_{n-1}(S^{n-2}) = S^{n-3}$ obtained by setting $x_n = y_{n-1} = 0$ in (7). This map can be
studied more easily in complex coordinates. Let $z_j = x_{2j-1} + i x_{2j}$, $w_j =
y_{2j-1} + i y_{2j}$ $(j = 1, \dots, (n-1)/2 = k)$, where $w_k = y_{n-2}$ is real. Then $S^{n-2}$
and $S^{n-3}$ are given by the equations $\sum z_j \bar z_j = 1$, $\sum w_j \bar w_j = 1$, respectively.
In terms of these coordinates the equations representing the map $m_{n-1}$ take
the form


$$
\begin{aligned}
w_j &= 2 z_j \bar z_k \qquad (j = 1, \dots, k-1), \\
w_k &= 2 z_k \bar z_k - 1.
\end{aligned} \tag{8}
$$


If the complex coordinates occurring in (8) are formally replaced by real ones,
the map $g_{k-1}$ is obtained. In equations (3) and (5) we have constructed
homotopies of $g_n$ for n even and odd, respectively. The functions defining these
homotopies can be extended so as to be defined for complex coordinates, as
follows: if k is odd, i.e., if $n \equiv 3$ (mod 4), let

$$ w_{2j-1} = 2\{(1 - t)\, z_{2j-1}\bar z_k + [t(1 - t)]^{1/2}\, \bar z_{2j}\}, $$



HOMOTOPY PROPERTIES OF REAL ORTHOGONAL GROUPS


$$
\begin{aligned}
w_{2j} &= 2\{(1 - t)\, z_{2j}\bar z_k - [t(1 - t)]^{1/2}\, \bar z_{2j-1}\} \qquad (j = 1, \dots, (k-1)/2), \\
w_k &= 1 - 2(1 - t)(1 - z_k \bar z_k).
\end{aligned} \tag{9}
$$

Evidently the homotopy (9) deforms $m_{n-1}$ over $S^{n-3}$ to a point. Hence $m_{n-1}$,
and consequently also $k_{n-1}$, is homotopic to a point. On the other hand, if k
is even, i.e., if $n \equiv 1$ (mod 4), let us consider the map $m'_{n-1}(S^{n-2}) \subset S^{n-3}$ defined
by the equations

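$$
\begin{aligned}
w_j &= z_j \qquad (j = 1, \dots, k-2), \\
w_{k-1} &= \frac{2 z_{k-1}\bar z_k}{(z_{k-1}\bar z_{k-1} + z_k \bar z_k)^{1/2}}, \\
w_k &= \frac{2 z_k \bar z_k}{(z_{k-1}\bar z_{k-1} + z_k \bar z_k)^{1/2}} - (z_{k-1}\bar z_{k-1} + z_k \bar z_k)^{1/2} \qquad (z_{k-1}\bar z_{k-1} + z_k \bar z_k \ne 0); \\
w_{k-1} &= w_k = 0 \quad \text{for } z_{k-1} = z_k = 0.
\end{aligned} \tag{10}
$$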


Setting $w_j = z_j = 0$ $(j = 1, \dots, k-2)$ in (10) and writing the result in real
coordinates, we obtain a map of $S^3$ on $S^2$ of Hopf invariant $\pm 1$ (cf. Hopf [4], § 5).
It follows from the results of Freudenthal [3] that $m'_{n-1}$ is essential.

We shall now define a deformation of $m_{n-1}$ to $m'_{n-1}$ (for $n \equiv 1$ (mod 4)) by
extending the functions in (5) so as to be defined for complex coordinates, as
follows: let


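$$
\begin{aligned}
w_{2j-1} &= t z_{2j-1} + [t(1-t)]^{1/2}\, \bar z_{2j} + 2\, \frac{(1-t)\, z_{2j-1}\bar z_k - [t(1-t)]^{1/2}\, \bar z_k \bar z_{2j}}{\{1 - t[1 - (z_{k-1}\bar z_{k-1} + z_k \bar z_k)]\}^{1/2}}, \\
w_{2j} &= t z_{2j} - [t(1-t)]^{1/2}\, \bar z_{2j-1} + 2\, \frac{(1-t)\, z_{2j}\bar z_k + [t(1-t)]^{1/2}\, \bar z_k \bar z_{2j-1}}{\{1 - t[1 - (z_{k-1}\bar z_{k-1} + z_k \bar z_k)]\}^{1/2}} \qquad (j = 1, \dots, (k-2)/2), \\
w_{k-1} &= \frac{2 z_{k-1}\bar z_k}{\{1 - t[1 - (z_{k-1}\bar z_{k-1} + z_k \bar z_k)]\}^{1/2}}, \\
w_k &= \frac{2 z_k \bar z_k}{\{1 - t[1 - (z_{k-1}\bar z_{k-1} + z_k \bar z_k)]\}^{1/2}} - \{1 - t[1 - (z_{k-1}\bar z_{k-1} + z_k \bar z_k)]\}^{1/2}.
\end{aligned} \tag{11}
$$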


Although this map is not defined everywhere for $t = 1$, it is readily verified that
as $t \to 1$, the functions in (11) converge to the corresponding ones in (10)
uniformly in $z$. Thus $m_{n-1}$ and $m'_{n-1}$ are homotopic, so that $m_{n-1}$, and
consequently also $k_{n-1}$, is essential. This completes the proof of the Lemma.

As indicated above, the next step in the proof is to construct a generator of
the group $\pi_n(R_{n-1}, R_{n-2})$. This group is isomorphic with $\pi_n(S^{n-1})$ under the
projection $\pi$ ([2], Theorem 2). If $n \ge 4$ we may take as generator of the latter
group the map $p_n(S^n) = S^{n-1}$ defined by


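$$
\begin{aligned}
p_{n,i}(x) &= x_i \qquad (i = 1, \dots, n-4), \\
p_{n,n-3}(x) &= \frac{2(x_{n-3}x_{n-1} + x_{n-2}x_n)}{(x_n^2 + x_{n-1}^2 + x_{n-2}^2 + x_{n-3}^2)^{1/2}}, \\
p_{n,n-2}(x) &= \frac{2(x_{n-2}x_{n-1} - x_{n-3}x_n)}{(x_n^2 + x_{n-1}^2 + x_{n-2}^2 + x_{n-3}^2)^{1/2}}, \\
p_{n,n-1}(x) &= \frac{x_n^2 + x_{n-1}^2 - x_{n-2}^2 - x_{n-3}^2}{(x_n^2 + x_{n-1}^2 + x_{n-2}^2 + x_{n-3}^2)^{1/2}}, \\
p_{n,n}(x) &= x_{n+1} \qquad (x_n^2 + x_{n-1}^2 + x_{n-2}^2 + x_{n-3}^2 \ne 0), \\
p_{n,n-3}(x) &= p_{n,n-2}(x) = p_{n,n-1}(x) = 0 \qquad (x_n = x_{n-1} = x_{n-2} = x_{n-3} = 0).
\end{aligned} \tag{12}
$$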

Evidently $p_n$ maps $V_1^n$ into $V_1^{n-1}$ and $S^{n-1}$ into $S^{n-2}$. The latter map is
essential; in fact, it represents a generator of $\pi_{n-1}(S^{n-2})$. Let $P_n(x) = D_{n-1}[p_n(x)]$
$(x \in V_1^n)$. Then $P_n$ represents the desired generator of $\pi_n(R_{n-1}, R_{n-2})$; for
$\pi P_n(x) = d_{n-1}[p_n(x)]$, and the latter map represents a generator of $\pi_n(S^{n-1}, x^0)$.

Let $P_n^* = \omega P_n$, and $p_{n-1}^* = \pi P_n^*$ its projection into $S^{n-2}$. Since $p_n$ maps
$S^{n-1}$ into $S^{n-2}$, and since $\omega D_{n-1} = C_{n-2}$, we have $P_n^*(x) = C_{n-2}[p_n(x)]$, and hence
$p_{n-1}^*(x) = g_{n-2}[p_n(x)]$ $(x \in S^{n-1})$. Let $\alpha$ denote the element of $\pi_{n-2}(S^{n-2})$
determined by the identity map, $\beta$ the element of $\pi_{n-1}(S^{n-2})$ defined by the map
$p_n(S^{n-1}) = S^{n-2}$. Since $\{g_{n-2}\} = 0$ or $2\alpha$ according as n is even or odd, we have
$\{p_{n-1}^*\} = 0$ or $4\beta$,$^2$ respectively. But Freudenthal [3] has proved that $2\beta = 0$
if $n \ge 5$; hence for all $n \ge 5$ we have $\{p_{n-1}^*\} = 0$, so that $p_{n-1}^*$ is inessential.
This completes the proof of the theorem.


8. Computation of the homotopy groups

We are now in a position to establish the results exhibited in Section 2. The
generators which we shall compute here will differ slightly from those already
exhibited; however, they may be easily seen to be homotopic.

We have already observed that $\pi_n(R_{n+1})$ is a homomorphic image of $\pi_n(R_n)$.
In a similar manner we can prove (cf. [2], Theorem 5) that $\pi_n(R_{n+k})$ $(k =
1, 2, \dots)$ is isomorphic with $\pi_n(R_{n+1})$. Another useful result is the following:

Theorem 9. If n is even, $\pi_n(R_n)$ is a factor group of $\pi_n(R_{n-1})$; the kernel of the
homomorphism is the group $\pi_{n,0}(R_{n-1})$.

Since $\{D_n\}$ is a generator of $\pi_n(R_n, R_{n-1})$ and since $\omega\{D_n\} = \{C_{n-1}\} \ne 0$ for
n even, it follows that $\pi_{n,0}(R_n, R_{n-1}) = 0$. Hence $\pi_n(R_n) = \pi_n(R_{n-1})/\pi_{n,0}(R_{n-1})$.

In order to compute $\pi_1(R_n)$, we first observe that, since $R_1$ is homeomorphic
to $S^1$, its fundamental group is the free cyclic group generated by any map of $S^1$
on $R_1$ of degree 1. Such a generator $\alpha_1$ is given by the map

$$
B_1(x) = \begin{pmatrix} x_2 & x_1 \\ -x_1 & x_2 \end{pmatrix}.
$$


Since $R_2$ is homeomorphic with projective 3-space $P^3$, it follows that $\pi_1(R_2)$


$^2$ This follows easily from Theorem II b' of [4].





is the cyclic group of order two. But $\pi_1(R_2)$ is a factor group of $\pi_1(R_1)$; hence
for a generator of $\pi_1(R_2)$ we may take the generator $\alpha_1$ for $\pi_1(R_1)$ subject to the
condition $2\alpha_1 = 0$ in $R_2$. The same result holds for $\pi_1(R_n)$.

Since all the higher homotopy groups of $S^1$ vanish, the same holds for $R_1$.
In particular, $\pi_2(R_1) = 0$. Then, by Theorem 9, $\pi_2(R_n) = 0$ for all n.

The higher homotopy groups of $R_2$ are isomorphic to those of its covering
space $S^3$. In particular, $\pi_3(R_2)$ is the free cyclic group, and a generator $\alpha_3$ is
represented by the covering map $H(S^3) = R_2$ given by the equation



$$
H(x) = \begin{pmatrix}
x_4^2 - x_3^2 - x_2^2 + x_1^2 & 2(x_1x_2 - x_3x_4) & 2(x_1x_3 + x_2x_4) \\
2(x_1x_2 + x_3x_4) & x_4^2 - x_3^2 + x_2^2 - x_1^2 & 2(x_2x_3 - x_1x_4) \\
2(x_1x_3 - x_2x_4) & 2(x_1x_4 + x_2x_3) & x_4^2 + x_3^2 - x_2^2 - x_1^2
\end{pmatrix}.
$$


We observe that $\pi H(x)$ maps $S^3$ on $S^2$ with Hopf invariant $\pm 1$.
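As a numerical sanity check on the covering map, the following Python sketch evaluates $H(x)$ at random points of $S^3$ and tests that each value is a rotation of 3-space and that antipodal points have the same image. It assumes the displayed matrix is the usual quaternion rotation matrix with scalar part $x_4$; that reading of the display, and every function name below, is an assumption made for illustration only.

```python
import math
import random


def hopf_cover(x):
    """Return the 3x3 matrix H(x) for x = (x1, x2, x3, x4), row by row.

    Assumed form: the standard rotation matrix of the quaternion with
    vector part (x1, x2, x3) and scalar part x4.
    """
    x1, x2, x3, x4 = x
    return [
        [x4*x4 - x3*x3 - x2*x2 + x1*x1, 2*(x1*x2 - x3*x4), 2*(x1*x3 + x2*x4)],
        [2*(x1*x2 + x3*x4), x4*x4 - x3*x3 + x2*x2 - x1*x1, 2*(x2*x3 - x1*x4)],
        [2*(x1*x3 - x2*x4), 2*(x1*x4 + x2*x3), x4*x4 + x3*x3 - x2*x2 - x1*x1],
    ]


def gram(m):
    """Return m * m^T, which is the identity exactly when m is orthogonal."""
    n = len(m)
    return [[sum(m[i][k] * m[j][k] for k in range(n)) for j in range(n)]
            for i in range(n)]


def det3(m):
    """Determinant of a 3x3 matrix by cofactor expansion along the first row."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))


def is_rotation(m, tol=1e-9):
    """True when m is orthogonal with determinant +1, up to the tolerance."""
    g = gram(m)
    orthogonal = all(abs(g[i][j] - (1.0 if i == j else 0.0)) < tol
                     for i in range(3) for j in range(3))
    return orthogonal and abs(det3(m) - 1.0) < tol


def random_unit_vector(dim, rng):
    """Sample a point of the unit sphere in R^dim from a Gaussian vector."""
    v = [rng.gauss(0.0, 1.0) for _ in range(dim)]
    norm = math.sqrt(sum(c * c for c in v))
    return [c / norm for c in v]
```

Because every entry of the matrix is quadratic in $x$, the equality $H(-x) = H(x)$ holds term by term, which is the two-to-one behaviour expected of a covering of $R_2$ by $S^3$.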

Since $R_3$ is the product space of $R_2$ and the quaternion subgroup $Q_3$, the
homotopy groups of $R_3$ are the direct sums of the groups of the same dimension
of $R_2$ and $Q_3 = S^3$. In particular, $\pi_3(R_3) = \pi_3(R_2) + \pi_3(S^3)$ is a free group with
two generators. For one of these generators we may take the generator $\alpha_3$ of
$\pi_3(R_2)$; the second generator $\beta_3$ is determined by the quaternion matrix


$$
B_3(x) = \begin{pmatrix}
x_4 & x_3 & -x_2 & x_1 \\
-x_3 & x_4 & x_1 & x_2 \\
x_2 & -x_1 & x_4 & x_3 \\
-x_1 & -x_2 & -x_3 & x_4
\end{pmatrix} \qquad (x \in S^3).
$$


To compute $\pi_3(R_4) = \pi_3(R_3)/\pi_{3,0}(R_3)$, we observe that


$$
C_3(x) = B_3[g_3(x)] \cdot \begin{pmatrix} H(x) & 0 \\ 0 & 1 \end{pmatrix},
$$


so that $\{C_3\} = 2\beta_3 + \alpha_3 = 0$ in $R_4$. Hence $\pi_3(R_4)$ is a free cyclic group with the
one generator $\beta_3$, and the same is true of $\pi_3(R_n)$ $(n \ge 4)$.

We now consider the groups $\pi_4$. Freudenthal [3] and Pontrjagin [5] have
proved that $\pi_4(S^3)$ is the cyclic group of order two, a generator being determined
by the map $p_4(S^4) = S^3$. Hence a generator $\alpha_4$ of $\pi_4(R_2)$ is determined by the
map $Hp_4$. The group $\pi_4(R_3)$ is the direct sum of two cyclic groups of order two;
for generators we may take $\alpha_4$ and $\beta_4 = \{B_3 p_4\}$.

We have observed in the proof of Theorem 8 that $\pi_{4,0}(R_3)$ is generated by
$\omega\{P_5\} = \{P_5^*\}$. But


$$
P_5^*(x) = B_3[p_4^*(x)] \cdot \begin{pmatrix} H[p_4(x)] & 0 \\ 0 & 1 \end{pmatrix}
$$


and $\{p_4^*\} = 0$, so that $\{P_5^*\} = \alpha_4 = 0$ in $R_4$. Hence $\pi_4(R_4)$ is the cyclic group
of order two generated by $\beta_4$.





We have proved in Theorem 8 that $\{C_4\} \ne 0$ in $R_4$. Hence $\pi_{4,0}(R_4) =
\pi_4(R_4)$, so that $\pi_4(R_5) = \pi_4(R_n) = 0$ $(n \ge 5)$.

We conclude by computing the five-dimensional homotopy groups of $R_n$.
Pontrjagin [5] has proved that $\pi_5(S^3) = 0$; hence $\pi_5(R_2) = \pi_5(R_3) = 0$. Hence
$\pi_5(R_4) = \pi_{5,0}(R_4, R_3) = 0$ since $\pi_{4,0}(R_3) \ne 0$. This in turn implies that $\pi_5(R_5) =
\pi_{5,0}(R_5, R_4)$. But $\pi_5(R_5)$ contains the essential element $\alpha_5 = \{C_5\}$, and since
$\pi C_5$ has degree two, it follows from Theorem 8 that $\pi_5(R_5)$ is the free cyclic group
generated by $\alpha_5$. Since $\alpha_5 = 0$ in $R_6$, we have finally that $\pi_5(R_6) = \pi_5(R_n) = 0$
$(n \ge 6)$.


9. Continuous vector fields over spheres 

The above results will now be applied to the study of continuous vector fields
over $S^n$. Geometrically, a continuous vector field may be thought of as a set of
functions $V^i(x)$ $(i = 1, \dots, n+1;\ x \in S^n)$ defining a unit vector tangent to $S^n$
at the point $x$, the functions $V^i(x)$ being continuous over all of $S^n$. A set of p
such fields are said to be independent if the vectors of the fields at each point
of $S^n$ are independent vectors.

Let $P_k$ denote the point of $S^n$ whose coordinates are $\delta_{ik}$ $(i, k = 1, \dots, n+1)$.
$R_{n-p}$ denotes the subgroup of $R_n$ consisting of all rotations leaving $P_k$ fixed
$(k = n-p+2, \dots, n+1)$. The coset space $R_n/R_{n-p}$ may be thought of as
the space of all sets of p mutually orthogonal points of $S^n$; for two elements of
$R_n$ are in the same coset of $R_{n-p}$ if and only if their last p columns are identical.
These columns define the orthogonal p-tuple associated with the given coset.
Conversely, given a set of orthogonal points $Q_1, Q_2, \dots, Q_p$, the required coset
is the set of all rotations carrying $P_{n-p+2}$ into $Q_1$, ..., and $P_{n+1}$ into $Q_p$.

Since $R_{n-p} \subset R_{n-1}$, each coset of $R_{n-p}$ lies in some coset of $R_{n-1}$. Thus a
natural mapping $R_n/R_{n-p} \to R_n/R_{n-1} = S^n$ is defined; each coset of $R_{n-p}$ is
mapped into the coset of $R_{n-1}$ containing it. We refer to this map as the
projection; it is easily verified that it is a fibre map. If we regard an element of
$R_n/R_{n-p}$ as being determined by a set of p mutually orthogonal points, the
projection is the $p$th point of the set.
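The description of cosets by last columns can be tried out numerically. The Python sketch below works in $R_3$ (rotations of 4-space, so $n = 3$) with $p = 2$: multiplying a rotation on the right by a rotation that fixes the last two coordinate points leaves the last two columns unchanged, while a rotation that moves one of those points changes them. The helper names and the particular angles are illustrative assumptions, not taken from the paper.

```python
import math


def identity(d):
    """The d x d identity matrix as a list of rows."""
    return [[1.0 if i == j else 0.0 for j in range(d)] for i in range(d)]


def plane_rotation(d, i, j, theta):
    """Rotation of R^d by the angle theta in the (i, j) coordinate plane (0-based)."""
    m = identity(d)
    c, s = math.cos(theta), math.sin(theta)
    m[i][i], m[i][j] = c, -s
    m[j][i], m[j][j] = s, c
    return m


def mat_mul(a, b):
    """Product of two square matrices of the same size."""
    d = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(d)) for j in range(d)]
            for i in range(d)]


def last_columns(m, p):
    """The last p columns of m, kept row by row."""
    return [row[len(row) - p:] for row in m]


def same_last_columns(a, b, p, tol=1e-12):
    """True when a and b agree in their last p columns, up to the tolerance."""
    return all(abs(u - v) < tol
               for ra, rb in zip(last_columns(a, p), last_columns(b, p))
               for u, v in zip(ra, rb))
```

With `a` any rotation and `b = plane_rotation(4, 0, 1, 0.7)`, the call `same_last_columns(a, mat_mul(a, b), 2)` is expected to hold, since `b` fixes the last two coordinate points; the final column alone plays the role of the projection into $S^3$.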

Since the usual process of orthogonalizing a set of independent vectors can be 
carried out here, there is no loss of generality in assuming that the vectors of the 
fields we are dealing with are orthogonal at each point and of unit length. 

Theorem 10. Every set of p orthogonal vector fields over $S^n$ defines a mapping
of $S^n$ into $R_n/R_{n-p-1}$ whose projection into $S^n$ is the identity; conversely, any such
map defines a set of p orthogonal vector fields.

By translating the vectors at any point $x$ to the center of $S^n$ and adjoining
the point $x$, we obtain an orthogonal (p + 1)-tuple whose projection is $x$. This
process is continuous and gives the required map. Conversely, given such a
map, we define the p orthogonal vectors at $x$ as follows: the given map associates
with $x$ an orthogonal (p + 1)-tuple, the (p + 1)st point of which is $x$. Translation
of radius vectors drawn to the first p points to the point $x$ gives the set of
vectors at $x$.

Corollary. There exists a set of p independent vector fields over $S^n$ if and
only if the canonical map of $S^{n-1}$ into $R_{n-1}$ is contractible in $R_{n-1}$ into $R_{n-p-1}$.

It follows from Theorem 7 that there is no vector field over $S^n$ if n is even.
However, if n is odd, it follows from the Lemma used in the proof of Theorem 8
that there is at least one. Over $S^3$ and $S^7$ there are three and seven orthogonal fields,
respectively. These are defined by the mappings $S^3 \to R_3 \to S^3$ and $S^7 \to
R_7 \to S^7$ of degree one given by the matrices $B_3$ and $B_7$ obtained in Section 5.

Theorem 11. If $n \equiv 3$ (mod 4) there are at least three orthogonal vector fields over
$S^n$; if $n \equiv 7$ (mod 8) there are at least seven.

We shall prove the theorem for $n = 4m + 3$; the proof for $n = 8m + 7$ is
entirely analogous. Let $V^i(x)$ $(i = 1, 2, 3;\ x \in S^3)$ define a set of orthogonal
vector fields over $S^3$; the components of the vector $V^i(x)$ will be denoted by
$V^i_j(x)$ $(j = 1, 2, 3, 4)$. We extend $V^i(x)$ to be defined over all of $E^4$ as follows:
if $y \in E^4$, let $x$ be the central projection of $y$ on $S^3$, $r$ the distance of $y$ from the
origin. Then $V^i(y) = rV^i(x)$ defines a set of three orthogonal vectors at each
point of $E^4$, and these vectors vanish only at the origin.

If $z$ is a point of $E^{4m+4}$ with coordinates $(z_1, \dots, z_{4m+4})$, let $x^j$ $(j = 1, \dots,
m + 1)$ denote the point of $E^4$ with coordinates $(z_{4j-3}, z_{4j-2}, z_{4j-1}, z_{4j})$. Then
we define three vector fields $W^i(z)$ over $E^{4m+4}$ as follows:

$$ W^i_{4j+k}(z) = V^i_k(x^{j+1}) \qquad (i = 1, 2, 3;\ j = 0, \dots, m;\ k = 1, 2, 3, 4). $$

Evidently the vectors $W^i(z)$ are mutually orthogonal at each point $z$, and if
$z \in S^{4m+3}$ they are of unit length. Thus the theorem is proved.
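The block construction in this proof can be sketched in a few lines of Python. As an assumption for illustration, the three fields on $S^3$ are taken to be left multiplication by the quaternion units $i$, $j$, $k$ (with $x = x_1 + x_2 i + x_3 j + x_4 k$) rather than the columns of the matrix from Section 5; these fields are linear in $x$, so the radial extension $V^i(y) = rV^i(x)$ is given by the same formula. The function names are likewise hypothetical.

```python
import math
import random


def fields_on_s3(x):
    """Three mutually orthogonal vectors orthogonal to x = (x1, x2, x3, x4).

    Assumed choice: the quaternion products i*x, j*x, k*x. Each has the
    same length as x, so on the unit sphere they are unit tangent vectors.
    """
    x1, x2, x3, x4 = x
    return [
        [-x2, x1, -x4, x3],
        [-x3, x4, x1, -x2],
        [-x4, -x3, x2, x1],
    ]


def fields_on_sphere(z):
    """Vectors W^1, W^2, W^3 at z in R^(4m+4), built block by block."""
    if len(z) % 4 != 0:
        raise ValueError("the number of coordinates must be a multiple of 4")
    w = [[], [], []]
    for start in range(0, len(z), 4):
        block = fields_on_s3(z[start:start + 4])
        for i in range(3):
            w[i].extend(block[i])
    return w


def dot(u, v):
    """Euclidean inner product of two vectors of equal length."""
    return sum(a * b for a, b in zip(u, v))


def random_unit_vector(dim, rng):
    """Sample a point of the unit sphere in R^dim from a Gaussian vector."""
    v = [rng.gauss(0.0, 1.0) for _ in range(dim)]
    norm = math.sqrt(sum(c * c for c in v))
    return [c / norm for c in v]
```

For a unit vector `z` with twelve coordinates (a point of $S^{11}$, so $m = 2$), the three vectors returned by `fields_on_sphere(z)` should be orthogonal to `z`, mutually orthogonal, and of unit length, because the squared lengths of the blocks add up to the squared length of `z`.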

The problem of determining the maximum number of independent fields
which can exist over $S^n$ has not yet been solved for general odd n. For $n \equiv 1$
(mod 4) the solution appears in

Theorem 12. If $n \equiv 1$ (mod 4) any two vector fields over $S^n$ are somewhere
dependent.

The theorem is evidently true for n = 1. Suppose n > 1 and suppose that
the theorem were not true. Then by the Corollary to Theorem 10, the canonical
map $C_{n-1}$ would be contractible in $R_{n-1}$ into $R_{n-3}$. Hence the map $G_{n-1}(S^{n-1}) \subset
R_{n-2}$ would be homotopic in $R_{n-1}$ to a map $F(S^{n-1}) \subset R_{n-3}$. Hence
$\{G_{n-1}\} - \{F\} \in \pi_{n-1,0}(R_{n-2})$. But $\pi[\{G_{n-1}\} - \{F\}] = \pi\{G_{n-1}\} - \pi\{F\} =
\pi\{G_{n-1}\} \ne 0$. In the proof of Theorem 8 we have shown that this is impossible.

Corollary. If n > 1 and $n \equiv 1$ (mod 4) the tangent sphere-bundle of $S^n$ is
not simple ([7], p. 788).

University of Chicago 


Bibliography 

1. W. Hurewicz, Beiträge zur Topologie der Deformationen, Proc. Kon. Akad. van
Wetenschappen te Amsterdam, 38 (1935), pp. 112-119, 521-528; 39 (1936), pp. 117-125,
215-224.

2. W. Hurewicz and N. E. Steenrod, Homotopy relations in fibre spaces, Proc. Nat. Acad.
of Sci., 27 (1941), pp. 61-64.

3. H. Freudenthal, Über die Klassen der Sphärenabbildungen I, Comp. Math., 5 (1937),
pp. 299-314.

4. H. Hopf, Über die Abbildungen der dreidimensionalen Sphäre auf die Kugelfläche, Math.
Ann., 104 (1931), pp. 637-665.

5. L. Pontrjagin, A classification of continuous transformations of a complex into a sphere,
C. R. Acad. des Sc. de l'URSS, 19 (1938), pp. 361-363.

6. H. Whitney, On the theory of sphere-bundles, Proc. Nat. Acad. of Sci., 26 (1940), pp.
148-153.

7. H. Whitney, Topological properties of differentiable manifolds, Bull. Am. Math. Soc.,
43 (1937), pp. 785-805.

8. E. Cartan, La topologie des groupes de Lie, Actualités Scientifiques et Industrielles,
Exposés de géométrie, vol. VIII. Paris, 1936.



Annals of Mathematics
Vol. 43, No. 1, January, 1942 


ON MATRIX ALGEBRAS OVER AN ALGEBRAICALLY CLOSED FIELD*†

By Winston M. Scott 
(Received June 2, 1941) 


Introduction 


Recently a number of writers have discussed interesting developments in the 
theory of not completely reducible matrix sets and non-semisimple algebras. 1 
Here we have made use of some of these concepts and methods to study matrix 
algebras over an algebraically closed field. We shall discuss in some detail in 
this introduction the definitions that are used and the theorems that are 
developed. 

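Let $\mathfrak{A}$ denote a matrix algebra, with unit element, over an algebraically closed
field $K$. $\mathfrak{A}$ will be taken in reduced form, by which we shall mean that $\mathfrak{A}$ is
exhibited with only zeros above the main diagonal, with irreducible constituents
of $\mathfrak{A}$ in the main diagonal, and that $\mathfrak{A}$ is expressible as a direct sum of its radical
and a semisimple subalgebra which latter has non-zero components only in the
irreducible constituents of $\mathfrak{A}$:$^2$

$$
\begin{pmatrix}
\mathfrak{C}_{11} & & & \\
\mathfrak{C}_{21} & \mathfrak{C}_{22} & & \\
\vdots & \vdots & \ddots & \\
\mathfrak{C}_{t1} & \mathfrak{C}_{t2} & \cdots & \mathfrak{C}_{tt}
\end{pmatrix}, \tag{1}
$$

the $\mathfrak{C}_{ii}$ denoting the irreducible constituents; further

$$ \mathfrak{A} = \mathfrak{N} + \mathfrak{A}^*, $$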


* The following constitutes a portion of a dissertation written under the direction of 
Dr. C. J. Nesbitt and accepted by the University of Michigan in January, 1941. The aid 
and encouragement given me by Dr. Nesbitt has been invaluable in the preparation of 
this work. 

† Presented to the Society, May 2, 1941.

1 See, for instance, the forthcoming paper by R. Brauer, On Sets of Matrices with
Coefficients in a Division Ring; T. Nakayama, On Frobeniusean Algebras, I, these Annals, Vol.
40 (1939), pp. 611-633; T. Nakayama, Some Studies on Regular Representations, Induced
Representations, and Modular Representations, these Annals, Vol. 39 (1938), pp. 361-369;
T. Nakayama and C. Nesbitt, Note on Symmetric Algebras, these Annals, Vol. 39 (1938),
pp. 659-668; R. Brauer and C. Nesbitt, On the Regular Representations of Algebras, Proc.
Nat. Acad. of Sci., Vol. 23 (1937), pp. 236-240; C. Nesbitt, On the Regular Representations of
Algebras, these Annals, Vol. 39 (1938), pp. 634-658. We shall refer to the last two as B.N.
and N.R., respectively.

2 Cf. N.R., p. 639.




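where $\mathfrak{N}$ is the radical of $\mathfrak{A}$ and

$$
\mathfrak{N} = \begin{pmatrix}
0 & & & \\
\mathfrak{C}_{21} & 0 & & \\
\vdots & \vdots & \ddots & \\
\mathfrak{C}_{t1} & \mathfrak{C}_{t2} & \cdots & 0
\end{pmatrix}, \qquad
\mathfrak{A}^* = \begin{pmatrix}
\mathfrak{C}_{11} & & & \\
0 & \mathfrak{C}_{22} & & \\
\vdots & \vdots & \ddots & \\
0 & 0 & \cdots & \mathfrak{C}_{tt}
\end{pmatrix}. \tag{2}
$$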


Let $F_1, F_2, \dots, F_k$ denote the totality of distinct irreducible representations
of $\mathfrak{A}$. Then each $\mathfrak{C}_{ii}$ is equivalent to one of the $F_\kappa$. It is well known that up
to equivalence the irreducible constituents $\mathfrak{C}_{ii}$ of $\mathfrak{A}$ are uniquely determined.

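As a part of $\mathfrak{A}$, $\mathfrak{C}_{ij}$ forms an additive group or module of matrices upon
which $\mathfrak{A}$, itself considered as a module, is homomorphically mapped. We wish
to consider $\mathfrak{C}_{ij}$ as a matrix module with $\mathfrak{A}$ as both a left and right operator
system. For a matrix $A$ of $\mathfrak{A}$, let us use the notation $C_{ij}(A)$ $(j \le i;\ i = 1, 2,
\dots, t)$ to denote the parts of $A$,

$$
A = \begin{pmatrix}
C_{11}(A) & & & \\
C_{21}(A) & C_{22}(A) & & \\
\vdots & \vdots & \ddots & \\
C_{t1}(A) & C_{t2}(A) & \cdots & C_{tt}(A)
\end{pmatrix}. \tag{3}
$$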


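Let $B$ be any element of $\mathfrak{A}$, and let $B^*$ be the component of $B$ in the semisimple
subalgebra $\mathfrak{A}^*$. We define $B$ as a left and as a right operator of $C_{ij}(A)$ by the
relations below, using $\circ$ to distinguish this operation from ordinary matrix
multiplication

$$
\begin{aligned}
B \circ C_{ij}(A) &= C_{ii}(B) \cdot C_{ij}(A) = C_{ij}(B^*A), \\
C_{ij}(A) \circ B &= C_{ij}(A) \cdot C_{jj}(B) = C_{ij}(AB^*).
\end{aligned} \tag{4}
$$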

Let us call a module which has $\mathfrak{A}$ as both right and left operator system an
$(\mathfrak{A}, \mathfrak{A})$ module, and a homomorphism which is an operator mapping under the
system of operators $\mathfrak{A}$, acting on both sides, an $(\mathfrak{A}, \mathfrak{A})$ homomorphism. Under
definition (4), $\mathfrak{C}_{ij}$ is a simple $(\mathfrak{A}, \mathfrak{A})$ module. Moreover, each $\mathfrak{C}_{ij}$ is either a
0-part or there exist elements in $\mathfrak{A}$ such that the corresponding $C_{ij}$ have any
arbitrary components from $K$.

Accordingly, we shall call the $\mathfrak{C}_{ij}$ the simple parts of $\mathfrak{A}$. The simple parts
$\mathfrak{C}_{ii}$, or irreducible constituents of $\mathfrak{A}$, have been rather thoroughly studied. Our
first aim, then, is to develop a theory of simple parts which includes the $\mathfrak{C}_{ij}$,
with $i \ne j$, that is, the simple parts of $\mathfrak{A}$ which appear in the radical $\mathfrak{N}$ of $\mathfrak{A}$.

A first remark is that, when $\mathfrak{A}$ is considered as an $(\mathfrak{A}, \mathfrak{A})$ module, it can be
proved in a direct manner that each $\mathfrak{C}_{ij}$ which is not a 0-part is $(\mathfrak{A}, \mathfrak{A})$ isomorphic
to a composition factor group of $\mathfrak{A}$. This result may also be obtained by applying
a useful theorem given recently by R. Brauer.$^3$


3 R. Brauer, op. cit.












When $\mathfrak{C}_{ii}$, $\mathfrak{C}_{jj}$ are respectively the irreducible constituents $F_\kappa$, $F_\lambda$ of $\mathfrak{A}$, we
shall say that $\mathfrak{C}_{ij}$ is of type $(\kappa, \lambda)$. If $\kappa \ne \lambda$, we shall say that $\mathfrak{C}_{ij}$ is of mixed
type, and if $\kappa = \lambda$ that $\mathfrak{C}_{ij}$ is of unmixed type. An $(\mathfrak{A}, \mathfrak{A})$ isomorphism may be
defined for any two $\mathfrak{C}_{ij}$ of type $(\kappa, \lambda)$ and, as a consequence, the corresponding
factor groups of $\mathfrak{A}$ are also isomorphic. It is easy to see that two simple parts
not of the same type are not $(\mathfrak{A}, \mathfrak{A})$ isomorphic. A representation $\mathfrak{B}$ of $\mathfrak{A}$ may
be considered as an $(\mathfrak{A}, \mathfrak{A})$ module. Again applying Brauer's Theorem it follows
that the simple parts of type $(\kappa, \lambda)$ of a representation $\mathfrak{B}$, which is in reduced
form, are $(\mathfrak{A}, \mathfrak{A})$ isomorphic to the simple parts of $\mathfrak{A}$ of type $(\kappa, \lambda)$.

In N.R. extensive use was made of a basis system of $\mathfrak{A}$ first employed by
Cartan,$^4$ and which was there called the Cartan basis system. The coefficients
in the expressions for the elements $A$ of $\mathfrak{A}$ as linear combinations of the Cartan
basis elements yield a set of matrix modules $\mathfrak{H}^\mu$ $(\mu = 1, 2, \dots, m)$ upon which
$\mathfrak{A}$, as a module, is homomorphically mapped. The $\mathfrak{H}^\mu$ may also be considered
as $(\mathfrak{A}, \mathfrak{A})$ modules. We shall call these elementary (related) modules of $\mathfrak{A}$.
The number m of elementary modules is equal to the composition length of $\mathfrak{A}$
considered as an $(\mathfrak{A}, \mathfrak{A})$ module, and each $\mathfrak{H}^\mu$ is $(\mathfrak{A}, \mathfrak{A})$ isomorphic to a
composition factor group of $\mathfrak{A}$. Conversely, to each composition factor group of a
composition series of $\mathfrak{A}$ there corresponds an isomorphic elementary module. It
follows that up to $(\mathfrak{A}, \mathfrak{A})$ isomorphism the set of elementary modules is uniquely
determined.

We say that an elementary module $\mathfrak{H}^\mu$ is of type $(\kappa, \lambda)$ if $\mathfrak{H}^\mu$ is defined by
Cartan basis elements of type $(\kappa, \lambda)$. Simple parts and elementary modules
may also be classified by the powers of the radical to which they belong. A
simple part $\mathfrak{C}_{ij}$ (an elementary module $\mathfrak{H}^\mu$) belongs to $\mathfrak{N}^h$ if this is the highest
power of the radical such that not all of its elements are mapped on the 0-matrix
in $\mathfrak{C}_{ij}$ ($\mathfrak{H}^\mu$). The principal relation between simple parts and elementary
modules is that every simple part $\mathfrak{C}_{ij}$ of type $(\kappa, \lambda)$ which belongs to $\mathfrak{N}^h$ is
expressible as a linear combination of elementary modules of type $(\kappa, \lambda)$ which
belong to $\mathfrak{N}^j$, $j \le h$.

The form of a center element of $\mathfrak{A}$ can be computed. For a center element
$Z$, $C_{ij}(Z) = 0$ if $\mathfrak{C}_{ij}$ is a simple part of mixed type, and $C_{ij}(Z)$ is of form $(c \cdot \delta_{mn})$,
$c$ an element of $K$, if $\mathfrak{C}_{ij}$ is of unmixed type. Let $c_{\kappa\lambda}$ denote, as in B.N., the
Cartan Invariants. Then the rank $\rho$ of the center is at most equal to the number
$s = \sum_\kappa c_{\kappa\kappa}$ of elementary modules of unmixed type.

As an application of these concepts we obtain a decomposition of the matrix
algebra $\mathfrak{A}$. By use of the concept of type, the simple parts of $\mathfrak{A}$ may be
classified into 'blocks'. If $\mathfrak{C}_{ii}$, $\mathfrak{C}_{jj}$, $j < i$, do not belong to the same block then
$\mathfrak{C}_{ij}$ must be a 0-part. An immediate consequence is that $\mathfrak{A}$ may be decomposed
into parts which contain all simple parts belonging to a block. Each such part
defines an invariant subalgebra which may not be decomposed into a direct
sum of invariant subalgebras.

4 E. Cartan, Les groupes bilinéaires et les systèmes de nombres complexes, Annales de
Toulouse, 12B (1898), p. 1.





1 

We consider a matrix algebra $\mathfrak{A}$, with unit element $E$, over an algebraically
closed field $K$. We may assume that $\mathfrak{A}$ has no 0-constituent.$^5$ We take $\mathfrak{A}$ in
the reduced form (1) and define operators for the parts $\mathfrak{C}_{ij}$ of $\mathfrak{A}$ by (4). The
field $K$ may be considered as included in the operator system by identifying
the element $k$ of $K$ with the element $k \cdot E$ of $\mathfrak{A}$.

By means of (4) the parts $\mathfrak{C}_{ij}$ of $\mathfrak{A}$ (cf. (1)) may be regarded as $(\mathfrak{A}, \mathfrak{A})$ modules.
If $\mathfrak{C}_{ij}$ contains only the 0-matrix, we call $\mathfrak{C}_{ij}$ a 0-part, and obviously, in this
case, $\mathfrak{C}_{ij}$ is a simple module. Suppose, instead, that there exists an element
$A$ of $\mathfrak{A}$ such that $C_{ij}(A) \ne 0$. It is well known that elements $L$ of $\mathfrak{A}$ can be
chosen so that the corresponding $C_{ii}(L)$, $C_{jj}(L)$ have any components arbitrarily
chosen from $K$. It follows from (4) that by suitably choosing right and left
operators $L, M, \dots$ to be applied to $C_{ij}(A)$ one obtains a system of matrices

$$ C_{ij}(L^*AM^*) = C_{ii}(L)\, C_{ij}(A)\, C_{jj}(M) $$

in $\mathfrak{C}_{ij}$, from which may be generated matrices of $\mathfrak{C}_{ij}$ with arbitrary coefficients.
Thus, the admissible subgroup which contains $C_{ij}(A)$ is the whole module $\mathfrak{C}_{ij}$.
Let us call a module of matrices, each of m rows and n columns and with
coefficients in $K$, a complete module if it contains all such matrices. We then have

Theorem 1: Each $\mathfrak{C}_{ij}$ is either a 0-part or a complete module. In either case
$\mathfrak{C}_{ij}$ is a simple $(\mathfrak{A}, \mathfrak{A})$ module.

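We shall let $\mathfrak{N}^\rho$ denote the set which contains all elements of the form
$\sum N_{h_1} N_{h_2} \cdots N_{h_\rho}$, where the $N_{h_i}$ are elements contained in the radical $\mathfrak{N}$.

For a sufficiently large value of $t$, $\mathfrak{N}^t = 0$. These $\mathfrak{N}^\rho$ form invariant subalgebras
of $\mathfrak{A}$. If we denote $\mathfrak{A}$ by $\mathfrak{N}^0$, we have

$$ \mathfrak{A} = \mathfrak{N}^0 \supset \mathfrak{N}^1 \supset \mathfrak{N}^2 \supset \mathfrak{N}^3 \supset \cdots \supset \mathfrak{N}^{t-1} \supset \mathfrak{N}^t = 0. $$

We say that an element $A$ belongs to $\mathfrak{N}^h$ if $h$ is the largest value such that
$\mathfrak{N}^h$ contains $A$. In particular, the elements of $\mathfrak{A}^*$ belong to $\mathfrak{N}^0$. The product
of an element belonging to $\mathfrak{N}^h$ with an element belonging to $\mathfrak{N}^j$ belongs to $\mathfrak{N}^k$
with $k \ge h + j$.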
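A small special case may make the filtration by powers of the radical concrete. The Python sketch below assumes, purely for illustration, that the algebra consists of all lower triangular matrices of a fixed size (every irreducible constituent of degree one); its radical is then the set of strictly lower triangular matrices, and a non-zero matrix belongs to the power $h$ equal to the smallest value of $i - j$ over its non-zero entries. The function names are hypothetical.

```python
def mat_mul(a, b):
    """Product of two square integer matrices of the same size."""
    d = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(d)) for j in range(d)]
            for i in range(d)]


def radical_power(a):
    """Largest h such that a lies in the h-th power of the radical.

    Assumes the algebra of all lower triangular matrices, where the h-th
    power of the radical consists of the matrices whose entry (i, j)
    vanishes unless i - j >= h. Returns None for the zero matrix, which
    lies in every power.
    """
    gaps = [i - j
            for i, row in enumerate(a)
            for j, entry in enumerate(row)
            if entry != 0]
    if not gaps:
        return None
    if min(gaps) < 0:
        raise ValueError("matrix is not lower triangular")
    return min(gaps)
```

In this model a diagonal matrix plays the part of an element of the semisimple subalgebra and has `radical_power` equal to 0, a strictly lower triangular 3 by 3 matrix has power at least 1, and the product of any three such matrices is zero, in line with the vanishing of a sufficiently high power of the radical.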

We can now prove 

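Theorem 2: A non-zero simple part $\mathfrak{C}_{ij}$ of $\mathfrak{A}$ is $(\mathfrak{A}, \mathfrak{A})$ isomorphic to a
composition factor group of $\mathfrak{A}$ itself considered as an $(\mathfrak{A}, \mathfrak{A})$ module.

Proof: Take the series

$$ \mathfrak{A} \supset \mathfrak{N} \supset \mathfrak{N}^2 \supset \cdots \supset \mathfrak{N}^{t-1} \supset \mathfrak{N}^t = 0 $$

and refine it to obtain a composition series

$$ \mathfrak{A} \supset \cdots \supset \mathfrak{A}_{q-1} \supset \mathfrak{A}_q \supset \cdots \supset \mathfrak{A}_{n-1} \supset 0 $$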


5 For, from conditions stated for $\mathfrak{A}$, we may show that if $\mathfrak{A}$ has 0-constituents, then it
may be decomposed in the form $\mathfrak{A}$

where $\mathfrak{A}_1$ does not have 0-constituents, and $\mathfrak{A}$ is isomorphic to $\mathfrak{A}_1$.




of $\mathfrak{A}$. Let $\mathfrak{A}_q$ be the first group in this series such that for each of its elements
$A$, $C_{ij}(A) = 0$. Then denoting elements of $\mathfrak{A}_{q-1}$ by $A_{q-1}$, we have that

$$ A_{q-1} \to C_{ij}(A_{q-1}) $$

is an $(\mathfrak{A}, \mathfrak{A})$ homomorphism, since for any element $B$ of $\mathfrak{A}$,

$$ B \cdot A_{q-1} = B^* \cdot A_{q-1} + N \cdot A_{q-1} \to C_{ij}(B^*A_{q-1}) + C_{ij}(N \cdot A_{q-1}). $$

But $N \cdot A_{q-1} \subset \mathfrak{A}_q$ so that $C_{ij}(NA_{q-1}) = 0$ and, therefore,

$$ B \cdot A_{q-1} \to C_{ij}(B^*A_{q-1}) = B \circ C_{ij}(A_{q-1}). $$

The same argument holds for right side multiplication. The kernel of this
mapping (that is, the elements mapping into the identity element) contains $\mathfrak{A}_q$
and, therefore, the kernel is $\mathfrak{A}_q$. It follows that $\mathfrak{A}_{q-1}/\mathfrak{A}_q$ is $(\mathfrak{A}, \mathfrak{A})$ isomorphic
to the simple part $\mathfrak{C}_{ij}$.

We have said that if $\mathfrak{C}_{ii}$, $\mathfrak{C}_{jj}$ are respectively the irreducible constituents
$F_\kappa$, $F_\lambda$ of $\mathfrak{A}$, then the simple part $\mathfrak{C}_{ij}$ is of type $(\kappa, \lambda)$. We now show that

Theorem 3: The non-zero simple parts $\mathfrak{C}_{ij}$ of a given type $(\kappa, \lambda)$ are mutually
isomorphic in the $(\mathfrak{A}, \mathfrak{A})$ sense. Simple parts of different types are not isomorphic.

Proof: It should be noted here that the word isomorphic is to have a meaning
differing from that of Theorem 2. In Theorem 2, the isomorphism is obtained
from $A \to C_{ij}(A)$, that is, $\mathfrak{C}_{ij}$ is considered as a set of matrices related to $\mathfrak{A}$.
Here, $\mathfrak{C}_{ij}$ is just to represent the system of matrices.

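Let $\mathfrak{C}_{ij}$, $\mathfrak{C}_{mn}$ be two simple parts of type $(\kappa, \lambda)$ which are not 0-parts. An
$(\mathfrak{A}, \mathfrak{A})$ mapping of $\mathfrak{C}_{ij}$ upon $\mathfrak{C}_{mn}$ is obtained by the relation $C_{ij} \to C_{mn}$ if $C_{ij} =
C_{mn}$, where $C_{ij}$, $C_{mn}$ denote matrices of $\mathfrak{C}_{ij}$, $\mathfrak{C}_{mn}$, respectively. For if $B$ is
any element of $\mathfrak{A}$, then

$$ B \circ C_{ij} = C_{ii}(B) \cdot C_{ij} = F_\kappa(B) \cdot C_{ij} \to C_{mm}(B) \cdot C_{mn} = B \circ C_{mn}. $$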

On the other hand, if $\mathfrak{C}_{ij}$, $\mathfrak{C}_{mn}$ are of types $(\kappa, \lambda)$, $(\mu, \nu)$, respectively, with
$\kappa \ne \mu$, then any relation $C_{ij} \to C_{mn}$ is not an $(\mathfrak{A}, \mathfrak{A})$ operator mapping, for we
may choose $B$ in $\mathfrak{A}$ such that $C_{ii}(B) = F_\kappa(B) = 0$ while $C_{mm}(B) = F_\mu(B) = E_{f_\mu}$,
where $E_{f_\mu}$ is a unit matrix of proper degree.

2 

The basis of $\mathfrak{A}$ which seems best adapted to the study of simple parts is a basis
system first employed by Cartan.$^6$ This basis was used by C. Nesbitt in his
study of regular representations of linear associative algebras.$^7$
We denote, as before, the irreducible representations of $\mathfrak{A}$ by $F_1$, $F_2$,


6 E. Cartan, op. cit.

7 A detailed description of the Cartan basis system is given in N.R., section 2.





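$\dots, F_k$, and their degrees by $f_1, f_2, \dots, f_k$. Let $E_\kappa(m, n)$ be the matrix of
$\mathfrak{A}$ which has 1 for the (m, n) component of each $\mathfrak{C}_{ii}$ which is equal to $F_\kappa$, and
has zero elsewhere. The relations

$$
E_\kappa(m, n) \cdot E_\nu(p, q) = \begin{cases}
0 & \text{if either } \kappa \ne \nu, \text{ or } n \ne p, \\
E_\kappa(m, q) & \text{if } \kappa = \nu,\ n = p,
\end{cases}
$$

follow from the definition of the $E_\kappa(m, n)$.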
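These multiplication rules can be checked mechanically on a toy example. In the Python sketch below the diagonal is assumed to carry three irreducible constituents, equal to $F_0$, $F_1$, $F_0$ with degrees 2, 1, 2; the layout, the 0-based indices, and the function names are all illustrative assumptions rather than anything fixed by the text.

```python
def matrix_unit(kinds, degrees, kappa, m, n):
    """Matrix with 1 in position (m, n) of every diagonal block of kind kappa.

    kinds lists which representation each diagonal block equals, and
    degrees[kind] is the degree of that representation (0-based indices).
    """
    offsets = []
    total = 0
    for kind in kinds:
        offsets.append(total)
        total += degrees[kind]
    e = [[0] * total for _ in range(total)]
    for kind, offset in zip(kinds, offsets):
        if kind == kappa:
            e[offset + m][offset + n] = 1
    return e


def mat_mul(a, b):
    """Product of two square integer matrices of the same size."""
    d = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(d)) for j in range(d)]
            for i in range(d)]


def mat_add(a, b):
    """Entrywise sum of two matrices of the same shape."""
    return [[u + v for u, v in zip(ra, rb)] for ra, rb in zip(a, b)]


def zero_matrix(d):
    """The d x d zero matrix."""
    return [[0] * d for _ in range(d)]
```

Looping over all index combinations and comparing `mat_mul(matrix_unit(...), matrix_unit(...))` with the right-hand side of the relations gives a direct test; summing the diagonal units `matrix_unit(kinds, degrees, kappa, i, i)` over all `kappa` and `i` should reproduce the unit matrix.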

Let $E_\kappa = \sum_i E_\kappa(i, i)$. Then the $E_\kappa$ $(\kappa = 1, 2, \dots, k)$ form a system of
mutually idempotent elements. In fact, $E_\kappa$ is the unit element of a certain
simple algebra which is a summand in a decomposition of $\mathfrak{A}^*$ into a direct sum
of simple algebras, and $\sum_\kappa E_\kappa = E$, where $E$ is the unit element of $\mathfrak{A}$.

A matrix $A$ of $\mathfrak{A}$ is said to be of type $(\kappa, \lambda)$ if $E_\kappa A E_\lambda = A$. In particular,
$E_\kappa(m, n)$ is of type $(\kappa, \kappa)$. An element of type $(\kappa, \lambda)$, $\kappa \ne \lambda$, we shall say is of
mixed type, and if $\kappa = \lambda$, of unmixed type. The product of an element of type
$(\kappa, \lambda)$ and an element of type $(\mu, \nu)$ is an element of type $(\kappa, \nu)$. The product
of an element of type $(\kappa, \lambda)$ and of an element of type $(\mu, \nu)$ vanishes for $\lambda \ne \mu$.
A matrix $A$ of type $(\kappa, \lambda)$ has non-zero components only in simple parts $\mathfrak{C}_{ij}$ of
type $(\kappa, \lambda)$.

A basis set $B^1_{\kappa\lambda}, B^2_{\kappa\lambda}, \dots, B^t_{\kappa\lambda}$ is first obtained for those matrices of $\mathfrak{A}$ of type
$(\kappa, \lambda)$ which have non-zero components only in the upper left corners of simple
parts of type $(\kappa, \lambda)$. The rank $t$ of this set is the Cartan Invariant, $c_{\kappa\lambda}$.$^8$ A
basis for the whole algebra is then given by the set

$$
E_\kappa(a1)\, B^\mu_{\kappa\lambda}\, E_\lambda(1b), \qquad
\mu = 1, 2, \dots, c_{\kappa\lambda}, \quad a = 1, 2, \dots, f_\kappa, \quad
b = 1, 2, \dots, f_\lambda, \quad \kappa, \lambda = 1, 2, \dots, k.
$$

This basis can be so chosen that any element $A$ of $\mathfrak{A}$ which belongs to $\mathfrak{N}^\tau$ is
expressible in terms of basis elements which belong to $\mathfrak{N}^\rho$, for $\rho \ge \tau$.$^9$

For convenience we change slightly the above notation. We take together
in one set all the elements $B^\mu_{\kappa\lambda}$ for all $\kappa$, $\lambda$, and $\mu$ and redefine the superscript $\mu$
to enumerate these elements, independently of type, in such a way as to take
first all those elements which belong to $\mathfrak{N}^0$, namely $E_1(11), \dots, E_k(11)$, then
next those which belong to $\mathfrak{N}^1$, and so on. $\mu$ will now have the range $1, 2, \dots, m$
where $m = \sum c_{\kappa\lambda}$. Where the type of the element need not be explicitly
stated we shall drop the subscripts $\kappa$, $\lambda$ from the symbols $B^\mu_{\kappa\lambda}$.


3 

Expressing the element $A$ of $\mathfrak{A}$ as a linear combination of the elements of the
Cartan basis system, we have

$$ A = \sum_{a, b, \mu} h^\mu_{ab}(A)\, E_\kappa(a1)\, B^\mu_{\kappa\lambda}\, E_\lambda(1b). \tag{5} $$


8 See B.N. or N.R.

9 See N.R., p. 641.





Here each $B^\mu_{\kappa\lambda}$ is a basis element having components different from zero only
in the upper left hand corners of simple parts $\mathfrak{C}_{ij}$ which are of type $(\kappa, \lambda)$.

We denote the matrix $(h^\mu_{ab}(A))_{ab}$ (cf. (5)) by $H^\mu(A)$. The module consisting
of the matrices $H^\mu(A)$ will be denoted by $\mathfrak{H}^\mu$. We shall call $\mathfrak{H}^\mu$ an elementary
(related) module of $\mathfrak{A}$. $\mathfrak{H}^\mu$ is evidently a complete module, since the coefficients
in (5) may be taken arbitrarily from $K$.

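An elementary module will be said to be of type $(\kappa, \lambda)$ if it results from, or is
defined by, Cartan basis elements of type $(\kappa, \lambda)$. We shall write $\mathfrak{H}^\mu_{\kappa\lambda}$ to denote
that $\mathfrak{H}^\mu$ is of type $(\kappa, \lambda)$. If $\kappa \ne \lambda$ we shall call $\mathfrak{H}^\mu_{\kappa\lambda}$ an elementary module of
mixed type, and if $\kappa = \lambda$ we shall call $\mathfrak{H}^\mu_{\kappa\lambda}$ an elementary module of unmixed
type.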

Let us say that a module $\mathfrak{M}$ of matrices is related to $\mathfrak{N}^\rho$ if $\mathfrak{N}^\rho$ is
homomorphically mapped on $\mathfrak{M}$. In general, we shall deal with modules $\mathfrak{M}$ which are
related to $\mathfrak{N}^0 = \mathfrak{A}$, but it is convenient sometimes to consider $\mathfrak{M}$ as the map of
only the subalgebra $\mathfrak{N}^\rho$ rather than $\mathfrak{A}$ itself (see p. 157). A related module $\mathfrak{M}$
will be said to belong to $\mathfrak{N}^\sigma$ if $\sigma$ is the largest value such that for at least one
element $N_\sigma$ belonging to $\mathfrak{N}^\sigma$, the corresponding matrix $M(N_\sigma)$ of $\mathfrak{M}$ is not zero.
In particular, the elementary module $\mathfrak{H}^\mu$ is related to $\mathfrak{A}$, and belongs to the same
power of $\mathfrak{N}$ as does the corresponding basis element $B^\mu$.

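Lemma 1: Let $\mathfrak{M}$ be a module for which the concept of type is defined. If

i). $\mathfrak{M}$ is of type $(\kappa, \lambda)$, and its elements are $f_\kappa \times f_\lambda$ matrices,

ii). $\mathfrak{M}$ is related to $\mathfrak{N}^\rho$, and belongs to $\mathfrak{N}^\sigma$, and

iii). $\mathfrak{M}$ is a simple $(\mathfrak{A}, \mathfrak{A})$ module such that for any element $B$ of $\mathfrak{A}$, and any element
$N_\rho$ of $\mathfrak{N}^\rho$

$$
\begin{aligned}
B \circ M(N_\rho) &= F_\kappa(B) \cdot M(N_\rho) = M(B^*N_\rho), \\
M(N_\rho) \circ B &= M(N_\rho) \cdot F_\lambda(B) = M(N_\rho B^*),
\end{aligned} \tag{6}
$$

then it follows that as a module related to $\mathfrak{N}^\rho$, $M(N_\rho) = \sum_\mu m_\mu H^\mu_{\kappa\lambda}(N_\rho)$, where $m_\mu$
are fixed scalars, $N_\rho$ is any element of $\mathfrak{N}^\rho$, and where the elementary module $\mathfrak{H}^\mu_{\kappa\lambda}$
belongs to $\mathfrak{N}^\tau$, $\rho \le \tau \le \sigma$, and is considered as related to $\mathfrak{N}^\rho$.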

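Proof: We shall use the notation $(\epsilon_{rs})_{pq}$ to denote a matrix with 1 as its
$(r, s)$ component, and with zero elsewhere. Then operating with $E_{\kappa}(1, 1)$,
$E_{\lambda}(1, 1)$ on $M(B^{\mu}_{\kappa\lambda}) = (m_{pq})$, we obtain

$E_{\kappa}(11) \circ M(B^{\mu}_{\kappa\lambda}) \circ E_{\lambda}(11) = (\epsilon_{11})_{pr} \cdot (m_{rs}) \cdot (\epsilon_{11})_{sq} = (m_{11}) \cdot (\epsilon_{11})_{pq}$
$= M(E_{\kappa}(11) B^{\mu}_{\kappa\lambda} E_{\lambda}(11)) = M(B^{\mu}_{\kappa\lambda}).$

Dropping the subscripts 11 we have

$M(B^{\mu}_{\kappa\lambda}) = (m_{\mu} \epsilon_{11})_{pq}.$

Here $m_{\mu} \neq 0$ only if $B^{\mu}_{\kappa\lambda}$ belongs to $\mathfrak{N}^{\tau}$, $\tau \le \sigma$. Also

$M(E_{\kappa}(a1) B^{\mu}_{\kappa\lambda} E_{\lambda}(1b)) = (m_{\mu} \epsilon_{ab})_{pq}.$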





WINSTON M. SCOTT 


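We have finally that

$M(N_{\rho}) = M\big(\sum_{\mu,a,b} h^{\mu}_{ab}(N_{\rho})\, E_{\kappa}(a1) B^{\mu}_{\kappa\lambda} E_{\lambda}(1b)\big)$

where $\mathfrak{H}^{\mu}$ belongs to $\mathfrak{N}^{\tau}$, $\rho \le \tau \le \sigma$, since $N_{\rho}$ is an element of $\mathfrak{N}^{\rho}$, and $\mathfrak{M}$ belongs
to $\mathfrak{N}^{\sigma}$.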

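Relating the simple parts of $\mathfrak{A}$ to the elementary modules of $\mathfrak{A}$, we have

Theorem 4: A simple part $\mathfrak{C}_{ij}$ of $\mathfrak{A}$ of type $(\kappa, \lambda)$ which belongs to $\mathfrak{N}^{\sigma}$ is expressible
as a linear combination of the elementary modules of type $(\kappa, \lambda)$ which
belong to $\mathfrak{N}^{\tau}$, $\tau \le \sigma$.

Proof: We consider $\mathfrak{C}_{ij}$ as a simple module related to $\mathfrak{A} = \mathfrak{N}^{0}$, and belonging
to $\mathfrak{N}^{\sigma}$, and apply the lemma. It follows that $\mathfrak{C}_{ij}$ is of the form $\sum_{\mu} d^{\mu}_{ij} \mathfrak{H}^{\mu}_{\kappa\lambda}$,
that is, for each element $A$ of $\mathfrak{A}$ we have that

$C_{ij}(A) = \sum_{\mu} d^{\mu}_{ij} H^{\mu}_{\kappa\lambda}(A),$

where each $H^{\mu}(A)$ belongs to some $\mathfrak{N}^{\tau}$, $\tau \le \sigma$.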

We now prove 

Theorem 5: The number $m$ of elementary modules is equal to the composition
length of $\mathfrak{A}$ considered as an $(\mathfrak{A}, \mathfrak{A})$ module. Each elementary module $\mathfrak{H}$ is $(\mathfrak{A}, \mathfrak{A})$
isomorphic to a composition factor group of $\mathfrak{A}$, and conversely, to each factor group
of $\mathfrak{A}$ there corresponds an isomorphic elementary module.

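Proof: We will construct a composition series for $\mathfrak{A}$ in terms of the Cartan
basis elements. As indicated above, we take the whole set of elements $B^{\mu}_{\kappa\lambda}$
$(\kappa, \lambda = 1, 2, 3, \dots, k)$ and use the superscript $\mu$ $(\mu = 1, 2, 3, \dots, m)$ to enumerate
these elements, taking first those elements which are not in $\mathfrak{N}^{1}$, then those
which are in $\mathfrak{N}^{1}$ but not in $\mathfrak{N}^{2}$, and so on. Let $\mathfrak{A}_{s-1}$ denote the subgroup of $\mathfrak{A}$
consisting of all linear combinations with coefficients in $K$ of the elements

(7) $E_{\kappa}(a1) B^{\tau}_{\kappa\lambda} E_{\lambda}(1b)$, $(\kappa, \lambda = 1, 2, \dots, \text{or } k)$,
    $(a = 1, 2, \dots, f_{\kappa};\ b = 1, 2, \dots, f_{\lambda})$, $(\tau = s, s+1, \dots, m)$.

In particular, $\mathfrak{A}_{0} = \mathfrak{A}$. We form the product

$A \cdot E_{\kappa}(a1) B^{\tau}_{\kappa\lambda} E_{\lambda}(1b) = (A^{*} + N) \cdot E_{\kappa}(a1) B^{\tau}_{\kappa\lambda} E_{\lambda}(1b).$

Here

$A^{*} \cdot E_{\kappa}(a1) B^{\tau}_{\kappa\lambda} E_{\lambda}(1b) = \sum_{\rho,m,n} f^{\rho}_{mn}(A) E_{\rho}(mn) \cdot E_{\kappa}(a1) B^{\tau}_{\kappa\lambda} E_{\lambda}(1b)$
$= \sum_{m} f^{\kappa}_{ma}(A) E_{\kappa}(m1) B^{\tau}_{\kappa\lambda} E_{\lambda}(1b)$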

is again in $\mathfrak{A}_{s-1}$. So also is $N \cdot E_{\kappa}(a1) B^{\tau}_{\kappa\lambda} E_{\lambda}(1b)$, since this belongs to a higher
power of the radical than does $B^{\tau}_{\kappa\lambda}$ and is, therefore, expressible in terms of the
elements (7). Then the whole product is in $\mathfrak{A}_{s-1}$. The same is true when $A$ is
a right factor, so we obtain that $\mathfrak{A}_{s-1}$ is an admissible subgroup of $\mathfrak{A}$. Moreover,
the factor group $\mathfrak{A}_{s-1}/\mathfrak{A}_{s}$ consisting of the residue classes

$\big(\sum_{a,b} h_{ab}\, E_{\kappa}(a1) B^{s}_{\kappa\lambda} E_{\lambda}(1b)\big)$, modulo $\mathfrak{A}_{s}$,

is easily seen by similar calculations to be simple. It follows that

$\mathfrak{A}_{0} \supset \mathfrak{A}_{1} \supset \mathfrak{A}_{2} \supset \cdots \supset \mathfrak{A}_{m-1} \supset 0$

is a composition series of $\mathfrak{A}$, and the first part of the theorem is proved.

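Again, we may easily calculate that

$\big(\sum_{a,b} h^{s}_{ab}(A)\, E_{\kappa}(a1) B^{s}_{\kappa\lambda} E_{\lambda}(1b)\big) \to H^{s}_{\kappa\lambda}(A)$

is an $(\mathfrak{A}, \mathfrak{A})$ isomorphic mapping of $\mathfrak{A}_{s-1}/\mathfrak{A}_{s}$ upon the elementary module $\mathfrak{H}^{s}_{\kappa\lambda}$
(cf. (7)). From this follow the other statements in the theorem.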

It may be noted here that from Theorem 4 and Theorem 5 another proof of 
Theorem 2 may be obtained. 

We can now prove the converse of Theorem 2 

Theorem 6: To each composition factor group of $\mathfrak{A}$ there corresponds at least
one non-zero simple part $\mathfrak{C}_{ij}$.

Proof: By Theorem 5 we have that to each composition factor group of $\mathfrak{A}$
there corresponds an isomorphic elementary module $\mathfrak{H}^{\rho}$, say of type $(\mu, \nu)$.
Then there exist non-zero simple parts of type $(\mu, \nu)$, and by the same argument
as that used in Theorem 3 it may be shown that each non-zero simple part of
type $(\mu, \nu)$ is $(\mathfrak{A}, \mathfrak{A})$ isomorphic to $\mathfrak{H}^{\rho}$.


4 

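A system of matrices $\mathfrak{L}$ is called a representation of $\mathfrak{A}$ if $\mathfrak{A}$ is homomorphically
mapped on $\mathfrak{L}$. We may define $\mathfrak{A}$ as a left and right operator system for a representation
$\mathfrak{L}$ by the relations

(8) $A \cdot \Lambda(A_{1}) = \Lambda(A \cdot A_{1}) = \Lambda(A) \cdot \Lambda(A_{1}),$
    $\Lambda(A_{1}) \cdot A = \Lambda(A_{1} \cdot A) = \Lambda(A_{1}) \cdot \Lambda(A).$

Further, if the representation $\mathfrak{L}$ is taken in reduced form, then a simple part
of $\mathfrak{L}$ may be considered as an $(\mathfrak{A}, \mathfrak{A})$ module if we define the elements of $\mathfrak{A}$
as operators in it by relations of the form (4). We then obtain

Theorem 7: The simple parts, of type $(\kappa, \lambda)$, of a representation $\mathfrak{L}$ of $\mathfrak{A}$ are
$(\mathfrak{A}, \mathfrak{A})$ isomorphic¹⁰ to simple parts $\mathfrak{C}_{mn}$, of type $(\kappa, \lambda)$, of $\mathfrak{A}$.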


10 As in Theorem 3, the word isomorphism here has a somewhat different meaning from 
that of Theorem 2. 





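Proof: By Theorem 2 we have that each simple part $\mathfrak{C}_{ij}$ of $\mathfrak{L}$ is isomorphic
to a composition factor group of a composition series for $\mathfrak{L}$. Let

(9) $\mathfrak{L} \supset \mathfrak{L}_{1} \supset \mathfrak{L}_{2} \supset \cdots \supset \mathfrak{L}_{t} = 0,$

(10) $\mathfrak{A} \supset \mathfrak{A}_{1} \supset \mathfrak{A}_{2} \supset \cdots \supset \mathfrak{A}_{m} = 0,$

be composition series for $\mathfrak{L}$, $\mathfrak{A}$, respectively, and suppose

$\mathfrak{C}_{ij} \cong \mathfrak{L}_{p-1}/\mathfrak{L}_{p}.$

$\mathfrak{L}$ is an $(\mathfrak{A}, \mathfrak{A})$ module upon which $\mathfrak{A}$ is homomorphically mapped by the relation
$A \to \Lambda(A)$. Then the Jordan-Hölder Theorem gives us that to the factor
group $\mathfrak{L}_{p-1}/\mathfrak{L}_{p}$ of (9) there corresponds a factor group $\mathfrak{A}_{s-1}/\mathfrak{A}_{s}$ of the series (10)
such that

$\mathfrak{L}_{p-1}/\mathfrak{L}_{p} \cong \mathfrak{A}_{s-1}/\mathfrak{A}_{s}.$

In addition we have from Theorem 6 that to the factor group $\mathfrak{A}_{s-1}/\mathfrak{A}_{s}$ there is
at least one non-zero isomorphic simple part $\mathfrak{C}_{mn}$ of $\mathfrak{A}$,

$\mathfrak{A}_{s-1}/\mathfrak{A}_{s} \cong \mathfrak{C}_{mn}.$

Combining these relations, and observing that all these isomorphisms are $(\mathfrak{A}, \mathfrak{A})$
isomorphisms, we obtain

$\mathfrak{C}_{ij} \cong \mathfrak{C}_{mn}$

with operators $(\mathfrak{A}, \mathfrak{A})$. This completes the proof of the statement.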

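Now suppose that $\mathfrak{L}$ is a representation of $\mathfrak{A}$, and that $\mathfrak{L}$ is in reduced form.
Let us denote the element of $\mathfrak{L}$ corresponding to the element $A$ of $\mathfrak{A}$ by $\Lambda$,
$A \to \Lambda$; further, we shall consider $\mathfrak{L}$ as a direct sum of its radical $\mathfrak{L}_{R}$ and a
semisimple subalgebra $\mathfrak{L}_{S}$, $\mathfrak{L} = \mathfrak{L}_{S} + \mathfrak{L}_{R}$, where $\mathfrak{L}_{S}$ is completely decomposed into
its irreducible constituents. We use $\Lambda_{S}$, $\Lambda_{R}$ to denote the components in $\mathfrak{L}_{S}$,
$\mathfrak{L}_{R}$ of the element $\Lambda$ of $\mathfrak{L}$, $\Lambda = \Lambda_{S} + \Lambda_{R}$.

The set $\mathfrak{L}^{*}$ of elements in the representation $\mathfrak{L}$ which correspond to the
semisimple subalgebra $\mathfrak{A}^{*}$ of $\mathfrak{A}$ form a semisimple subalgebra of $\mathfrak{L}$ equivalent to $\mathfrak{L}_{S}$.
Since $\mathfrak{L}^{*}$, $\mathfrak{L}_{S}$ have the same irreducible constituents, they are equivalent algebras,
but are not necessarily identical. It is easy to construct examples where $\mathfrak{L}^{*}$
is not $\mathfrak{L}_{S}$.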

Let us suppose ^1 , * * • , & g are a set of elementary modules for the algebra 8 . 
We wish to give explicitly the relation between the modules of 8 , and the 
elementary modules £ of 91. 

The irreducible constituents of 8 are equivalent to irreducible constituents 
of 91. For simplicity, let us suppose they are identical with constituents of 91. 
Let ^ be of type (k, X) and belong to the <r th power of the radical of 8 . We may 
consider ^ as a module related to 91 by setting R(A) = ft (A). As such, 
belongs to 9T. Let B be any element of 8 , and N, an element of 91'. Then 

F'(B)R(N „) = F K (B)Rm = 





But E a 33 E* f mod 3* , so that EsN ff == E*N ff , mod 3i +1 , and as |> belongs to 
the <r th power of the radical 3« of 3, we have 

H(EsNa) = #(£*#,) = = H(B*N ,). 

We may then view £ as an (21, 21) module of type (x, X), related and belonging 
to 9T, such that 

(AT„) = F'(B)H(N ff ) = #(£*#,) 

with a similar relation for right operators. Applying Lemma 1, we have that 
as a module related to 9i a , § is a linear combination of the elementary modules 
of type (k, X) which belong to 9T. 11 It follows, also, that a simple part S 
of 3 of type (*, X) which belongs to the <r th power of the radical of 21 may, con¬ 
sidered as a module related to 9T, be expressed as a linear combination of the 
elementary modules §Jx of type (k, X) which belong to 9T. 

If 21* = 21 5 then the relation 

F.(B)H(A) = H(B*A) 

holds for all elements A of 21. Again applying Lemma 1 we obtain that ^ 
considered as a module related to 21 itself is expressible as a linear combination 
of the elementary modules $ of type (x, X) which belong to 9t r , r g a. 

If 21* is not identical to fl s , that is, if 21* is not completely decomposed, then 
we can go over to an equivalent representation 3, such that 3* = 3s where 3*, 
3s have the same meaning with regard to 3 as do 3* and 3s in regard to 3. 
We may then state: 

Theorem 8: To any representation of $\mathfrak{A}$ there exists an equivalent representation
whose simple parts, considered as modules related to $\mathfrak{A}$, are expressible as linear
combinations of the elementary modules $\mathfrak{H}$ of $\mathfrak{A}$.

5 

It is easy now to discuss the center of $\mathfrak{A}$. We note that

Theorem 9: For a center element $Z$ of $\mathfrak{A}$, $C_{ij}(Z) = 0$ if $\mathfrak{C}_{ij}$ is a simple part
of mixed type, and $C_{ij}(Z)$ is of form $(c \cdot \delta_{mn})$, $c$ an element of $K$, if $\mathfrak{C}_{ij}$ is of
unmixed type.

Proof: Since an element $Z$ of the center must commute with the elements
$E_{\kappa}(mn)$ of $\mathfrak{A}$ $(\kappa = 1, 2, \dots, k;\ m, n = 1, 2, \dots, f_{\kappa})$ an easy calculation shows
that in terms of the Cartan basis elements

$Z = \sum c_{\tau}(Z)\, E_{\kappa}(r1) B^{\tau}_{\kappa\kappa} E_{\kappa}(1r),$


11 If we do not take the constituents of 21 identical with constituents of 21 then it is 
necessary to apply suitable left and right non-singular matrix multipliers to the $k\ in the 
expression for 





in which the range for $\tau$ is determined by the $s$ basis elements $B^{\tau}$ of unmixed
type. It follows that $H^{\mu}(Z) = 0$ if $\mathfrak{H}^{\mu}$ is of mixed type, and

$H^{\mu}(Z) = (c_{\mu}(Z)\, \delta_{ab})_{ab}$

if $\mathfrak{H}^{\mu}$ is of unmixed type. Applying Theorem 4 we obtain the theorem.

It should be noted that this theorem can be obtained directly from Schur’s 
Lemma and (2). 

6 

If an element is of type $(\kappa, \lambda)$ we say that $\kappa$, $\lambda$ are the type indices of the
element. We say that two non-zero simple parts $\mathfrak{C}_{ij}$, $\mathfrak{C}_{pq}$ belong to the same
block¹² if there exists a chain of non-zero parts such that any two neighboring
parts have at least one type index in common. This relation is reflexive, symmetric,
and transitive so that a separation of the non-zero simple parts into
distinct disjoint classes is obtained.
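
The block relation just defined can be read as connectivity in a graph: each non-zero simple part of type $(\kappa, \lambda)$ ties the two type indices together, and a block collects the parts whose indices end up connected. The following is only a sketch under that reading. The function name `blocks` and the sample list of types are hypothetical and do not come from the paper.

```python
def blocks(types):
    """Group (kappa, lambda) type pairs into blocks.

    Two parts are neighbours when they share a type index; a block is a
    connected component of that relation (computed here with union-find).
    """
    parent = {}

    def find(v):
        # Follow parent links to the representative, halving the path as we go.
        while parent[v] != v:
            parent[v] = parent[parent[v]]
            v = parent[v]
        return v

    for kappa, lam in types:
        for idx in (kappa, lam):
            parent.setdefault(idx, idx)
        # A non-zero part of type (kappa, lam) links the two indices.
        parent[find(kappa)] = find(lam)

    groups = {}
    for t in types:
        groups.setdefault(find(t[0]), []).append(t)
    return sorted(sorted(g) for g in groups.values())


# Hypothetical types of the non-zero simple parts of some algebra.
example = [(1, 1), (1, 2), (2, 2), (3, 3), (3, 4), (4, 4), (5, 5)]
print(blocks(example))
```

Working on type indices rather than on pairs of parts keeps the sketch short: a chain of parts with a common index between neighbours corresponds exactly to a path through the linked indices.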

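Theorem 10: Each simple part $\mathfrak{C}_{ij}$ in the $i$-th row of $\mathfrak{A}$ is either a 0-part or belongs
to the same block as $\mathfrak{C}_{ii}$. Similarly, each simple part in the $j$-th column is either a
0-part or belongs to the same block as $\mathfrak{C}_{jj}$. If $\mathfrak{C}_{ii}$, $\mathfrak{C}_{jj}$ do not belong to the same
block, then $\mathfrak{C}_{ij}$ is a 0-part.

For if the simple part $\mathfrak{C}_{ij}$ is not a 0-part, then $\mathfrak{C}_{ii}$, $\mathfrak{C}_{ij}$, $\mathfrak{C}_{jj}$ is a proper chain
and so $\mathfrak{C}_{ii}$, $\mathfrak{C}_{ij}$, $\mathfrak{C}_{jj}$ all belong to the same block.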

In the same way we may classify the basis elements

(11) $B^{\mu}_{\kappa\lambda}$, $(\mu = 1, 2, \dots, m;\ \kappa, \lambda = 1, 2, \dots, \text{or } k)$

into blocks. We say that $B^{\mu}_{\kappa\lambda}$, $B^{\tau}_{\rho\sigma}$ belong to the same block if there exists a
chain of elements of the set (11) connecting $B^{\mu}_{\kappa\lambda}$, $B^{\tau}_{\rho\sigma}$ such that any two
neighboring elements in this chain have at least one type index in common.
Let us denote by $\mathfrak{B}$ the set consisting of all the $B^{\tau}_{\rho\sigma}$ belonging to a given block
and their associated elements of form $E_{\rho}(a1) B^{\tau}_{\rho\sigma} E_{\sigma}(1b)$. If $A$ is any element
of $\mathfrak{A}$, then $A \cdot E_{\rho}(a1) B^{\tau}_{\rho\sigma} E_{\sigma}(1b)$ is expressible in terms of $\mathfrak{B}$ again; the similar
statement is true for the product $E_{\rho}(a1) B^{\tau}_{\rho\sigma} E_{\sigma}(1b) \cdot A$. It follows that the elements
of $\mathfrak{B}$ form a basis of an invariant subalgebra of $\mathfrak{A}$: we denote this invariant
subalgebra by $\mathfrak{A}^{+}$.

We can now prove 

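Theorem 11: By elementary transformations $\mathfrak{A}$ may be decomposed into the form

(12) $\mathfrak{A} = \begin{pmatrix} \mathfrak{B}_{1} & & 0 \\ & \ddots & \\ 0 & & \mathfrak{B}_{r} \end{pmatrix}$

where each $\mathfrak{B}_{i}$ contains all simple parts belonging to one block. The elements of $\mathfrak{A}$
which have only zeros in the parts $\mathfrak{B}_{j}$, $j \neq i$, form an invariant subalgebra $\mathfrak{A}_{i}$ of $\mathfrak{A}$,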

12 R. Brauer and C. Nesbitt, On the Modular Representations of Finite Groups, Univ. of
Toronto Studies, Math. Series, No. 4 (1937); R. Brauer and C. Nesbitt, On the Modular
Character of Groups, these Annals, Vol. 42 (1941), pp. 556-590; T. Nakayama, Some Studies
on Regular Representations, Induced Representations, and Modular Representations, these
Annals, Vol. 39 (1938), Theorem 5.





and $\mathfrak{A}$ is the direct sum of these invariant subalgebras $\mathfrak{A}_{i}$. The $\mathfrak{A}_{i}$ may not be
decomposed into a direct sum of invariant subalgebras.

Proof: If $\mathfrak{C}_{ii}$, $\mathfrak{C}_{i+1,i+1}$ do not belong to the same block, then by Theorem 10,
$\mathfrak{C}_{i+1,i}$ is a 0-part. It follows that by permuting the $i$-th row with the $(i+1)$-st
row and the $i$-th column with the $(i+1)$-st column we may reverse the positions
of $\mathfrak{C}_{ii}$, $\mathfrak{C}_{i+1,i+1}$ in the main diagonal of $\mathfrak{A}$, but leave $\mathfrak{A}$ in reduced form.
By successive transformations of this form we may bring together in an upper
part $\mathfrak{B}_{1}$ of $\mathfrak{A}$ all $\mathfrak{C}_{ii}$ which belong to the same block as $\mathfrak{C}_{11}$. We then have

$\mathfrak{A} = \begin{pmatrix} \mathfrak{B}_{1} & 0 \\ \mathfrak{D}_{21} & \mathfrak{D}_{22} \end{pmatrix}.$

But since the simple parts in $\mathfrak{B}_{1}$ and the $\mathfrak{C}_{ii}$ in $\mathfrak{D}_{22}$ belong now to different
blocks, then the simple parts in $\mathfrak{D}_{21}$ must all be 0-parts, or, that is, $\mathfrak{D}_{21}$ is a 0-part.
By continuing this process we obtain $\mathfrak{A}$ in the form (12).

To prove the remaining statements in the theorem we first remark that to
each basis element $B^{\mu}_{\kappa\lambda}$ there corresponds at least one non-zero simple part $\mathfrak{C}_{ij}$
of type $(\kappa, \lambda)$. For, by definition, $B^{\mu}_{\kappa\lambda}$ is a matrix of $\mathfrak{A}$ having non-zero coefficients
only in the upper left corners of simple parts of type $(\kappa, \lambda)$.

Let now $B^{\tau}_{\rho\sigma}$ be an element of the set $\mathfrak{B}$ of Cartan basis elements which belong
to a given block. If the simple parts of type $(\rho, \sigma)$ are in part $\mathfrak{B}_{i}$ of $\mathfrak{A}$, then by
the above remark $B^{\tau}_{\rho\sigma}$ is an element of the invariant subalgebra $\mathfrak{A}_{i}$. If $B^{\pi}_{\mu\nu}$
is also from the set $\mathfrak{B}$, that is, if $B^{\pi}_{\mu\nu}$ belongs to the same block as $B^{\tau}_{\rho\sigma}$, then the
simple parts of type $(\mu, \nu)$ belong to the same block as simple parts of type
$(\rho, \sigma)$. For a simple part of type $(\rho, \sigma)$ may be connected by a chain of non-zero
simple parts to a simple part of type $(\mu, \nu)$, the members of this chain
corresponding to the members of the chain connecting $B^{\tau}_{\rho\sigma}$, $B^{\pi}_{\mu\nu}$. Then the
simple parts of type $(\mu, \nu)$ are in the part $\mathfrak{B}_{i}$ and so $B^{\pi}_{\mu\nu}$ also is an element of $\mathfrak{A}_{i}$.
It follows that the invariant subalgebra $\mathfrak{A}^{+}$, generated by the elements of $\mathfrak{B}$,
coincides with $\mathfrak{A}_{i}$. We now have at our disposal a correspondence between
invariant subalgebras determined by blocks of Cartan basis elements on the
one hand, and by blocks of simple parts on the other.

Now let 

(13) 21 = Si + «.+ •••+«, 

be a direct decomposition of 21 into invariant subalgebras which may not be 
directly decomposed, and let Aj be an element of 21, . If in the expression 

Aj = £ hMAAE'ialWMlb) 

for Aj in terms of the Cartan basis elements, K b (Aj) 0, then since 

E K (la)AjEx(bl) = K b {Aj)Bix 

is in 8 /, so also is B$\ . In this way we can determine the direct summand of 
the decomposition (13) to which each element BIx belongs. In particular, we 
may determine the summands to which the elements E K ( 11), (*c = 1, 2, • • • , k) 
belong. If Bfx belongs to 8 ; then E K ( 11), 2?x(ll) must also belong to 8/, since 

^(ll)BJx = B5A(11) = Bix 





and if, for instance, 2?«( 11) did not belong to the same summand as BH\ , the 
product E K (ll)Bti would be zero. 

Suppose now that the element B T pa of the set SB is in ft, but that there are 
elements of SB which are not in ft/. Then we may choose an element 
of SB which is not in ft /, but such that in the chain joining B T pa to Bf v , B py is the 
only member which does not belong to ft /. The member preceding Bf ¥ in this 
chain has either the type index p or v\ let this member be B *\. Then by the 
above, jE?„( 11) is in ft/, since B*\ is in ft/ ; on the other hand 2^(11) must be in 
the same summand as BJ ¥ , which gives a contradiction. Thus all elements of 
the set SB are in ft/. Similarly, if any member of a second set SBi of Cartan 
basis elements which all belong to one block is in ft /, then all members of the 
set SBi are in ft /. But then the invariant subalgebras 2l + , ?li" which have as 
bases the sets SB, SBi , respectively, would be direct summands of ft / contrary to 
our assumption that ft/is directly indecomposable. We now have 31* = 9l + = ft/ 
which completes our proof. 

We suppose now that each $\mathfrak{B}_{i}$ has been split into its indecomposable parts¹³
and that these are in reduced form. Let $U^{i}_{1}, U^{i}_{2}, \dots, U^{i}_{n}$ be the indecomposable
parts of $\mathfrak{B}_{i}$. We now classify the $U^{i}$ by the following relation: we say
that $U^{i}_{\rho}$, $U^{i}_{\sigma}$ belong to the same block if there exists a chain¹⁴

(14) $U^{i}_{\rho}, \dots, U^{i}_{\sigma}$

such that any two neighboring parts of the chain (14) have at least one irreducible
constituent in common. We will show that all indecomposable parts $U^{i}_{\rho}$
$(\rho = 1, 2, \dots, n)$ of $\mathfrak{B}_{i}$ belong to one block. Let us take together all $U^{i}_{\rho}$ that
are connected with $U^{i}_{1}$ by such a chain and suppose $U^{i}_{\sigma}$ does not belong to this
set. Then the simple parts of $U^{i}_{\sigma}$ cannot be connected by a proper chain of
non-zero simple parts to those of $U^{i}_{1}$, contrary to our provision that simple parts of $\mathfrak{B}_{i}$
all belong to one block.

Since a matrix commuting with an indecomposable part $U^{i}_{\rho}$ can have just
one distinct characteristic root¹⁵ and since, as we have seen, all $U^{i}_{\rho}$ of $\mathfrak{B}_{i}$ have
at least one irreducible constituent in common, then for an element $Z$ of the
center of $\mathfrak{A}$ the matrix $B_{i}(Z)$ of $\mathfrak{B}_{i}$ corresponding to $Z$ has just one distinct
characteristic root.

We have then proved

Theorem 12: The indecomposable constituents of a part $\mathfrak{B}_{i}$ may be characterized
as belonging to a block. It follows that for a center element $Z$, the corresponding
matrix $B_{i}(Z)$ has only one distinct characteristic root.

University of Alabama 


13 For definitions of indecomposable parts, see B.N., or N.R. 

14 R. Brauer and C. Nesbitt, On the Modular Representations of Groups of Finite Order , 
Univ. of Toronto Studies, Math. Series, No. 4 (1937), p. 14. 

15 R. Brauer and I. Schur, Zum Irreduzibilitätsbegriff in der Theorie der Gruppen linearer
homogener Substitutionen, Sitzber. der Preuss. Akad. der Wiss., Phys-Math. Klasse, Berlin,
XIV (1930).



Annals of Mathematics
Vol. 43, No. 1, January, 1942 


QUADRATIC FORMS PERMITTING COMPOSITION 1 

By A. A. Albert 
(Received November 8, 1941) 

1. Introduction 

An associative algebra, with a unity quantity, over a field $\mathfrak{F}$ has an involution²
(involutorial anti-automorphism) $J$ such that $x + x^{J}$ and $xx^{J}$ are in $\mathfrak{F}$ if it is
either $\mathfrak{F}$, an algebra of order two over $\mathfrak{F}$, a purely inseparable field the squares
of whose elements are in $\mathfrak{F}$ of characteristic two, or a generalized quaternion
algebra over $\mathfrak{F}$. The quaternion algebras have order four and may be imbedded
in a generalized Cayley-Dickson³ algebra of order eight over $\mathfrak{F}$. All the algebras
so obtained are then alternative⁴ algebras with a unity quantity and an involution
$J$ such that $x + x^{J}$ and $xx^{J}$ are in $\mathfrak{F}$. Then the norm $xx^{J}$ is a quadratic
form in the coordinates of $x$ which permits composition.

In 1898 A. Hurwitz⁵ showed that if $\mathfrak{F}$ is the field $\mathfrak{C}$ of all complex numbers,
forms equivalent to the quadratic norm forms described above are the only
quadratic forms permitting composition. L. E. Dickson⁶ pointed out the connection
of these forms with the corresponding algebras and thus completed the
proof in the case $\mathfrak{F} = \mathfrak{C}$ of the following

Theorem. A quadratic form over $\mathfrak{F}$ permits composition if and only if it is
equivalent in $\mathfrak{F}$ to the norm form $xx^{J}$ of an alternative algebra over $\mathfrak{F}$ with a unity
quantity and an involution $J$ such that $x + x^{J}$ and $xx^{J}$ are in $\mathfrak{F}$. These latter
norm forms are quadratic forms in 1, 2, 4 or 8 indeterminates except for the diagonal
norm forms, in $2^{t}$ indeterminates, of purely inseparable fields of degree $2^{t}$ and
exponent two over $\mathfrak{F}$ of characteristic two.

It is readily verified that Dickson's version of the Hurwitz proof is valid for
any algebraically closed field of characteristic not two. However, there seems
to be no treatment of the case $\mathfrak{F}$ of characteristic two, a case presenting interesting
new features. We shall derive the results for this case here and shall indeed

1 Presented to the Society Nov. 21, 1941. 

2 An involution over $\mathfrak{F}$ is a linear transformation $J$ over $\mathfrak{F}$ of an algebra $\mathfrak{A}$ such that
$(ab)^{J} = b^{J} a^{J}$ and $J^{2}$ is the identity transformation.

3 For the definition and elementary properties of such algebras see L. E. Dickson, On
quaternions and their generalizations and the history of the eight square theorem, these Annals,
vol. 20 (1919), pp. 166-71.

4 See M. Zorn, Theorie der Alternativen Ringe, Hamburg Abh. vol. 8 (1930), pp. 123-47
and his Alternativkörper und Quadratische Systeme, loc. cit. vol. 9 (1933), pp. 396-402 for a
bibliography and the discussion of such algebras.

5 Göttingen Nachrichten, 1898, pp. 309-16.

6 See footnote 3.




provide an elegant unified study⁷ for the case where $\mathfrak{F}$ is arbitrary. Our results
will include a generalization⁸ of the Cayley-Dickson algebras to provide new
algebras of order $2^{t}$ over $\mathfrak{F}$. Certain of these algebras $\mathfrak{B}$ have connected quadratic
norm forms and the property that if these norm forms are not null forms
then every non-scalar quantity of $\mathfrak{B}$ defines a quadratic subfield. However $\mathfrak{B}$
need not then be a division algebra⁹ unless $t < 4$.

2. The general Hurwitz problem 

Let $x_{1}, \dots, x_{n}$, $y_{1}, \dots, y_{m}$ be independent indeterminates over a field $\mathfrak{F}$,
$\mathfrak{H} = \mathfrak{F}(x_{1}, \dots, x_{n}, y_{1}, \dots, y_{m})$ be the corresponding rational function field.

Designate the transpose of any matrix $S$ with elements in $\mathfrak{H}$ by $S'$ and define
the one rowed matrices

$x = (x_{1}, \dots, x_{n})$, $y = (y_{1}, \dots, y_{m})$.

Then if $A$ is any $n$-rowed square matrix with elements in $\mathfrak{F}$ the matrix product
$f(x) = xAx'$ is a quadratic form in $x_{1}, \dots, x_{n}$. Conversely every quadratic
form $f(x)$ is expressible as such a product. However $A$ is not unique and
$xAx' = xA_{0}x'$ if and only if $A - A_{0}$ is an alternate (that is, skew-symmetric)
matrix.
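
As a small numerical illustration of this non-uniqueness, the sketch below takes two different integer matrices whose difference is alternate and checks that they define the same quadratic form on a grid of sample vectors. The matrices `A` and `A0` and the helper names are chosen here for illustration only; the check is over the integers, not over a general field $\mathfrak{F}$.

```python
from itertools import product


def quad(x, A):
    """Value of the quadratic form x A x' for a row vector x."""
    n = len(x)
    return sum(x[i] * A[i][k] * x[k] for i in range(n) for k in range(n))


def is_alternate(N):
    """Skew-symmetric with zero diagonal (the diagonal condition matters in characteristic two)."""
    n = len(N)
    zero_diagonal = all(N[i][i] == 0 for i in range(n))
    skew = all(N[i][k] == -N[k][i] for i in range(n) for k in range(n))
    return zero_diagonal and skew


# Two illustrative matrices representing the same form.
A = [[1, 4, 0], [0, 2, 5], [0, 0, 3]]
A0 = [[1, 1, 2], [3, 2, 0], [-2, 5, 3]]
diff = [[A[i][k] - A0[i][k] for k in range(3)] for i in range(3)]

same_form = all(quad(x, A) == quad(x, A0)
                for x in product(range(-2, 3), repeat=3))
print(is_alternate(diff), same_form)
```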

Suppose that $B$ is an $m$-rowed square matrix so that $g(y) = yBy'$ is a second
quadratic form. Then the general Hurwitz problem¹⁰ is that of determining
under what conditions on $f(x)$ and $g(y)$ there exist quantities $\gamma_{ijk}$ in $\mathfrak{F}$ such that

(1) $f(x)g(y) = f(z),$

where $z = (z_{1}, \dots, z_{n})$ and

(2) $z_{k} = \sum_{i=1,\dots,n;\ j=1,\dots,m} x_{i} \gamma_{ijk} y_{j}$ $(k = 1, \dots, n).$

Write $G_{j} = (\gamma_{ijk})$, so that $G_{j}$ is an $n$-rowed square matrix with elements in $\mathfrak{F}$.
Define

7 The argument required to make the extension of our theorem from the case where our 
field of reference is an algebraically closed field of characteristic not two to that where it 
is an arbitrary field seems not to be in the literature and will be a major contribution of our 
discussion. 

8 The formulation by L. E. Dickson of the Cayley algebras is sufficiently general so that 
we are able to use it to define such algebras over a field of characteristic two. These 
algebras are new, however, and so are the other more general types (for example, of order 16) 
yielded by the Dickson process and given here. 

9 We define algebras which are generalizations of the Cayley-Dickson algebras to algebras
of order $2^{t}$ and with an involution $J$ such that $x + x^{J}$ and $xx^{J} = x^{J}x$ are in $\mathfrak{F}$. For
proper choice of the defining parameters $xx^{J}$ is a quadratic form in $2^{t}$ squares. If $\mathfrak{F}$ is the
field of all real numbers each element of $\mathfrak{B}$ defines a subalgebra equivalent to the complex
field. But $\mathfrak{B}$ is not a division algebra for $t > 3$. We shall show this in a later paper where
the conditions that our algebras be division algebras will be discussed.

10 See his posthumous paper in the Math. Annalen, vol. 88 (1922), pp. 1-26. 





(3) $G_{y} = G_{1}y_{1} + \cdots + G_{m}y_{m},$

so that if $e_{i} = (0, 0, \dots, 1, 0, \dots, 0)$ with 1 in the $i$-th column then

(4) $G_{e_{i}} = G_{i}.$

We see that (2) is equivalent to the statement that $z$ is the matrix product

(5) $z = xG_{y}.$

Then (1) holds if and only if $x[g(y)A]x' = xG_{y}A(G_{y})'x'$, that is, if and only if

(6) $N = g(y)A - G_{y}A(G_{y})'$

is an alternate matrix. Thus (1) holds if and only if

(7) $g(y)E = G_{y}E(G_{y})'$, $E = A + A',$

and the diagonal elements of $N$ are all zero.

Let $\phi(\xi) = \xi C \xi'$ and $\psi(\zeta) = \zeta D \zeta'$ be equivalent to $f(x)$ so that there exist
non-singular matrices $P$ and $Q$ such that $\xi C \xi' = \xi PAP' \xi'$, $\zeta D \zeta' = \zeta QAQ' \zeta'$. Then
$N_{1} = C - PAP'$ and $N_{2} = D - QAQ'$ are alternate matrices. If (1) and (2)
hold the matrix $N$ of (6) is alternate and so is $PNP' = g(y)PAP' - \Gamma_{y}QAQ'(\Gamma_{y})'$,
where $\Gamma_{y} = PG_{y}Q^{-1}$. Then the matrix

$N_{0} = g(y)C - \Gamma_{y}D(\Gamma_{y})' = PNP' + g(y)N_{1} - \Gamma_{y}N_{2}(\Gamma_{y})'$

is also alternate. Let also $\rho(\eta)$ be equivalent to $g(y)$ so that there exists a
non-singular matrix $S$ such that $g(\eta S) = \rho(\eta)$. Then if $\Lambda_{\eta} = \Gamma_{y}$, $y = \eta S$, the
matrix

$\rho(\eta)C - \Lambda_{\eta}D(\Lambda_{\eta})' = N_{0}$

is alternate and

(8) $\phi(\xi)\rho(\eta) = \psi(\zeta)$

for $\zeta = (\zeta_{1}, \dots, \zeta_{n})$ and the $\zeta_{k}$ bilinear forms with coefficients in $\mathfrak{F}$ in the $\xi_{i}$
and the $\eta_{j}$. Thus we have shown that (1) is possible if and only if (8) is
possible for quadratic forms $\phi(x)$ and $\psi(x)$ equivalent to $f(x)$ and $\rho(\eta)$ equivalent
to $g(y)$.

The principal purpose of our discussion is the derivation of a complete solution
of the Hurwitz problem in the case $m = n$ and $g(x)$ equivalent to $f(x)$.
Thus we shall stress the study of

(9) $f(x)f(y) = f(z),$

for $z$ as in (2) with $m = n$. If $f(x)$ has the property (9) we shall say that the
quadratic form $f(x)$ permits composition. Then our principal object will be
the determination of the conditions on $f(x)$ that it permit composition.
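
The simplest instance of (9) is the classical two-square identity. The sketch below takes $f(x) = x_{1}^{2} + x_{2}^{2}$ and one possible choice of the matrix $G_{y}$, so that $z = xG_{y}$ is bilinear in $x$ and $y$ as in (2). The function names and the particular $G_{y}$ are assumptions made for this illustration; they are not taken from the paper.

```python
def G(y):
    """One choice of G_y = G_1*y1 + G_2*y2 for f(x) = x1^2 + x2^2, with G_1 = I."""
    y1, y2 = y
    return [[y1, y2], [-y2, y1]]


def f(x):
    """The quadratic form x1^2 + x2^2."""
    return x[0] ** 2 + x[1] ** 2


def compose(x, y):
    """z = x G_y, so each z_k is a bilinear form in the x_i and y_j."""
    g = G(y)
    return tuple(sum(x[i] * g[i][k] for i in range(2)) for k in range(2))


x, y = (1, 2), (3, 4)
z = compose(x, y)
print(z, f(x) * f(y) == f(z))
```

Here $z = (x_{1}y_{1} - x_{2}y_{2},\ x_{1}y_{2} + x_{2}y_{1})$, which is multiplication of complex numbers written in coordinates.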





3. The non-singular case 

Let $f(x)$ be a quadratic form given by

(10) $f(x) = \sum_{i,k=1}^{n} x_{i} \alpha_{ik} x_{k}$ $(\alpha_{ik}$ in $\mathfrak{F})$,

so that $f(x) = xAx'$ for $A$ the $n$-rowed square matrix $(\alpha_{ik})$. If the characteristic
of $\mathfrak{F}$ is not two we may take $A$ to be a non-singular diagonal matrix so
that $\alpha_{ik} = 0$ for $i \neq k$, $\alpha_{ii} \neq 0$. However, when $\mathfrak{F}$ has characteristic two the
rank of $A + A'$ is an even integer $2r$, and $n \ge 2r$. Then¹¹

(11) $f(x) = \alpha_{1}x_{1}^{2} + \cdots + \alpha_{n}x_{n}^{2}$ $(\alpha_{i} \neq 0$ in $\mathfrak{F})$

if $r = 0$, and otherwise

(12) $f(x) = \alpha_{1}x_{1}^{2} + \cdots + \alpha_{n}x_{n}^{2} + (x_{1}x_{r+1} + \cdots + x_{r}x_{2r}).$

Here we assume that if $n > 2r$ the quantities $\alpha_{2r+1}, \dots, \alpha_{n}$ are all not zero.
Moreover if $0 < i \le r$ and both $\alpha_{i}$ and $\alpha_{i+r}$ are zero the transformation $x_{i} = \xi_{i}$,
$x_{i+r} = \xi_{i} + \xi_{i+r}$ gives $x_{i}x_{i+r} = \xi_{i}^{2} + \xi_{i}\xi_{i+r}$. If $\alpha_{i} = 0$, $\alpha_{i+r} \neq 0$ we may interchange
the corresponding indeterminates. Thus we may assume in all cases
that if $r > 0$ the quantities $\alpha_{1}, \dots, \alpha_{r}$ are all not zero.

If $\mathfrak{F}$ has characteristic not two and $A$ is a non-singular diagonal matrix as
above then $E = A + A' = 2A$ is non-singular. However when $\mathfrak{F}$ has characteristic
two the matrix $E$ is non-singular if and only if $n = 2r$,

(13) $A = \begin{pmatrix} D_{1} & I \\ 0 & D_{2} \end{pmatrix}$, $E = A + A' = \begin{pmatrix} 0 & I \\ I & 0 \end{pmatrix}$,

where $D_{1}$ and $D_{2}$ are diagonal matrices and $D_{1}$ is non-singular. We shall show
later that a quadratic form $f(x)$ permits composition only if it is either a diagonal
form (11) or is equivalent to a form $xAx'$ with $A + A'$ non-singular. Let us
then study (1) temporarily only in this non-singular case, and treat the remaining
cases later.

If we replace $y = (y_{1}, \dots, y_{m})$ in $g(y)$ by $\eta = (\eta_{1}, \dots, \eta_{m})$ for the $\eta_{i}$ in $\mathfrak{F}$
we obtain a quantity $\gamma = g(\eta)$ in $\mathfrak{F}$ which we say is represented by $g(y)$. Then
we have

Lemma 1. Let $\gamma \neq 0$ in $\mathfrak{F}$ be any quantity represented by $g(y)$ in the non-singular
case of (1). Then $f(x)$ is equivalent to $\gamma^{-1}f(x)$.

For by (1) we have $f(x)\gamma = f(u)$ where $u = xP$ and $P = G_{\eta}$. Thus $f(x) =
\gamma^{-1}f(u)$ is equivalent to $\gamma^{-1}f(x)$ if we can show that the matrix $P$ is non-singular.
But $x\gamma Ax' = xPAP'x'$ if and only if $\gamma A - PAP'$ is alternate. Hence $\gamma E =
PEP'$ and $P$ must be non-singular if $\gamma \neq 0$ and $E$ is non-singular.


11 For these results see Chapter II of my Symmetric and alternate matrices in an arbitrary 
field , Transactions of the A. M. S., vol. 43 (1938), pp. 386-436. 





Write

(14) $g(y) = \sum_{i,j=1}^{m} y_{i} \beta_{ij} y_{j},$

where we are taking $\beta_{11} \neq 0$. Then (1) implies that $\phi(x)g(y) = \phi(z)$ where
$\phi(x) = \alpha_{11}f(x)$. Also $g(y) = \beta_{11}\psi(y)$ and $\beta_{11} = g(\eta)$, $\eta = (1, 0, \dots, 0)$. It
follows that $\phi(x)$ is equivalent to $\beta_{11}\phi(x)$ and that then $[\beta_{11}^{-1}\phi(x)] \cdot [\beta_{11}\psi(y)] = \phi(z)$,
$\phi(x)\psi(y) = \phi(z)$. Thus we have shown that in the non-singular case there is no
loss of generality if we assume that $\alpha_{11} = \beta_{11} = 1$.

Note that in the study of quadratic forms $f(x)$ permitting composition
Lemma 1 implies that $f(x)$ is equivalent to $\alpha_{11}^{-1}f(x)$ and thus that we need only
study forms $f(x)$ with $\alpha_{11} = 1$ in the non-singular case.

We now write (7) in the equivalent form

(15) $G_{y}(G_{y})^{J} = g(y)I,$

where $I$ is the $n$-rowed identity matrix and

(16) $(G_{y})^{J} = E(G_{y})'E^{-1} = G_{1}^{J}y_{1} + \cdots + G_{m}^{J}y_{m}.$

It is well known that the correspondence $G \to G^{J} = EG'E^{-1}$ is an involution
of the algebra of all $n$-rowed square matrices with elements in $\mathfrak{H}$, and we shall
use the consequent properties of $J$.
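
The two defining properties of an involution, $(G^{J})^{J} = G$ and $(GH)^{J} = H^{J}G^{J}$, can be checked directly for the map $G \to EG'E^{-1}$ when $E$ is symmetric and non-singular. The sketch below does this for 2-rowed rational matrices. The matrix `E`, which comes from an assumed $A$, and the helper names are illustrative choices rather than anything fixed by the text.

```python
from fractions import Fraction


def matmul(X, Y):
    """Product of two 2-rowed square matrices."""
    return [[sum(X[i][k] * Y[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def transpose(X):
    return [[X[j][i] for j in range(2)] for i in range(2)]


def inverse(X):
    """Inverse of a 2-rowed integer matrix, in exact rational arithmetic."""
    det = X[0][0] * X[1][1] - X[0][1] * X[1][0]
    return [[Fraction(X[1][1], det), Fraction(-X[0][1], det)],
            [Fraction(-X[1][0], det), Fraction(X[0][0], det)]]


# E = A + A' for the assumed A = [[1, 1], [0, 2]]; E is symmetric with determinant 7.
E = [[2, 1], [1, 4]]


def J(G):
    """The map G -> E G' E^{-1}."""
    return matmul(matmul(E, transpose(G)), inverse(E))


G1 = [[1, 2], [3, 4]]
H1 = [[0, 1], [5, -2]]
print(J(J(G1)) == G1, J(matmul(G1, H1)) == matmul(J(H1), J(G1)))
```

The symmetry of $E$ is what makes $J$ have period two: $(G^{J})^{J} = E(E')^{-1}GE'E^{-1}$, which reduces to $G$ when $E' = E$.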

The equation (7) is an identity in the indeterminates $y_{1}, \dots, y_{m}$ and is
equivalent, in view of (14), to

(17) $G_{i}G_{i}^{J} = \beta_{ii}I$, $G_{i}G_{j}^{J} + G_{j}G_{i}^{J} = \delta_{ij}I$ $(i \neq j;\ i, j = 1, \dots, m),$

where

(18) $\delta_{ij} = \delta_{ji} = \beta_{ij} + \beta_{ji}.$

In particular $G_{1}G_{1}^{J} = I$ since we have taken $\beta_{11} = 1$. Then also $G_{1}^{J}G_{1} = I$.
We now define

(19) $G_{y}^{(0)} = G_{1}^{J}G_{y} = G_{1}^{(0)}y_{1} + \cdots + G_{m}^{(0)}y_{m}$

and clearly have

(20) $G_{1}^{(0)} = I$, $G_{y}^{(0)}(G_{y}^{(0)})^{J} = G_{1}^{J}(G_{y}G_{y}^{J})G_{1} = g(y)G_{1}^{J}G_{1} = g(y)I,$

so that $G_{y}^{(0)}$ is a solution of (7) if $G_{y}$ is. But also $G_{1}^{J} = G_{1}^{-1}$ and $g(e_{1}) = 1$, (6) states
that $A - G_{1}AG_{1}'$ is alternate and so is $G_{1}^{J}A(G_{1}^{J})' - A$. Then by (6)

$N_{0} = [A - G_{1}^{J}A(G_{1}^{J})']g(y) + G_{1}^{J}[g(y)A - G_{y}A(G_{y})'](G_{1}^{J})' = Ag(y) - G_{y}^{(0)}A(G_{y}^{(0)})'$

is an alternate matrix. It follows that $G_{y}$ defines a solution (5) of (2) so that (1)
holds if and only if $G_{y}^{(0)}$ does, and we may thus assume that

(21) $G_{1} = G_{1}^{J} = I.$





Equations (17) for $i = 1$ now give

(22) $G_{j}^{J} = \delta_{1j}I - G_{j}$ $(j = 2, \dots, m),$

while $G_{1}^{J} = 2I - G_{1}$. But then

(23) $(G_{y})^{J} = t(G_{y}) - G_{y},$

where the trace function, $t(G_{y})$, is the scalar matrix

(24) $G_{y} + (G_{y})^{J} = \big(2y_{1} + \sum_{j=2}^{m} \delta_{1j}y_{j}\big)I$

and is linear in the coordinates $y_{i}$ of $y$. Thus the mapping

$y \to G_{y}$

is a linear mapping of the linear space $\mathfrak{L}$ of one by $m$ vectors $y$ on the linear
space of matrices $G_{y}$. Moreover if we define $e = (1, 0, \dots, 0)$,

(25) $y^{J} = \big(2y_{1} + \sum_{j=2}^{m} \delta_{1j}y_{j}\big)e - y,$

we see that $J$ is a linear transformation on $\mathfrak{L}$ such that

(26) $G_{y^{J}} = (G_{y})^{J}.$

But $[(G_{y})^{J}]^{J} = G_{y}$ and hence

(27) $(y^{J})^{J} = y.$

We shall now specialize our study of (1) to the case $m = n$ and $g(y) = f(y)$.


4. Composition in the non-singular case 

Let $f(x)$ be a quadratic form permitting composition. Then

(28) $f(x)f(y) = f(xG_{y}),$

where the elements of the $n$-rowed square matrix $G_{y}$ are linear forms in
$y_{1}, \dots, y_{n}$ and we have seen that if $e = (1, 0, \dots, 0)$ we may take

(29) $G_{e} = I$, $f(e) = 1$, $G_{y}G_{y}^{J} = f(y)I$

such that (23) and (24) hold. Then the ordinary matrix product $z = xG_{y}$ gives
(2) and these equations may also be written as the first equation in

(30) $z = yH_{x} = xG_{y},$

where we define $\rho_{ijk} = \gamma_{jik}$ and have

(31) $H_{j} = (\rho_{ijk})$, $H_{x} = H_{1}x_{1} + \cdots + H_{n}x_{n}.$

But $f(y)f(x) = f(yH_{x})$ and our derivation of the properties of $G_{y}$ implies that
we may interchange the roles of $f(x)$ and $f(y)$ to obtain

(32) $H_{x}H_{x}^{J} = f(x)I,$



as a consequence of (15). It follows from (29) that

(33) $H_{e}H_{e}^{J} = I,$

and thus that $H_{e}$ is non-singular. Define

(34) $u = u(y) = yH_{e}$, $R_{u} = G_{y},$

so that $u = (u_{1}, \dots, u_{n})$, $y = uH_{e}^{-1}$ and $y_{1}, \dots, y_{n}$ are linear forms in
$u_{1}, \dots, u_{n}$. Then by (30) and (28) with $x = e$ we have

(35) $u = eG_{y} = yH_{e}$, $f(u) = f(eG_{y}) = f(e)f(y) = f(y).$

Moreover $u(e) = eG_{e} = e$ since $G_{e} = I$, $R_{u(e)} = G_{e} = I$. But then $f(x)f(y) =
f(xG_{y}) = f(xR_{u}) = f(x)f(u)$,

(36) $f(x)f(y) = f(xR_{y}).$

We have thus proved that $R_{y}$ is a solution $G_{y}$ of (2) such that

(37) $R_{e} = I$, $eR_{y} = y$, $R_{y}R_{y}^{J} = f(y)I$, $N = f(y)A - R_{y}A(R_{y})'$

is an alternate matrix.

Let now $\mathfrak{L}$ be the linear space of all $\xi = (\xi_{1}, \dots, \xi_{n})$ for $\xi_{i}$ in $\mathfrak{F}$. Then the
correspondence

(38) $\xi \to R_{\xi}$

is evidently a linear mapping of $\mathfrak{L}$ on the linear space of all the matrices $R_{\xi}$.
Define an operation

(39) $\xi \cdot \eta = \xi R_{\eta},$

on $\mathfrak{L}\mathfrak{L}$ to $\mathfrak{L}$ and call the result $\xi \cdot \eta$ the product of $\xi$ and $\eta$. Then $\mathfrak{L}$ becomes an
algebra $\mathfrak{A}$ of order $n$ over $\mathfrak{F}$. Indeed all algebras may be defined in this way.

We have $e \cdot \eta = eR_{\eta} = \eta$ by (37), and $\eta \cdot e = \eta R_{e} = \eta I = \eta$. Hence $\mathfrak{A}$ has $e$
as its unity quantity. We shall give further properties of $\mathfrak{A}$ in the next section.
We now relate this result to a certain class of algebras.

An algebra $\mathfrak{A}$ over $\mathfrak{F}$ is called alternative if

(40) $x(xy) = (xx)y, \quad (yx)x = y(xx)$

for every $x$ and $y$ of $\mathfrak{A}$. Then

$(x + y)[(x + y)x] = (x + y)(xx + yx) = x(xx) + x(yx) + y(xx) + y(yx)$
$= [(x + y)(x + y)]x = (xx + xy + yx + yy)x = (xx)x + (xy)x + (yx)x + (yy)x$.

By (40) we have $x(yx) = (xy)x$, a postulate
which is sometimes given as one of the alternative postulates but which is
actually a consequence of (40). We now prove

Lemma 2. Let $\mathfrak{A}$ be an algebra with a unity quantity and an involution $J$ such
that

(41) $t_x = x + x^J, \quad xx^J$

are in $\mathfrak{F}$. Then $x^J x = xx^J$, and $\mathfrak{A}$ is alternative if and only if

(42) $(xx^J)y = x(x^J y) = x^J(xy) = (yx)x^J = (yx^J)x$.





A. A. ALBERT 


For $xx^J = x(t_x - x) = t_x x - xx = (t_x - x)x = x^J x$ since $t_x$ is a scalar. Also

$x(x^J y) = x[(t_x - x)y] = t_x(xy) - x(xy)$,

and

$(xx^J)y = [x(t_x - x)]y = t_x(xy) - (xx)y$.

Thus $x(x^J y) = (xx^J)y$ if and only if $x(xy) = (xx)y$. Similarly $(yx)x^J = y(xx^J)$
if and only if $(yx)x = y(xx)$. However $(x^J)^J = x$ and if $\mathfrak{A}$ is alternative we
have $x^J(xy) = (x^J x)y = (xx^J)y$, $(yx^J)x = y(x^J x) = (xx^J)y$.

We next prove 

Lemma 3. Let $\mathfrak{A}$ be alternative and as in Lemma 1. Then the quadratic form
$xx^J$ in the coordinates $x_1, \ldots, x_n$ of the quantities $x = (x_1, \ldots, x_n)$ of $\mathfrak{A}$ permits
composition.

For it is clear that if $\mathfrak{K}$ is any scalar extension of $\mathfrak{F}$ the algebra $\mathfrak{A}_{\mathfrak{K}}$ has the
properties we have assumed for $\mathfrak{A}$. We let $x_1, \ldots, x_n, y_1, \ldots, y_n$ be independent
indeterminates over $\mathfrak{F}$ and $\mathfrak{K} = \mathfrak{F}(x_1, \ldots, x_n, y_1, \ldots, y_n)$. Then
$x = (x_1, \ldots, x_n)$ and $y = (y_1, \ldots, y_n)$ are non-zero quantities of $\mathfrak{A}_{\mathfrak{K}}$ and so
are $x^J$, $y^J$. Also $xx^J = x^J x = f(x)$ is in $\mathfrak{K}$. Then so are $f(y) = yy^J = y^J y$, $f(xy) =
(xy)(xy)^J$. But $[f(xy)]x^J = x^J[(xy)(xy)^J] = [x^J(xy)](y^J x^J) = f(x)[y(y^J x^J)] =
f(x)f(y)x^J$, $f(x)f(y) = f(xy)$ as desired.

We now prove the partial converse, 

Lemma 4. Let $f(x) = xAx'$ such that $A + A'$ is non-singular. Then $f(x)$
permits composition if and only if $f(x)$ is the norm form $xx^J$ of an alternative
algebra $\mathfrak{A}$ with a unity quantity and an involution $J$ such that $x + x^J$ and $xx^J$
are$^{12}$ in $\mathfrak{F}$.

For $f(x) = \sum x_i \alpha_{ij} x_j$ with $\alpha_{11} = 1$ and all the $\alpha_{ij}$ in $\mathfrak{F}$ and we define an
algebra $\mathfrak{A}$ over $\mathfrak{F}$ by (39) and have (37). Then if $\xi = (\xi_1, \ldots, \xi_n)$ with the $\xi_i$
in $\mathfrak{F}$ we have seen that $R_\xi + (R_\xi)^J = t_\xi I$ where $t = t_\xi$ is in $\mathfrak{F}$ and is determined
by $f(x)$, that is, actually

(43) $t_\xi = 2\xi_1 + \sum_{j=2}^{n} (\alpha_{1j} + \alpha_{j1})\xi_j$.

We now write 

(44) $\xi^J = t_\xi e - \xi$,

and have defined a correspondence $J$ which is evidently a linear transformation
on $\mathfrak{A}$. Now if $t = t_\xi$ we have

(45) $R_{\xi^J} = tI - R_\xi = (R_\xi)^J$

so that $t_{\xi^J} = t_\xi$, $(\xi^J)^J = \xi$ for every $\xi$ of $\mathfrak{A}$. Since $\xi\eta$ is not defined as a matrix
product we may drop the dot in (39). We define $\mathfrak{K} = \mathfrak{F}(x_1, \ldots, x_n,$

12 In fact we show that each solution $G_y$ of the consequent equation $z = xG_y$ defines an
algebra. Moreover those of these algebras which have unity quantities have the property
of our lemma.





$y_1, \ldots, y_n)$ as above and in $\mathfrak{A}_{\mathfrak{K}}$ have $x = (x_1, \ldots, x_n)$, $y = (y_1, \ldots, y_n)$,
$f(x)e = xx^J$ is the norm form of $\mathfrak{A}$ as desired. By the proof of Lemma 1
$xx^J = x^J x$. Finally

(46) $(yx)x^J = (yR_x)(R_{x^J}) = y(R_x R_x^J) = yf(x)$,

so that we have part of (42).

We note now that we may write $\xi\eta = \xi R_\eta = \eta L_\xi$ as in (30). But then
$L_\xi + (L_\xi)^J = t_\xi I$ since $t_\xi$ depends only on $\xi_1, \ldots, \xi_n$ and the coefficients of $f(x)$,
not on the choice of the solution $G_y$ of (2). Thus $(L_\xi)^J = L_{\xi^J}$. It follows that
$x^J(xy) = (yL_x)L_{x^J} = yL_x(L_x)^J = yf(x)$. We now use $f(x)f(y) = f(xy)$ and form
$x^J f(xy) = x^J[(xy)(xy)^J] = [x^J(xy)](xy)^J = f(x)y(xy)^J = f(x)f(y)x^J$. Thus
$y(xy)^J = (yy^J)x^J = y(y^J x^J)$. Multiplying by $y^J$ we have $y^J y(xy)^J = (y^J y)(y^J x^J)$,
$(xy)^J = y^J x^J$. But then $(\xi\eta)^J = \eta^J \xi^J$ for every $\xi$ and $\eta$ of $\mathfrak{A}$, $J$ defines an involution
of $\mathfrak{A}$ such that $\xi + \xi^J$ and $\xi\xi^J$ are in $\mathfrak{F}$, $\mathfrak{A}$ is alternative as desired. This
completes our proof.


5. Algebras and their norm forms

The proof of our principal theorem requires that we exhibit algebras $\mathfrak{A}$ whose
norm forms $xx^J$ are quadratic forms $xAx'$ with $A + A'$ non-singular. We shall
show later that such algebras necessarily have orders 1, 2, 4, 8. Let us then
exhibit algebras of the kind desired and of these orders.

The algebra of order one is $\mathfrak{F}$ itself. Any algebra $\mathfrak{A} = (1, u)$ with $u^2 =
\beta u + \gamma$ and $\beta$ and $\gamma$ in $\mathfrak{F}$ has the desired properties and is trivially shown to be
an associative algebra. Then if $\beta = 0$ and $\mathfrak{F}$ has characteristic two we have
$u^2 = \gamma$, $(u^J)^2 = \gamma$, $u^J = u$ for any involution $J$. But then $xx^J = xx =
x_1^2 + x_2^2\gamma = xAx'$ with $A + A' = 0$, contrary to hypothesis. If $\beta = 0$ and $\mathfrak{F}$
does not have characteristic two we have $(u + 1)^2 = u^2 + 2u + 1 =
2(u + 1) + \gamma - 1$ and may take $\beta \neq 0$. Define $k = \beta^{-1}u$ and have $k^2 = k + \alpha$
for $\alpha$ in $\mathfrak{F}$. Thus the algebra

(47) $\mathfrak{L} = (1, k), \quad k^2 = k + \alpha$,

is the only algebra of order two which we need to consider. It is associative,
has the automorphism $T$ defined by

(48) $x = x_1 + x_2 k, \quad x^T = x_1 + x_2(1 - k)$,

and $T$ is an involution of $\mathfrak{L}$. It follows that

(49) $x + x^T = 2x_1 + x_2, \quad xx^T = x_1^2 + x_1x_2 - \alpha x_2^2$.

The quaternion algebras may be defined as algebras $\mathfrak{A}$ of matrices

(50) $x = \begin{pmatrix} a & b \\ b^T\beta & a^T \end{pmatrix}$

with $\beta \neq 0$ in $\mathfrak{F}$, $a = x_1 + x_2k$, $b = x_3 + x_4k$, and $k$ and $T$ defined for the algebra





given by (47), (48). Then $\mathfrak{A}$ has a basis $1, u, v, uv$ over $\mathfrak{F}$ where the unity
quantity 1 of $\mathfrak{A}$ is really the two rowed identity matrix, and

(51) $u = \begin{pmatrix} k & 0 \\ 0 & 1 - k \end{pmatrix}, \quad v = \begin{pmatrix} 0 & 1 \\ \beta & 0 \end{pmatrix}$.

Then $vu = (1 - u)v$, $u^2 = u + \alpha$, $v^2 = \beta \neq 0$. Conversely every algebra with
such a multiplication table is a simple associative algebra of degree two over its
center$^{13}$ $\mathfrak{F}$, and is representable as the set of matrices (50).

Every involution$^{14}$ of a two-rowed total matric algebra is a correspondence

(52) $x \to x^J = ex'e^{-1}$,

for a non-singular symmetric or alternate matrix $e$. We take


(53) 



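and see that

(54) $x^J = \begin{pmatrix} a^T & -b \\ -b^T\beta & a \end{pmatrix}$.

Then $x^J$ is in $\mathfrak{A}$ and (54) defines an involution over $\mathfrak{F}$ of $\mathfrak{A}$. Moreover if $a =
x_1 + x_2k$ we have

(55) $x + x^J = a + a^T = 2x_1 + x_2$

in $\mathfrak{F}$,

(56) $f(x) = xx^J = \begin{pmatrix} a & b \\ b^T\beta & a^T \end{pmatrix}\begin{pmatrix} a^T & -b \\ -b^T\beta & a \end{pmatrix} = aa^T - \beta bb^T = x^J x$.

Thus $f(x) = x_1^2 + x_1x_2 - \alpha x_2^2 - (x_3^2 + x_3x_4 - x_4^2\alpha)\beta$ is the determinant of $x$
and is a multiplicative function, that is, $f(x)f(y) = f(z)$ where $z = xy$.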

Forms in $x_1, \ldots, x_8$ have been connected$^{15}$ with certain non-associative alternative
algebras. We shall construct these algebras over fields of arbitrary
characteristic by the use of L. E. Dickson's formulation of the non-associative
algebra of Cayley. Let $\mathfrak{A}$ be an algebra of order $n$ over $\mathfrak{F}$ and let $\mathfrak{A}$ have a
unity quantity 1 which we shall identify with the unity quantity of $\mathfrak{F}$. Assume
that $\mathfrak{A}$ has an involution $J$ over $\mathfrak{F}$ such that $x + x^J$ and $xx^J = x^J x$ are in $\mathfrak{F}$.
Then every $x$ of $\mathfrak{A}$ is a root of an equation

$\phi(\lambda) = (\lambda - x)(\lambda - x^J) = \lambda^2 - (x + x^J)\lambda + xx^J = 0$

with coefficients $t(x) = x + x^J$ and $f(x) = xx^J$ in $\mathfrak{F}$. Hence $\mathfrak{A}$ has degree two
over $\mathfrak{F}$ and $\phi(\lambda)$ is the minimum function of every $x$ of $\mathfrak{A}$ which is not in $\mathfrak{F}$.


13 We use center instead of centrum.

14 See Chapter V of my Modern Higher Algebra.

15 See the paper referred to in footnote 3.





Every $x$ in $\mathfrak{A}$ defines a subalgebra $\mathfrak{F}[x]$ of all polynomials in $x$ and the unity
quantity of $\mathfrak{A}$ with coefficients in $\mathfrak{F}$. Clearly if $x$ is not in $\mathfrak{F}$ then $\mathfrak{F}[x] = (1, x)$
over $\mathfrak{F}$ and $\mathfrak{F}[x]$ is an associative subalgebra of $\mathfrak{A}$. Since $x^J = t(x) - x$ we see
that the involution $J$ of $\mathfrak{A}$ induces an automorphism in every subalgebra $\mathfrak{F}[x]$
of $\mathfrak{A}$ which is the identity if and only if either $x$ is in $\mathfrak{F}$ or $t(x) = 0$ and $\mathfrak{F}$ has
characteristic two. Observe now that the function $f(x) = xx^J$ is a quadratic
form in the coordinates of $x$ with respect to any basis of $\mathfrak{A}$ and that every $x \neq 0$
of $\mathfrak{A}$ has an inverse in $\mathfrak{A}$ and actually in $\mathfrak{F}[x]$ if and only if $f(x)$ is not a null form.

We shall now construct an algebra $\mathfrak{B}$ of order $2n$ over $\mathfrak{F}$ with the properties
described above by the use of any such algebra $\mathfrak{A}$ of order $n$ over $\mathfrak{F}$. The
algebra $\mathfrak{B}$ will have quantities which are pairs of quantities of $\mathfrak{A}$ and it is customary
to indicate this by the statement that these quantities are uniquely
expressible in the form

(57) $x = g_1 + g_2w$

for $g_1$ and $g_2$ in $\mathfrak{A}$. Let $\gamma \neq 0$ be in $\mathfrak{F}$, $J$ be the given involution of $\mathfrak{A}$ and define
multiplication$^{16}$ in $\mathfrak{B}$ by

(58) $xy = (g_1 + g_2w)(h_1 + h_2w) = (g_1h_1 + \gamma h_2^J g_2) + (h_2g_1 + g_2h_1^J)w$.

Extend the definition of $J$ so that $J$ becomes a linear transformation of $\mathfrak{B}$ by
means of

(59) $x^J = g_1^J - g_2w$.

It is clear that $J$ is now a one-to-one linear transformation of $\mathfrak{B}$ such that $J^2$
is the identity transformation.

To form the product $xx^J$ we replace $h_2$ by $-g_2$, $h_1$ by $g_1^J$ in (58) and have
$h_2g_1 + g_2h_1^J = -g_2g_1 + g_2g_1 = 0$. Then

(60) $xx^J = g_1g_1^J - \gamma g_2g_2^J = x^J x$

since the interchange of $g_1$ with $g_1^J$, $g_2$ with $-g_2$ does not alter $g_1g_1^J - \gamma g_2g_2^J$. Also

(61) $x + x^J = g_1 + g_1^J$

is in $\mathfrak{F}$, each $x$ of $\mathfrak{B}$ is a root of $\lambda^2 - (x + x^J)\lambda + xx^J = 0$, and $\mathfrak{B}$ has degree
two over $\mathfrak{F}$. Finally

(62) $(xy)^J = (h_1^J g_1^J + \gamma g_2^J h_2) - (h_2g_1 + g_2h_1^J)w$

and

(63) $y^J x^J = (h_1^J - h_2w)(g_1^J - g_2w) = (h_1^J g_1^J + \gamma g_2^J h_2) - (g_2h_1^J + h_2g_1)w$,

16 This is clearly a generalization of the L. E. Dickson formulation of the definition of
the Cayley algebra of order eight.





so that $(xy)^J = y^J x^J$ and $J$ is an involution of $\mathfrak{B}$. We observe that (58) may be
given more simply by the use of the distributive law and the relations

(64) $g(hw) = (hg)w, \quad (gw)h = (gh^J)w, \quad (gw)(hw) = \gamma h^J g$,

for every $g$ and $h$ of $\mathfrak{A}$. Clearly if $\mathfrak{A}$ is not a commutative algebra the algebra $\mathfrak{B}$
is not associative.


To prove that $\mathfrak{B}$ is alternative we use (57) to compute

(65) $x(xy) = [g_1(g_1h_1 + \gamma h_2^J g_2) + \gamma(g_1^J h_2^J + h_1g_2^J)g_2] + [(h_2g_1 + g_2h_1^J)g_1 + g_2(h_1^J g_1^J + \gamma g_2^J h_2)]w$

as well as $x^2 = g_1^2 + \gamma g_2^J g_2 + [g_2(g_1 + g_1^J)]w$ and

(66) $x^2y = \{(g_1^2 + \gamma g_2^J g_2)h_1 + \gamma h_2^J[g_2(g_1 + g_1^J)]\} + \{h_2(g_1^2 + \gamma g_2^J g_2) + [g_2(g_1 + g_1^J)]h_1^J\}w$.

If $\mathfrak{A}$ is an alternative algebra we have $g_1(g_1h_1) + \gamma(h_1g_2^J)g_2 = g_1^2h_1 + \gamma(g_2^J g_2)h_1$.
However $g_1 + g_1^J$ is in $\mathfrak{F}$ and $h_2^J[g_2(g_1 + g_1^J)] = (g_1 + g_1^J)(h_2^J g_2) = g_1(h_2^J g_2) + (g_1^J h_2^J)g_2$
if and only if $g_1^J(h_2^J g_2) = (g_1^J h_2^J)g_2$, that is, $\mathfrak{A}$ is an associative algebra. Conversely
if $\mathfrak{A}$ is associative we have also $(h_2g_1)g_1 + \gamma g_2(g_2^J h_2) = h_2(g_1^2 + \gamma g_2^J g_2)$,
$(g_2h_1^J)g_1 + g_2(h_1^J g_1^J) = g_2(g_1 + g_1^J)h_1^J$, and $x(xy) = x^2y$ for every $x$ and $y$ of $\mathfrak{B}$.

It follows that $x^J(x^J y^J) = (x^J x^J)y^J$ and, by operating with $J$, that $(yx)x =
y(xx)$ for every $x$ and $y$ of $\mathfrak{B}$. Hence $\mathfrak{B}$ is alternative. This completes our proof
of the result that $\mathfrak{B}$ is an alternative algebra if and only if $\mathfrak{A}$ is associative.

We now see that if $\mathfrak{A}$ is the associative algebra of order four defined by the
matrices of (50) the algebra $\mathfrak{B}$ is an alternative algebra of order eight over $\mathfrak{F}$.
Its norm form $f(x) = xx^J = x^J x$ is given explicitly by

(67) $f(x) = x_1^2 + x_1x_2 - x_2^2\alpha - (x_3^2 + x_3x_4 - x_4^2\alpha)\beta - (x_5^2 + x_5x_6 - x_6^2\alpha)\gamma + (x_7^2 + x_7x_8 - x_8^2\alpha)\beta\gamma$.

Note that $f(x)$ becomes the norm form of the algebra $\mathfrak{A}$ of (50) by putting
$x_5 = x_6 = x_7 = x_8 = 0$, and that of (47) by taking also $x_3 = x_4 = 0$. The
values $\alpha = \beta = \gamma = -1$ then give us the forms $x_1^2 + x_2^2 + x_1x_2$, $x_1^2 + \cdots + x_4^2 +
x_1x_2 + x_3x_4$, $x_1^2 + \cdots + x_8^2 + x_1x_2 + x_3x_4 + x_5x_6 + x_7x_8$.
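
The doubling process (57) to (59) and the explicit form (67) can be checked numerically. The following Python sketch is only an illustration under stated assumptions: the nested-tuple representation, the function names (`mul`, `conj`, `norm`, `form67`, `from_coords`) and the use of integer coordinates are choices made here, not notation from the text.

```python
def is_base(x):
    """An element x1 + x2*k of the order-two algebra (47) is a pair of ints."""
    return isinstance(x[0], int)

def add(x, y):
    if is_base(x):
        return (x[0] + y[0], x[1] + y[1])
    return (add(x[0], y[0]), add(x[1], y[1]))

def scale(c, x):
    if is_base(x):
        return (c * x[0], c * x[1])
    return (scale(c, x[0]), scale(c, x[1]))

def conj(x):
    if is_base(x):
        # (48): (x1 + x2 k)^T = x1 + x2 (1 - k)
        return (x[0] + x[1], -x[1])
    # (59): (g1 + g2 w)^J = g1^J - g2 w
    return (conj(x[0]), scale(-1, x[1]))

def mul(x, y, alpha, gammas):
    """Product of x and y; gammas lists the doubling constants, innermost first."""
    if is_base(x):
        # (47): k^2 = k + alpha
        return (x[0] * y[0] + alpha * x[1] * y[1],
                x[0] * y[1] + x[1] * y[0] + x[1] * y[1])
    (g1, g2), (h1, h2) = x, y
    c, rest = gammas[-1], gammas[:-1]
    # (58): (g1 h1 + c h2^J g2) + (h2 g1 + g2 h1^J) w
    first = add(mul(g1, h1, alpha, rest),
                scale(c, mul(conj(h2), g2, alpha, rest)))
    second = add(mul(h2, g1, alpha, rest),
                 mul(g2, conj(h1), alpha, rest))
    return (first, second)

def norm(x, alpha, gammas):
    """f(x) = x x^J, read off as the scalar coordinate of the product."""
    p = mul(x, conj(x), alpha, gammas)
    while not isinstance(p, int):
        p = p[0]
    return p

def form67(c, alpha, beta, gamma):
    """The explicit form (67) in the coordinates c = (x1, ..., x8)."""
    def q(s, t):
        return s * s + s * t - alpha * t * t
    return (q(c[0], c[1]) - beta * q(c[2], c[3])
            - gamma * q(c[4], c[5]) + beta * gamma * q(c[6], c[7]))

def from_coords(c):
    """Nest eight integers as (a + b v) + (c + d v) w with a = x1 + x2 k, etc."""
    return (((c[0], c[1]), (c[2], c[3])), ((c[4], c[5]), (c[6], c[7])))
```

With `gammas = [beta, gamma]` and any integer sample vectors, one would expect `norm(x, ...)` to agree with `form67`, and `norm(x) * norm(y)` to equal `norm(mul(x, y))`, which is the composition property of Lemma 3 for this algebra of order eight.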

Observe that the algebra $\mathfrak{B}$ of order eight defined above is a division algebra
if and only if its associative subalgebra $\mathfrak{A}$ is a division algebra and $f(x)$ is not a
null form. For if $xy = 0$ then $x^J(xy) = f(x)y = 0$ and if $x \neq 0$ then $f(x) \neq 0$,
$y = 0$. The condition that $xx^J = 0$ if and only if $x = 0$ is satisfied when $\gamma$ is
chosen so that $\gamma$ is not the norm $gg^J$ of any $g$ of $\mathfrak{A}$.

It is already known that in $\mathfrak{F}$ of characteristic not two a quadratic form
$f(x) = xAx'$ with $A = A'$ a non-singular $n$-rowed matrix permits composition
only when $n = 1, 2, 4, 8$. Thus it remains to show the corresponding result for




$A + A'$ non-singular and $\mathfrak{F}$ of characteristic two, as well as to study other
possible cases of composition for this characteristic. Let us thus assume henceforth
that the characteristic of $\mathfrak{F}$ is two.

6. Determination of possible orders in the non-singular case 

Let $\mathfrak{A}$ be any simple algebra of degree $n$ over its center $\mathfrak{F}$ and let $I$ be the
unity quantity of $\mathfrak{A}$. Assume the existence of quantities $U$ and $V$ in $\mathfrak{A}$ such that

(68) $U^2 = \beta I, \quad V^2 = \gamma I, \quad W = UV = I - VU$,

for $\beta \neq 0$ and $\gamma$ in $\mathfrak{F}$. Then $W^2 = U(I - UV)V = W - \beta\gamma I$, $UW + WU =
\beta V + U(I - UV) = U$. It follows that

(69) $W^2 = W - \beta\gamma I, \quad UW = (I - W)U$.
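
As a small worked check of (68) and (69), the sketch below uses two-rowed integer matrices. The particular matrices `U`, `V` and the values `beta = 1`, `gamma = -2` are hypothetical samples chosen here so that $UV + VU = I$; the helper names `matmul` and `lin` are likewise illustrative.

```python
def matmul(a, b):
    """Product of two 2x2 integer matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def lin(c, a, d, b):
    """Entrywise linear combination c*a + d*b of two 2x2 matrices."""
    return [[c * a[i][j] + d * b[i][j] for j in range(2)] for i in range(2)]

I = [[1, 0], [0, 1]]
beta, gamma = 1, -2        # sample scalars (an assumption of this sketch)
U = [[0, 1], [1, 0]]       # U^2 = beta * I
V = [[0, 2], [-1, 0]]      # V^2 = gamma * I
W = matmul(U, V)           # W = UV, which should also equal I - VU
```

For these samples `matmul(W, W)` should coincide with `lin(1, W, -beta * gamma, I)`, and `matmul(U, W)` with `matmul(lin(1, I, -1, W), U)`, matching the two relations in (69).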

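But then $\mathfrak{A}$ contains a generalized quaternion algebra $\mathfrak{B} = (I, W, U, WU) =
(I, U, V, UV)$. This algebra of order four over $\mathfrak{F}$ is a simple algebra of degree
two over its center $\mathfrak{F}$. But then it is well known$^{17}$ that $\mathfrak{A}$ is the direct product

(70) $\mathfrak{A} = \mathfrak{B} \times \mathfrak{A}_1, \quad \mathfrak{A}_1 = \mathfrak{A}^{\mathfrak{B}}$,

where we write $\mathfrak{A}^{\mathfrak{B}}$ for the set of all quantities of $\mathfrak{A}$ commutative with both $U$
and $V$. Moreover $n = 2n_1$, $\mathfrak{A}_1$ is a simple algebra of degree $n_1$ over its center $\mathfrak{F}$.
Assume now that $\mathfrak{A}$ contains also $U_2$, $V_2$ such that $U_2^2 = \beta_2 I$, $V_2^2 = \gamma_2 I$,
$U_2V_2 + V_2U_2 = I$, where $\beta_2 \neq 0$ and $\gamma_2$ are in $\mathfrak{F}$. Then if $UU_2 - U_2U =
UV_2 - V_2U = VU_2 - U_2V = VV_2 - V_2V = 0$, the quantities $U_2$ and $V_2$ are
in $\mathfrak{A}^{\mathfrak{B}}$. But then $\mathfrak{A}_1$ contains $\mathfrak{B}_2 = (I, U_2, V_2, U_2V_2)$, and $\mathfrak{A}_1 = \mathfrak{B}_2 \times \mathfrak{A}_2$
where $n_1 = 2n_2$, $\mathfrak{A}_2 = \mathfrak{A}_1^{\mathfrak{B}_2}$ is simple of degree $n_2$ over its center $\mathfrak{F}$. We now
let $s \geq 1$ and suppose that $\mathfrak{A}$ contains $s - 1$ pairs of quantities $U_j$, $V_j$ such that

(71) $U_j^2 = \beta_j I, \quad V_j^2 = \gamma_j I, \quad U_jV_j + V_jU_j = I \quad (j = 1, \ldots, s - 1)$,

for $\beta_j \neq 0$ and $\gamma_j$ in $\mathfrak{F}$. Assume also that

(72) $U_jU_k - U_kU_j = U_jV_k - V_kU_j = V_jV_k - V_kV_j = 0 \quad (j \neq k;\ j, k = 1, \ldots, s - 1)$.

Then $\mathfrak{A} = \mathfrak{B}_1 \times \cdots \times \mathfrak{B}_{s-1} \times \mathfrak{A}_s$, and we have proved that $2^{s-1}$ divides $n$.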

Let us now apply these results to the study of (1) in the case where $f(x) =
xAx'$, $E = A + A'$ is non-singular, and we have (12) for $n = 2r$, $\alpha_1 = 1$. We
assume that $\mathfrak{F}$ has characteristic two and that

(73) $g(y) = y_1^2 + \beta_2y_2^2 + \cdots + \beta_my_m^2 + (y_1y_{s+1} + \cdots + y_sy_{2s})$

where $m \geq 2s$, the $\beta_i$ are in $\mathfrak{F}$, and $\beta_2, \ldots, \beta_s$ are all not zero. By (17) there
exist matrices $G_1 = I, G_2, \ldots, G_m$ such that

(74) $G_iG_i^J = \beta_i, \quad G_iG_k^J + G_kG_i^J = \delta_{ik}I \quad (i \neq k;\ i, k = 1, \ldots, m)$,


17 See my Structure of Algebras , pp. 52-56. 





where now $\delta_{ik} = 0$ unless $0 < i \leq 2s$, $0 < k \leq 2s$ and $i - k$ is $s$ or $-s$.
In the remaining cases $\delta_{ik} = 1$. Define $U_{j-1} = G_j$, $V_{j-1} = G_{j+s}$ for $j = 2, \ldots, s$.
Then by (74) for $i = 1$ we have $G_k + G_k^J = 0$ for $k = 2, \ldots, s$ and thus $G_k^J = G_k$.
Hence $U_j^2 = \beta_{j+1}$, $V_j^2 = \beta_{j+1+s}$ for $j = 1, \ldots, s - 1$ where $\beta_{j+1} \neq 0$. Also by
(74) for $i = j + 1$ and $k = i + s$ we have $U_jV_j + V_jU_j = I$ and thus (71).
By these same equations for $k - i \neq s$ and $i = j + 1$ or $j + 1 + s$ respectively
we obtain (72). We let $\mathfrak{A}$ be the algebra of all $n$-rowed square matrices with
elements in $\mathfrak{F}$ and it follows as above that $2^{s-1}$ divides $n$.

Note that we have not utilized all consequences of (74) and that the conditions
omitted as well as other results would have to be used in order to complete the
study of (1). However we are using this study of (1) only as a tool in our discussion
of quadratic forms permitting composition and so shall leave the more
general considerations for later study.

Let us then take $m = 2s = n = 2r$. The results just derived imply that
$2^{r-1}$ divides $2r$. This is not the case for $r = 3$, $r > 4$, it is the case for $r = 1, 2, 4$,
and thus $n = 2, 4, 8$. We have shown that in the non-singular case over $\mathfrak{F}$ of
characteristic two a form $f(x) = f(x_1, \ldots, x_n)$ permits composition only if
$n = 2, 4, 8$. We combine this with the known results$^{18}$ for $\mathfrak{F}$ of characteristic
not two to see that in the non-singular case the only possible values of $n$ are
1, 2, 4, 8.
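
The arithmetic behind this conclusion is easy to tabulate. The short Python sketch below (the function name `admissible_r` and the search bound are illustrative choices) lists the values of $r$ for which $2^{r-1}$ divides $2r$.

```python
def admissible_r(limit):
    """Return all r in 1..limit such that 2**(r - 1) divides 2 * r."""
    return [r for r in range(1, limit + 1) if (2 * r) % (2 ** (r - 1)) == 0]

def admissible_n(limit):
    """Corresponding orders n = 2r."""
    return [2 * r for r in admissible_r(limit)]
```

For $r \geq 5$ one has $2^{r-1} > 2r$, so no larger value can occur and the search bound plays no role beyond $r = 4$.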


7. The singular case with $A + A' \neq 0$


Let $\mathfrak{F}$ be a field of characteristic two and $f(x) = xAx'$ such that
$E = A + A' \neq 0$ is singular. Then $f(x)$ has the form (12) for $n > 2r > 0$ and
we have seen that we may take $\alpha_1, \ldots, \alpha_r$ and $\alpha_{2r+1}, \ldots, \alpha_n$ all not zero.
Thus


(75) 



= 



18 The method we have used does not require that $\mathfrak{F}$ be algebraically closed and is
applicable to obtain a much simpler derivation than that in the literature of the analogous
results for $\mathfrak{F}$ of characteristic not two. We take $f(x) = \alpha_1x_1^2 + \cdots + \alpha_nx_n^2$, $g(y) =
\beta_1y_1^2 + \cdots + \beta_my_m^2$ for the $\alpha_i$ and $\beta_j$ all not zero and in $\mathfrak{F}$. Then we have seen that we may
take $\alpha_1 = \beta_1 = 1$ and have $G_iG_i^J = \beta_iI$, $G_iG_j^J + G_jG_i^J = 0$ for $i \neq j$ and $i, j = 1, \ldots, m$.
Thus we may take $G_1 = I$ and obtain $G_i^J = -G_i$, $G_i^2 = -\beta_iI$, $G_iG_j = -G_jG_i$ for $i \neq j$ and
$i, j = 2, \ldots, m$. We define $\mu$ to be the greatest integer in $\frac{1}{3}m$ and may prove that $2^\mu$ divides
$n$. For $\mathfrak{B}_1 = (I, G_2, G_3, G_2G_3)$ is a generalized quaternion subalgebra of the total matric
algebra $\mathfrak{A}$ of degree $n$ and $\mathfrak{A} = \mathfrak{B}_1 \times \mathfrak{A}_2$ where $\mathfrak{A}_2$ is simple of degree $n_2$ over its center $\mathfrak{F}$,
$n = 2n_2$. If $\mu > 1$ we have $m \geq 6$ and define $G_4G_k = H_{k-3}$ for $k = 5, \ldots, m$ and see that
$G_2H_j - H_jG_2 = G_3H_j - H_jG_3 = 0$ for $j = 2, \ldots, m - 3$. Also $H_j^2 = -\beta_4\beta_{j+3}I$, $H_iH_j =
-H_jH_i$ for $i \neq j$ and $i, j = 2, \ldots, m - 3$. But then the $H_i$ are in $\mathfrak{A}_2$ and we have the
same situation as above with $m$ replaced by $m - 3$, $n$ by $n_2$, $\mathfrak{A}$ by $\mathfrak{A}_2$. It follows that 2
divides $n_2$, $2^2$ divides $n$. After $\mu$ such steps we obtain the result desired. Moreover if
$n = m$ we have shown that $2^\mu$ divides $n = 3\mu$, $3\mu + 1$, or $3\mu + 2$. However if $\mu > 3$ we
have $2^\mu > 3\mu + 2$, if $\mu = 3$ then $2^\mu = 8$ does not divide 9, 10, 11, and if $\mu = 2$ then $2^\mu = 4$
only divides $3\mu + 2 = 8$. If $\mu = 1$ then $2^\mu = 2$ only divides $3\mu + 1 = 4$ and if $\mu = 0$ then
$n = 1, 2$.





where $D_1$ is the non-singular $r$-rowed diagonal matrix with diagonal elements
$\alpha_1 = 1, \alpha_2, \ldots, \alpha_r$, $D_2$ is the $r$-rowed diagonal matrix with $\alpha_{r+1}, \ldots, \alpha_{2r}$ as
diagonal elements, $D$ is the $(n - 2r)$-rowed non-singular diagonal matrix with
diagonal elements $\alpha_{2r+1}, \ldots, \alpha_n$. We now prove the

Lemma 5. The equation (1) is possible only if there exist linear forms $u_{2r+1},
\ldots, u_n$ in $y_1, \ldots, y_m$ such that

(76) $g(y) = \alpha_{2r+1}^{-1}(\alpha_{2r+1}u_{2r+1}^2 + \cdots + \alpha_nu_n^2)$.

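For proof we observe first that $g(y)[A + A'] = G_y(A + A')G_y'$. Write

(77) $G_y = \begin{pmatrix} G & H \\ K & L \end{pmatrix}$

where $G$ is a $2r$-rowed square matrix and the number of rows and columns in
$H$, $K$, $L$ are thereby determined. Then

(78) $A + A' = \begin{pmatrix} C & 0 \\ 0 & 0 \end{pmatrix}$.

But then $C$ is non-singular and

(79) $\begin{pmatrix} g(y)C & 0 \\ 0 & 0 \end{pmatrix} = \begin{pmatrix} G & H \\ K & L \end{pmatrix}\begin{pmatrix} C & 0 \\ 0 & 0 \end{pmatrix}\begin{pmatrix} G' & K' \\ H' & L' \end{pmatrix} = \begin{pmatrix} GCG' & GCK' \\ KCG' & KCK' \end{pmatrix}, \quad K = 0$.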

Also 

11' 

\( A ' G ' 


L, 

/\DH ' 

DUJ 


■ 


(80) Gy A (Gy)' = Gy 


But the diagonal elements of $g(y)A - G_yA(G_y)'$ are all zero and thus the diagonal
elements of $g(y)D - LDL'$ are zero. Write $L = (u_{ij})$ for $i, j = 1, \ldots, q$,
where $q = n - 2r$ and the $u_{ij}$ are linear forms in $y_1, \ldots, y_m$. Compute the
element in the first row and column of this matrix and obtain

$\alpha_{2r+1}g(y) - (\alpha_{2r+1}u_{11}^2 + \cdots + \alpha_nu_{1q}^2) = 0$.

This completes our proof.

A quadratic form $g(y) = yBy'$ is a diagonal form $\beta_1y_1^2 + \cdots + \beta_my_m^2$ if and
only if $B + B' = 0$. Then every quadratic form equivalent to $g(y)$ is also
diagonal. Now if $u_i = \sum_{j=1}^{m} \lambda_{ij}y_j$ with the $\lambda_{ij}$ in $\mathfrak{F}$ we have $u_i^2 = \lambda_{i1}^2y_1^2 + \cdots +
\lambda_{im}^2y_m^2$. By (76) we have


19 Observe that then we are led to the study of the same type of equation as in the non-singular
case. We show below that $g(y)$ is diagonal and later that there is then an associated
field $\mathfrak{K}$ which is purely inseparable of degree $2^t$ and exponent two over $\mathfrak{F}$. But then
the set of all $2r$-rowed square matrices contains a subfield generated by the matrices
equivalent to $\mathfrak{K}$ and thus $2^t$ divides $2r$.





Lemma 6. The equation (1) is possible for $n > 2r$ only if $g(y)$ is diagonal.

It follows immediately that if $n > 2r$ for a quadratic form $f(x)$ with $r > 0$
it cannot permit composition.

8. Diagonal quadratic forms

The theory of (1) for $f(x)$ diagonal and $\mathfrak{F}$ of characteristic two is very simple
and the results obtainable with little difficulty. We write $f(x)$ as in (11) with
the $\alpha_i$ all not zero. Define $\mathfrak{K}(f)$ to be the purely inseparable field $\mathfrak{F}(u_1, \ldots, u_n)$,
$u_i^2 = \alpha_i$, so that $\mathfrak{K}(f)$ has degree $2^t$ and exponent one or two over $\mathfrak{F}$, $n \geq t$.
Define

(81) $u(x) = x_1u_1 + \cdots + x_nu_n$,

and let $\mathfrak{L}(f)$ be the set of all values $u(\xi)$ for $\xi = (\xi_1, \ldots, \xi_n)$ and the $\xi_i$ in $\mathfrak{F}$.
Clearly $\mathfrak{L}(f)$ is a linear subspace of order $\nu \leq n$, $\nu \leq 2^t$, of $\mathfrak{K}(f)$. Moreover
$f(x) = [u(x)]^2$.

By permuting the indeterminates $x_1, \ldots, x_n$ if necessary we may assume that
$u_1, \ldots, u_\nu$ are linearly independent in $\mathfrak{F}$ and that $u_k = \sum_{j=1}^{\nu} \lambda_{kj}u_j$ for $k = \nu + 1,
\ldots, n$. Then the linear transformation

(82) $x_j = w_j - \sum_{k=\nu+1}^{n} \lambda_{kj}w_k, \quad x_k = w_k \quad (j = 1, \ldots, \nu;\ k = \nu + 1, \ldots, n)$

is non-singular and carries $f(x)$ into $\alpha_1w_1^2 + \cdots + \alpha_\nu w_\nu^2 + \sum_{k=\nu+1}^{n} \left(\sum_{j=1}^{\nu} \lambda_{kj}^2\alpha_j + \alpha_k\right)w_k^2$.

But $u_k^2 = \alpha_k = \sum_{j=1}^{\nu} \lambda_{kj}^2\alpha_j$. Hence (82) carries $f(x)$ into

(83) $\alpha_1w_1^2 + \cdots + \alpha_\nu w_\nu^2$.

We have thus proved

Lemma 7. Let $f(x)$ be a diagonal quadratic form and $\mathfrak{L}(f)$ have order $\nu$ over $\mathfrak{F}$.
Then $f(x)$ is equivalent to a form $\alpha_1x_1^2 + \cdots + \alpha_\nu x_\nu^2$.

Note that $\nu$ is an invariant of $f(x)$ and that $f(x)$ is not equivalent to a diagonal
form in fewer than $\nu$ variables. We now prove

Lemma 8. Let $f(x) = \alpha_1x_1^2 + \cdots + \alpha_nx_n^2$, $h(x) = \gamma_1x_1^2 + \cdots + \gamma_mx_m^2$ such
that $n$ and $m$ are the respective orders of $\mathfrak{L}(f)$, $\mathfrak{L}(h)$. Then there exist linear forms
$w_1, \ldots, w_n$ in $x_1, \ldots, x_m$ such that $h(x) = f(w)$ if and only if $m \leq n$, $\mathfrak{L}(h)$ is
contained in $\mathfrak{L}(f)$. Also then $f(x)$ and $h(x)$ are equivalent if and only if $\mathfrak{L}(h) = \mathfrak{L}(f)$.

For we may construct a field $\mathfrak{F}(u_1, \ldots, u_n, v_1, \ldots, v_m) = \mathfrak{W}$ such that
$u_i^2 = \alpha_i$, $v_j^2 = \gamma_j$. Then $\mathfrak{W}$ contains both $\mathfrak{L}(f)$ and $\mathfrak{L}(h)$, $f(x) = [u(x)]^2$, $h(x) =
[v(x)]^2$, where $u(x)$ is defined by (81) and $v(x) = v_1x_1 + \cdots + v_mx_m$. Also
$h(x) = f(w)$ implies that $v(x) = u(w)$, every quantity of $\mathfrak{L}(h)$ is in $\mathfrak{L}(f)$, $\mathfrak{L}(f)$
contains $\mathfrak{L}(h)$. Conversely if $\mathfrak{L}(f)$ contains $\mathfrak{L}(h)$ we may take $v_1, \ldots, v_m$ as part
of a basis of $\mathfrak{L}(f)$ and then replace $f(x)$ by an equivalent form with $\alpha_i = \gamma_i$ for
$i = 1, \ldots, m$. Then $h(x) = f(w)$ where $w_i = x_i$ for $i = 1, \ldots, m$ and $w_k = 0$
for $k = m + 1, \ldots, n$. The last part of our lemma is a trivial consequence of
what we have already proved.





The result above implies that we may take $n - 2r$ to be the order of
the matrix of $\alpha_{2r+1}x_{2r+1}^2 + \cdots + \alpha_nx_n^2$ in Lemma 5 and obtain $m \leq n - 2r$ in
that lemma.

We now prove

Lemma 9. Let $f(x)$ be a diagonal form $x_1^2 + \alpha_2x_2^2 + \cdots + \alpha_nx_n^2$ where $n$ is
the order of $\mathfrak{L}(f)$. Then $f(x)g(y) = f(z)$ as in (1), (2), if and only if $g(y)$ is equivalent
to a diagonal quadratic form $\beta_1y_1^2 + \cdots + \beta_my_m^2$ such that $\mathfrak{L}(f)$ contains
$\mathfrak{K}(g)$. Moreover $n$ is divisible by the order $2^t$ of $\mathfrak{K}(g)$ over $\mathfrak{F}$.

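The result above states that $\mathfrak{L}(f)$ is a linear space of order $\nu$ over $\mathfrak{K}(g)$, $n = 2^t\nu$,

(84) $\mathfrak{L}(f) = \mathfrak{K}(g) + \mathfrak{K}(g)d_2 + \cdots + \mathfrak{K}(g)d_\nu$.

Then we may take a basis of $\mathfrak{K}(g)$ of order $q = 2^t$ over $\mathfrak{F}$ to be a set of quantities
$v_1, \ldots, v_q$ such that $v_i^2 = \beta_i$ in $\mathfrak{F}$ and also take $d_j^2 = \gamma_j$ for $j = 2, \ldots, \nu$. Then
we may take $g(y) = \beta_1y_1^2 + \cdots + \beta_my_m^2$ and

$f(x) = \sum_{i=1}^{q} \beta_ix_i^2 + \sum_{i=1}^{q} \beta_i\gamma_2x_{q+i}^2 + \cdots + \sum_{i=1}^{q} \beta_i\gamma_\nu x_{(\nu-1)q+i}^2$.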

To prove this result we note that $f(e) = 1$ and thus $g(y) = f(w)$ for
$w = (w_1, \ldots, w_n)$ and the $w_i$ linear forms in $y_1, \ldots, y_m$. Thus $g(y)$ is a diagonal
form. Then $g(y)$ is a diagonal form such that $\mathfrak{L}(g)$ is contained in $\mathfrak{L}(f)$.
But then $g(y) = [v(y)]^2$ for $v(y) = v_1y_1 + \cdots + v_my_m$ and $v_i^2 = \beta_i$ where the
$v_i$ are a basis of $\mathfrak{L}(g)$. If $a$ and $b$ are in $\mathfrak{L}(g)$ the quantity $a$ is in $\mathfrak{L}(f)$. However
$[u(x)]^2[v(y)]^2 = [u(z)]^2$, $u(x)v(y) = u(z)$ and $ab$ is in $\mathfrak{L}(f)$. It follows that the product
of any two quantities of $\mathfrak{L}(g)$ is in $\mathfrak{L}(f)$, and $\mathfrak{L}(f)$ contains the ring of all polynomials
in the quantities of $\mathfrak{L}(g)$. This is the field $\mathfrak{K}(g)$. Conversely if $\mathfrak{L}(f)$
contains $\mathfrak{K}(g)$ we know that $\mathfrak{L}(f) = [\mathfrak{K}(g)]\mathfrak{B}$ where $\mathfrak{B} = (1, d_2, \ldots, d_\nu)$ over
$\mathfrak{F}$ and $1, d_2, \ldots, d_\nu$ are a basis of $\mathfrak{L}(f)$ over $\mathfrak{K}(g)$. Then $\mathfrak{L}(g)\mathfrak{K}(g)$ is in $\mathfrak{K}(g)$,
$\mathfrak{L}(g)\mathfrak{L}(f)$ is contained in $\mathfrak{L}(f)$. However the equation $g(x)f(y) = f(z)$ is precisely
equivalent to the statement that $\mathfrak{L}(g)\mathfrak{L}(f)$ is contained in $\mathfrak{L}(f)$.

If $g(y)$ is equivalent to $f(x)$ we may require that $m = n$ is the order of $\mathfrak{L}(f)$.
Then $\alpha_1f(y) = f(w)$ and $f(y) = \alpha_1^{-1}f(w)$. Lemma 8 then states that $f(x)$ is
equivalent to $\alpha_1^{-1}f(x)$. It follows that we may take $\alpha_1 = 1$. Then we may
apply Lemma 9 to obtain the result that $\mathfrak{L}(f)$ contains $\mathfrak{K}(f)$. Then $\mathfrak{L}(f) = \mathfrak{K}(f)$.
Thus a diagonal quadratic form over a field of characteristic two permits composition
if and only if it is the norm form of a purely inseparable field of degree $2^t$ and
exponent two over $\mathfrak{F}$. We have thus completed the proof of our theorem.
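
The simplest instance of this statement can be checked by direct computation. The Python sketch below is an illustration under assumptions made here: the field of characteristic two is taken to contain the polynomial ring GF(2)[t], whose elements are stored as integer bitmasks (bit $i$ is the coefficient of $t^i$), and the form is $f(x) = x_1^2 + t x_2^2$, the norm form $[x_1 + x_2u]^2$ of the purely inseparable field obtained by adjoining $u$ with $u^2 = t$. The names `clmul`, `f` and `compose` are illustrative.

```python
T = 0b10  # the polynomial t in GF(2)[t], stored as a bitmask

def clmul(a, b):
    """Carry-less product: multiplication in GF(2)[t] on bitmask polynomials."""
    result = 0
    while b:
        if b & 1:
            result ^= a      # addition in characteristic two is XOR
        a <<= 1
        b >>= 1
    return result

def f(x):
    """Diagonal form f(x) = x1^2 + t * x2^2 over GF(2)[t]."""
    return clmul(x[0], x[0]) ^ clmul(T, clmul(x[1], x[1]))

def compose(x, y):
    """Coordinates of (x1 + x2 u)(y1 + y2 u) with u^2 = t."""
    z1 = clmul(x[0], y[0]) ^ clmul(T, clmul(x[1], y[1]))
    z2 = clmul(x[0], y[1]) ^ clmul(x[1], y[0])
    return (z1, z2)
```

Since squaring is additive in characteristic two, `clmul(f(x), f(y))` should equal `f(compose(x, y))` for all sample inputs, which is the composition $f(x)f(y) = f(z)$ for this form.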

The University of Chicago



Annals of Mathematics
Vol. 43, No. 1, January, 1942


ON INTEGRAL GEOMETRY IN KLEIN SPACES 

By Shiing-shen Chern 
(Received September 3, 1940) 

The classical results in integral geometry found by Crofton, Poincaré, Cartan,
and recently developed by Blaschke$^{1}$ and his school, are mostly restricted to
Euclidean spaces. It is the object of this paper to give the fundamental concepts
of integral geometry in a general space of Klein, by which we mean a
number space of $n$ dimensions with a transitive $r$-parameter group $G_r$ of transformations.
The discussion is mainly based on Cartan's theory of Lie's groups.$^{2}$

The paper is divided into three sections. In §1 we give a brief summary of 
Cartan’s theory of Lie’s groups with some results concerning the measures of 
geometrical elements. In §2 we define the incidence of the geometrical elements 
of different fields, which plays a fundamental role in all subsequent discussions. 
As applications of the general notions we give in the last section two formulas
which are respectively generalizations of the well-known formulas of Crofton
and Cauchy.


1. Some Fundamental Notions in Klein Spaces 

Let $G_r$ be an abstract $r$-parameter group of Lie$^{3}$ with the parameters $a^1, \ldots, a^r$,
so that $G_r$ denotes an $r$-dimensional space and $a^1, \ldots, a^r$ are the coordinates in
the space. If $S_a$ denotes a point in $G_r$ with the coordinates $a^1, \ldots, a^r$, there are
defined in $G_r$ two simply-transitive groups of transformations

(1) $S_a \to S_cS_a, \quad S_a \to S_aS_c$,

called respectively the first and the second groups of parameters. There exists
one, and only one, set of $r$ linearly independent Pfaffian forms

$\omega^1(a, da), \ldots, \omega^r(a, da)$

invariant under the first group of parameters. They are determined up to a
linear transformation with constant coefficients and have the property that any

1 Cf. W. Blaschke, Vorlesungen über Integralgeometrie, Bd. I Leipzig 1936; Bd. II Leipzig
1937. These books will be cited respectively as I. G. I and I. G. II.

2 Cf. E. Cartan, La théorie des groupes finis et continus et la géométrie différentielle traitées
par la méthode du repère mobile, Paris 1937. This book will be cited as Cartan, Théorie
des groupes.

3 For the definition of an abstract group of Lie we may follow that given by Cartan in his
book “La théorie des groupes finis et continus et l'analysis situs” Mémorial des Sciences
Mathématiques, Fasc. XLII, Paris 1930.







exterior differential form$^{4}$ of degree $p$ invariant under the first group of parameters
is of the form

(2) $\sum A_{i_1 \cdots i_p}[\omega^{i_1} \cdots \omega^{i_p}]$,

where $A_{i_1 \cdots i_p}$ are constants. According to Cartan, we call $\omega^1, \ldots, \omega^r$ the relative
components of $G_r$. They satisfy the equations of structure of Maurer-Cartan

(3) $(\omega^i)' = \sum_{j,k=1}^{r} c^i_{jk}[\omega^j\omega^k], \quad c^i_{jk} + c^i_{kj} = 0, \quad i = 1, \ldots, r$,

where $c^i_{jk}$ are the constants of structure of $G_r$.

Now let $g$ be a subgroup of $G_r$. This subgroup $g$ and its left-hand cosets $Sg$
fill up the whole space $G_r$ and have the property that no two of them can coincide
without being identical. Thus the varieties $Sg$ are the integral varieties of a
completely integrable Pfaffian system. Owing to the invariance of the totality
of the cosets $Sg$ with respect to the first group of parameters, the left-hand
members of the Pfaffian system are linear combinations of $\omega^1, \ldots, \omega^r$ with
constant coefficients. We may suppose the system to be

(4) $\omega^1 = 0, \ldots, \omega^n = 0$.

The theorem of Frobenius then gives

(5) $c^i_{jk} = 0, \quad i = 1, \ldots, n;\ j, k = n + 1, \ldots, r$,

which are the necessary and sufficient conditions for the system (4) to be completely
integrable. We say that the subgroup $g$ defines a field of geometrical
elements such that each element of the field is represented by a left-hand coset
of $g$ or by an integral variety of (4).

We may interpret the geometrical elements defined by (4) as the points of a 
space E. The first group of parameters becomes then a group of transformations 
in E , so that, with some necessary assumptions on the analyticity of the equations 
of the transformations, E is a space in the sense of Klein. It is clear that any 
space of Klein can be obtained this way. In fact, we need only take G r to be the 
group of transformations in the space and g the subgroup of G r leaving invariant a 
fixed point. 

Let $x^1(a^1, \ldots, a^r), \ldots, x^n(a^1, \ldots, a^r)$ be $n$ independent first integrals of
(4), so that $dx^1, \ldots, dx^n$ are $n$ linearly independent combinations of $\omega^1, \ldots, \omega^n$.
Then we have

(6) $[dx^1 \cdots dx^n] = A(a^1, \ldots, a^r)[\omega^1 \cdots \omega^n], \quad A \neq 0$.

By the measure of a domain $D$ of elements in $E$ we shall mean an $n$-tuple integral
of the form

(7) $J = \int_D f(x^1, \ldots, x^n)[dx^1 \cdots dx^n]$

4 For the notions of exterior differential forms and exterior derivation cf. E. Cartan,
Leçons sur les invariants intégraux, Chap. VI, VII, Paris 1922.





extended over $D$ such that its value is invariant under the group of transformations
in $E$, i.e., under the first group of parameters. Since this property holds
for all domains $D$, we see from (6) that it is expressed by

$fA = \text{constant}$.

Hence a necessary and sufficient condition for the elements of $E$ to possess a measure
is that the function $A(a)$ in (6) be a function of $x^1, \ldots, x^n$ only. The measure is
then defined up to a constant factor and is given by

(8) $J = \int_D [\omega^1 \cdots \omega^n]$.

The condition for the existence of a measure can also be expressed in terms of
the constants of structure. Let $d$ and $\delta$ be two operations such that $d$ denotes a
displacement in the set of elements in $E$ and $\delta$ a displacement on an integral
variety of (4). The condition for the existence of measure can be written as

$\delta[\omega^1 \cdots \omega^n] = 0$.

But we have

$\omega^1(\delta) = \cdots = \omega^n(\delta) = 0$.

The equations of structure (3) then give

$\delta\omega^i(d) = 2\sum_{k=1}^{n}\sum_{j=n+1}^{r} c^i_{jk}\,\omega^j(\delta)\,\omega^k(d), \quad i = 1, \ldots, n$.

From the last equations we get

$\delta[\omega^1 \cdots \omega^n] = 2\sum_{j=n+1}^{r}\sum_{i=1}^{n} c^i_{ji}\,\omega^j(\delta)[\omega^1 \cdots \omega^n]$.

Therefore a necessary and sufficient condition for the elements defined by (4) to
possess a measure is

(9) $\sum_{i=1}^{n} c^i_{ji} = 0, \quad j = n + 1, \ldots, r$.

2. Definition of the Incidence of Elements of Different Fields 

Consider two fields of geometrical elements, defined respectively by (4) and

(10) $\varpi^1 = 0, \ldots, \varpi^m = 0$,

where $\varpi^i$ $(i = 1, \ldots, m)$ are linear combinations of the relative components
with constant coefficients. Each of the systems (4) and (10) being completely
integrable, the same must be true of the combined system

(11) $\omega^1 = 0, \ldots, \omega^n = 0, \quad \varpi^1 = 0, \ldots, \varpi^m = 0$.



INTEGRAL GEOMETRY IN KLEIN SPACES 




Of the last system suppose $s$ of them ($s \geq m$, $s \geq n$) be linearly independent, so
that there exist $m + n - s$ relations of the form

(12) $\sum_{i=1}^{n} b_{\lambda i}\,\omega^i + \sum_{j=1}^{m} c_{\lambda j}\,w^j = 0, \quad \lambda = 1, \dots, m + n - s,$

where $b_{\lambda i}$, $c_{\lambda j}$ are constants. The integral varieties of (11) are of dimension
$r - s$. Denoting the integral varieties of (4), (10), (11) respectively by $V_{r-n}$,
$V_{r-m}$, $V_{r-s}$, we see that a $V_{r-s}$ lies completely on a $V_{r-n}$ (or $V_{r-m}$) if and only if it
has a point in common with $V_{r-n}$ (or $V_{r-m}$). It follows that a $V_{r-n}$ and a $V_{r-m}$ have
a $V_{r-s}$ in common if and only if they have a point in common.

By virtue of the above properties we define an element $N$ of (4) and an element
$M$ of (10) to be incident when their corresponding integral varieties $V_{r-n}$, $V_{r-m}$
have a $V_{r-s}$ in common.

Consider the elements $M$ incident with a given element $N$. In the group
space $G_r$ each of the integral varieties $V_{r-m}$ corresponding to $M$ cuts $V_{r-n}$ in a
$V_{r-s}$. The totality of $V_{r-s}$ on a fixed $V_{r-n}$ depends on $s - n$ parameters.
Therefore the elements of the field (10) incident with a given element $N$ of (4) depend
on $s - n$ parameters. The same property remains naturally true when the two
fields are interchanged.

It is important to notice that this definition of incidence includes the ordinary
notions of incidence in Euclidean geometry, affine geometry, projective geometry,
etc. as particular cases. Take, for instance, the group of motions in the Euclidean
plane

(13) $x^* = x \cos c - y \sin c + a, \qquad y^* = x \sin c + y \cos c + b,$

where $a, b, c$ are parameters. The relative components are

(14) $\omega^1 = \cos c\,da + \sin c\,db, \qquad \omega^2 = -\sin c\,da + \cos c\,db, \qquad \omega^3 = dc.$

As the equations of points and straight lines we may take respectively

(15) $\omega^1 = 0, \quad \omega^2 = 0$

and

(16) $\omega^1 = 0, \quad \omega^3 = 0.$

In the three-dimensional group space $(a, b, c)$ the elements of the field (15) are
represented by lines parallel to the $c$-axis and those of (16), by the lines

$\cos c \cdot a + \sin c \cdot b = \text{const.}, \qquad c = \text{const.}$

By interpreting $a, b$ as the Cartesian coordinates in the plane and the equation

$\cos c \cdot a + \sin c \cdot b = \text{const.}$

as the equation of a line in Hesse's normal form, we see immediately that our
definition of incidence coincides with the notion of incidence in the ordinary sense.
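The plane example can be checked numerically. The sketch below is an illustration only: it assumes the reading of (13) and (14) in which a motion is the triple (a, b, c) and the relative components are cos c da + sin c db, -sin c da + cos c db, dc. The function names are hypothetical. It verifies that these components do not change under a left translation of the group, and it tests incidence of a point with a line given in Hesse's normal form.

```python
import math

def compose(g, h):
    """Motion g applied after motion h; each motion is (a, b, c) as in (13)."""
    a1, b1, c1 = g
    a2, b2, c2 = h
    return (a1 + a2 * math.cos(c1) - b2 * math.sin(c1),
            b1 + a2 * math.sin(c1) + b2 * math.cos(c1),
            c1 + c2)

def relative_components(g, dg):
    """Values of the three forms (14) at g on the displacement dg = (da, db, dc)."""
    _, _, c = g
    da, db, dc = dg
    return (math.cos(c) * da + math.sin(c) * db,
            -math.sin(c) * da + math.cos(c) * db,
            dc)

def left_translate_displacement(k, dg):
    """Image of a displacement dg under the left translation g -> compose(k, g)."""
    _, _, ck = k
    da, db, dc = dg
    return (math.cos(ck) * da - math.sin(ck) * db,
            math.sin(ck) * da + math.cos(ck) * db,
            dc)

def incident(point, line, tol=1e-9):
    """True if the point (x, y) lies on the line cos(c)*x + sin(c)*y = p, line = (p, c)."""
    x, y = point
    p, c = line
    return abs(math.cos(c) * x + math.sin(c) * y - p) < tol
```

Comparing `relative_components(g, dg)` with the components computed at `compose(k, g)` on the translated displacement gives the same triple, which is the invariance used throughout the text.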

Before concluding this section, we want to remark that, instead of using the
integral varieties in the group space to characterize the elements of a field, we
may employ the geometrically more intuitive idea of families of frames (in
French “repère”). 5 But it is sufficient for our purpose to restrict ourselves to
one point of view.

3. The Generalized Crofton’s Formula and Cauchy’s Formula 

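Let us first consider the geometry in the Euclidean plane. It is well known
that the measures for points and lines exist and that they are given respectively
by the integrals (8) formed for the fields (15) and (16), 6

$\int [\omega^1 \omega^2], \qquad \int [\omega^1 \omega^3].$
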
Crofton’s formula in its simplest form asserts that the measure of the lines 
incident with the points of a curve is equal to a constant multiple of the length 
of the curve, each line being counted as many times as the number of its points of 
intersection with the curve. 
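The classical statement can be tested numerically for a polygonal curve. The sketch below is an illustration under stated assumptions: lines are written in Hesse's normal form cos(c) x + sin(c) y = p with density dp dc and 0 <= c < pi, for which the constant in Crofton's formula is 2. The function names are hypothetical.

```python
import math

def polyline_length(points):
    """Euclidean length of the polyline through the given points."""
    return sum(math.dist(p, q) for p, q in zip(points, points[1:]))

def crofton_measure(points, n_angles=2000):
    """Measure of the lines meeting the polyline, counted with multiplicity.

    For a fixed angle c, the lines meeting one segment fill an interval of p
    whose length is the absolute projection of the segment on (cos c, sin c).
    Summing over the segments counts every intersection point once.
    """
    h = math.pi / n_angles
    total = 0.0
    for i in range(n_angles):
        c = (i + 0.5) * h  # midpoint rule in the angle
        u, v = math.cos(c), math.sin(c)
        total += sum(abs((q[0] - p[0]) * u + (q[1] - p[1]) * v)
                     for p, q in zip(points, points[1:]))
    return total * h
```

For the boundary of the unit square the measure is close to 8, that is, twice the perimeter.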

To generalize this formula, we must first have the generalized notion of the
length of a curve for a $p$-dimensional variety of points under an arbitrary group
of transformations of Lie. For this purpose, take a $p$-dimensional variety $V_p$
formed by the elements of the field (4). Let $u^1, \dots, u^p$ be the parameters on
$V_p$. Then $\omega^1, \dots, \omega^n$ are linear combinations of $du^1, \dots, du^p$ on $V_p$. On
eliminating $du^1, \dots, du^p$, we may express $\omega^{p+1}, \dots, \omega^n$ as linear combinations
of $\omega^1, \dots, \omega^p$ in the form

$\omega^\alpha = \sum_{k=1}^{p} \xi^\alpha_k\,\omega^k, \quad \alpha = p + 1, \dots, n,$

where $\xi^\alpha_k$ are functions of the parameters $a^1, \dots, a^r$ of the group and of $u^1, \dots,
u^p$. By the method of moving frames of Cartan, it is in general possible (by
supposing the variety $V_p$ to be sufficiently general) to determine some or all of
the $\xi^\alpha_k$ to be constants or functions of the parameters $u^1, \dots, u^p$ such that the
determination is invariant under transformations of the group $G_r$. If, up to a
certain step in the determination of $\xi^\alpha_k$, the exterior differential forms of degree $p$

$[\omega^{i_1} \cdots \omega^{i_p}],$

where $i_1, \dots, i_p$ run over all combinations of $1, \dots, n$, depend on $u^1, \dots, u^p$
only and differ from each other only by constant factors, we say that the variety
$V_p$ possesses a $p$-dimensional area, which is equal to the integral of any one of the

5 Cf. Cartan, Théorie des groupes, Chap. V, or H. Weyl, The Classical Groups, Princeton
1939, pp. 16-17.

6 Blaschke, I. G. I, pp. 5-7. 





differential forms and is thus determined up to a constant factor. It is of course 
possible that the area of V p does not exist, as in the case that V p is a quadric in 
projective space. But in the cases which usually occur to us, the p-dimensional 
area of V p under G r exists. 

In the case $p = 1$ the $p$-dimensional area so defined leads to Pick's invariant
of a curve, 7 which includes the affine arc, projective arc, etc. as particular cases.
Pick has proved that if a group of transformations of order $r$ in the $xy$-plane
transforms transitively the elements of contact of order $r - 2$

$x, \quad y, \quad y' = \frac{dy}{dx}, \quad \dots, \quad y^{(r-2)} = \frac{d^{r-2}y}{dx^{r-2}},$

then a plane curve possesses an intrinsic parameter invariant under the group
considered. This intrinsic parameter is called the invariant of Pick. Its existence
is an easy consequence of our preceding discussions. In fact, it is only
necessary to take $x, y, y', \dots, y^{(r-2)}$ to be parameters of the group and

$\omega^1 = A_1(x, y, \dots, y^{(r-2)})\,dx + B_1(x, y, \dots, y^{(r-2)})\,dy = 0,$

$\omega^2 = A_2(x, y, \dots, y^{(r-2)})\,dx + B_2(x, y, \dots, y^{(r-2)})\,dy = 0$

to be the equations of points. Along a curve we have $dy = y'\,dx$, so that $\omega^2$
is a constant multiple of $\omega^1$, and the integral

$\int \omega^1$

gives the intrinsic parameter of the curve.

Another important particular case of the notion of $p$-dimensional area is the
case that $p = s - m$ and that $V_p$ consists of all the elements of the field (4)
incident with a fixed element of the field (10). In this case, the relations between
$\omega^1, \dots, \omega^n$ are obtained from (12) by setting $w^1 = \cdots = w^m = 0$ and are therefore

$\sum_{i=1}^{n} b_{\lambda i}\,\omega^i = 0, \quad \lambda = 1, \dots, m + n - s.$

Since $b_{\lambda i}$ are constants, we may take as the $p$-dimensional area of this $V_p$ the
integral

$\int [\omega^1 \cdots \omega^p].$

If the value of this integral over $V_p$ is finite, we call it the measure of the elements
of (4) about a fixed element of (10).

With these preparations we can give the generalized Crofton’s formula in the 
following theorem: 


7 Cf., for example, G. Kowalewski, Allgemeine natürliche Geometrie und Liesche
Transformationsgruppen, Berlin 1931, pp. 106-110.





Let two fields of elements $M$, $N$ be defined respectively by the equations (10), (4).
Let $p = m + n - s$ and let $V_p$ be a variety of $p$ dimensions formed by the elements
of $M$. If the measure of the elements of $N$ incident with the elements of $V_p$ and
the $p$-dimensional area $F$ of $V_p$ both exist and are finite, then

(17) $\int [\omega^1 \cdots \omega^n] = cF, \quad c = \text{constant},$

where the integral is extended over the elements of $N$ incident with the elements of
$V_p$, each element being counted as many times as the number of elements of $V_p$ with
which it is incident.
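As a check on the dimension count, the theorem can be specialized to the Euclidean plane. The worked example below is an illustration, assuming the fields of points and lines are those defined by (15) and (16); it recovers the classical form of Crofton's formula.

```latex
% Points as the field M, lines as the field N, in the group of plane motions (r = 3).
\begin{align*}
  &M:\ \omega^1 = \omega^2 = 0 \ (m = 2), \qquad
   N:\ \omega^1 = \omega^3 = 0 \ (n = 2), \\
  &\text{combined system: } \omega^1 = \omega^2 = \omega^3 = 0
   \ \Longrightarrow\ s = 3, \\
  &p = m + n - s = 1
   \ \Longrightarrow\ V_p \text{ is a curve, and } F \text{ is its length}, \\
  &s - m = 1
   \ \Longrightarrow\ \text{the lines through a fixed point depend on one parameter}, \\
  &\int [\omega^1 \omega^3] = cF .
\end{align*}
```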

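To prove this theorem, consider the $p = m + n - s$ relations (12). Since
the relations are independent with respect to $\omega^1, \dots, \omega^n$, we may solve them in
terms of $\omega^1, \dots, \omega^p$, obtaining

$\omega^\lambda = \sum_{j=p+1}^{n} e^\lambda_j\,\omega^j + \sum_{k=1}^{m} f^\lambda_k\,w^k, \quad \lambda = 1, \dots, p,$

where $e^\lambda_j$, $f^\lambda_k$ are constants. Setting

$\bar\omega^\lambda = \omega^\lambda - \sum_{j=p+1}^{n} e^\lambda_j\,\omega^j, \quad \lambda = 1, \dots, m + n - s,$

and denoting $\bar\omega^\lambda$ again by $\omega^\lambda$, we have

(18) $\omega^\lambda = \sum_{k=1}^{m} f^\lambda_k\,w^k, \quad \lambda = 1, \dots, m + n - s.$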

The proof of our theorem then depends on a new form of the expression
$[\omega^1 \cdots \omega^n]$. We apply first the method of moving frames of Cartan to $V_p$.
Since by hypothesis the $p$-dimensional area of $V_p$ exists we can attach to each
element of $V_p$ a frame such that within the $p$-parameter family of frames so
attached the exterior differential forms

$[w^{i_1} \cdots w^{i_p}],$

where $i_1, \dots, i_p$ run over all combinations of $1, \dots, m$, differ from each other
only by constant factors. In the group space $G_r$, $V_p$ is represented by a $p$-parameter
family of left-handed cosets with respect to a subgroup $h$. To the frames
attached to the elements of $V_p$ there corresponds in $G_r$ a $p$-dimensional variety
$W_p$ such that on every coset corresponding to an element of $V_p$ there lies one
and only one point of $W_p$. Now every integral variety $Sh$ of (10) is cut by the
integral varieties of (4) incident to it in the $(r - s)$-dimensional integral varieties
$V_{r-s}$ of (11). The totality of $V_{r-s}$ on $Sh$ depends on $s - m$ parameters. To
define a system of coordinates for the $V_{r-s}$ on $Sh$ we may first set up a system of
coordinates $\lambda^1, \dots, \lambda^{s-m}$ for the $V_{r-s}$ on $h$. By the transformation $S_a \to SS_a$
($S$ being fixed) the variety $h$ is carried to $Sh$ and we define the coordinates of a
$V_{r-s}$ on $h$ to be the coordinates of its image on $Sh$.

Let $u^1, \dots, u^p$ be the parameters on $V_p$. As the coordinates of an element
of the field $N$ incident with an element of $V_p$ we may take $u^1, \dots, u^p, \lambda^1, \dots,
\lambda^{s-m}$. Now the expression $[\omega^1 \cdots \omega^n]$ depends only on the elements of the field $N$
under consideration. This property, called by Blaschke the property of invariance
of choice (“Wahlinvarianz”), may be interpreted as follows: Choose on
each integral variety of (4) a point and compute the relative components $\omega^1, \dots,
\omega^n$ on the $n$-dimensional variety so obtained. The resulting expression is
independent of the choice of these points.

This being clear, we may choose the points on the integral varieties of (4) to be
on the $V_{r-s}$ of the integral varieties corresponding to the elements of $V_p$. Its
totality forms a variety of $p + s - m = n$ dimensions, which we call $V_n$. To
find the relative components on $V_n$ notice that the relative components $\omega^i(a, da)$
$(i = 1, \dots, r)$ are the parameters of the infinitesimal transformation $S_a^{-1}S_{a+da}$
and that by choosing the infinitesimal transformations $X_i f$ $(i = 1, \dots, r)$ such
that $X_{m+1}f, \dots, X_r f$ generate the subgroup $h$ leaving invariant a fixed element $O$
of the field $M$, the infinitesimal transformation $S_a^{-1}S_{a+da}$ carrying $O$ to a neighboring
element $P$ is of the form

$w^1 X_1 f + \cdots + w^m X_m f + \omega^{m+1} X_{m+1} f + \cdots + \omega^r X_r f,$

where $w^1, \dots, w^m$ are exactly the left-hand members of the system (10).

Let $\theta^1, \dots, \theta^m$ denote the relative components $w^1, \dots, w^m$ on $W_p$,
$\tilde\omega^{p+1}, \dots, \tilde\omega^n$ the relative components $\omega^{p+1}, \dots, \omega^n$ on $Sh$, and let
$w^1, \dots, w^m$, $\omega^{p+1}, \dots, \omega^n$ denote these components on $V_n$. As the independent
components on $V_n$ we may then take $p$ independent forms among the $\theta^1, \dots, \theta^m$,
and $\tilde\omega^{p+1}, \dots, \tilde\omega^n$, the latter being Pfaffian forms in $\lambda^1, \dots, \lambda^{s-m}$ only.
Now every point of $V_n$ is of the form

$S_a = S_b R,$

where $S_b$ and $R$ belong respectively to $W_p$ and $h$. By writing

$S_{a+da} = S_{b+db} R', \quad R' \in h,$

we get

$S_a^{-1} S_{a+da} = R^{-1} S_b^{-1} S_{b+db} R \cdot R^{-1} R'.$

Since $R^{-1}R'$ belongs to $h$, it produces no effect on $O$. The relative components
$w^1, \dots, w^m$ are thus transformed according to the transformations of the adjoint
group which correspond to transformations of $h$. As $R$ transforms between
themselves the infinitesimal transformations of $h$, we have

(19) $w^1 = a^1_1\theta^1 + \cdots + a^1_m\theta^m, \quad \dots, \quad w^m = a^m_1\theta^1 + \cdots + a^m_m\theta^m,$

where $a^i_k$ $(i, k = 1, \dots, m)$ are functions of $\lambda^1, \dots, \lambda^{s-m}$.







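On the other hand, for the components $\omega^{p+1}, \dots, \omega^n$ on $V_n$ we must have

(20) $\omega^{m+n-s+1} = \tilde\omega^{m+n-s+1} + b^{m+n-s+1}_1\theta^1 + \cdots + b^{m+n-s+1}_m\theta^m,$
$\quad\dots$
$\omega^n = \tilde\omega^n + b^n_1\theta^1 + \cdots + b^n_m\theta^m,$

since $\omega^i$ $(i = p + 1, \dots, n)$ reduce to $\tilde\omega^i$ when $u^1, \dots, u^p$ are constants. Thus
we get

$[\omega^{p+1} \cdots \omega^n] \equiv [\tilde\omega^{p+1} \cdots \tilde\omega^n] \pmod{\theta^1, \dots, \theta^m}.$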


Finally, from (18) and (19), we find

(21) $[\omega^1 \cdots \omega^n] = f(\lambda^1, \dots, \lambda^{s-m})\,[\tilde\omega^{p+1} \cdots \tilde\omega^n\, d\Omega],$

where $d\Omega$ denotes the element of the $p$-dimensional area of $V_p$.

It is easy to finish the proof of our theorem from the last formula. In fact,
evaluating the integral

$\int f(\lambda^1, \dots, \lambda^{s-m})\,[\tilde\omega^{p+1} \cdots \tilde\omega^n\, d\Omega]$

by first holding an element of $V_p$ fixed, we get a constant which is the same for
all elements of $V_p$. Integration over $V_p$ then gives the $p$-dimensional area of $V_p$.

From this proof we observe that even when the measure of the elements of $N$
incident with the elements of $V_p$ becomes infinite, there still exists a generalized
formula of Crofton. In fact, we may make the following restriction: On each
variety of $s - m$ dimensions $V_{s-m}$ consisting of the elements of (4) incident with a
fixed element of (10) take a fixed domain $D$ such that the integral

$\int_D f(\lambda^1, \dots, \lambda^{s-m})\,[\tilde\omega^{p+1} \cdots \tilde\omega^n]$

is finite and constant for all elements of (10). Then the measure of the elements
of (4) incident with the elements of $V_p$ and belonging to the domain $D$ at each
element of incidence is finite and will be given by Crofton's formula (17).

We now come to generalize the formula of Cauchy. Consider a surface $S$ in
Euclidean space and the planes $E$ intersecting the surface. If $\dot{E}$ denotes the
density of the planes and $U_E$ the perimeter of the curve of intersection of the
surface by the plane, then the formula of Cauchy is 8

(22) $\int U_E\,\dot{E} = \frac{\pi^2}{2} F,$

where $F$ denotes the area of $S$ and the integral in the left-hand side is extended
over all planes intersecting $S$.
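Cauchy's formula can be checked numerically on a sphere. The sketch below is an illustration under explicit assumptions that are not taken from the source: the density of planes is normalized as dp times the area element of the unit normal, with p >= 0 the distance from the centre and the normal ranging over the whole unit sphere, and with this normalization the constant is taken to be pi^2/2. The function names are hypothetical.

```python
import math

def cauchy_lhs_sphere(radius, n_steps=20000):
    """Integral of the section perimeter over all planes meeting a sphere.

    A plane at distance p from the centre cuts the sphere in a circle of
    perimeter 2*pi*sqrt(radius^2 - p^2). The integral over p is computed by
    the midpoint rule; the unit normals contribute the factor 4*pi.
    """
    h = radius / n_steps
    radial = sum(2 * math.pi * math.sqrt(radius ** 2 - ((i + 0.5) * h) ** 2)
                 for i in range(n_steps)) * h
    return 4 * math.pi * radial

def cauchy_rhs_sphere(radius):
    """Assumed constant pi^2/2 times the area 4*pi*radius^2 of the sphere."""
    return (math.pi ** 2 / 2) * 4 * math.pi * radius ** 2
```

Both sides come out as 2 pi^3 R^2, so the two functions agree up to the quadrature error.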

To give a generalization of this formula take a variety $V_p$ of $p$ dimensions
formed by the elements of the field $M$. As in the proof of Crofton's formula (17)


8 Blaschke, I. G. II, p. 73.






let $u^1, \dots, u^p$ be the parameters on $V_p$ and let us attach to each element of $V_p$ a
frame. Almost all the arguments used above are valid, including the formulas
(19), (20). The only difference is that the variety, formerly denoted by $V_n$,
is now of $p + s - m$ dimensions. This variety $V_{p+s-m}$ will be cut by an integral
variety $V_{r-n}$ of (4) in a variety $V_{p+s-m-n}$ of $p + s - m - n$ dimensions (supposing
$p > m + n - s$). Through each point of $V_{p+s-m-n}$ there passes an integral
variety of (10). Its totality consists of all the elements of $V_p$ incident with
$V_{r-n}$. To find the independent components on $V_{p+s-m-n}$ it is only necessary to
set $\omega^\lambda = 0$ in (18). The independent Pfaffian forms in the equations so obtained

$\sum_{k=1}^{m} f^\lambda_k\,w^k = 0, \quad \lambda = 1, \dots, m + n - s$

then give the independent components on $V_{p+s-m-n}$. The generalized Cauchy's
formula states that the integral over all elements of (4) incident with $V_p$ of a
$(p + s - m - n)$-dimensional integral invariant of $V_{p+s-m-n}$ of the form

(23) $\int \sum C_{i_1 \cdots i_{p+s-m-n}}\,[w^{i_1} \cdots w^{i_{p+s-m-n}}],$

where $C_{i_1 \cdots i_{p+s-m-n}}$ are functions of $\lambda^1, \dots, \lambda^{s-m}$ only, is equal to a constant multiple
of the $p$-dimensional area of $V_p$, provided that both quantities are finite.

It is only necessary to prove the formula for the case when the sum (23)
contains one term. Suppose it be

$\int C\,[w^1 \cdots w^{p+s-m-n}].$

By making use of (18), (20), we get

$C\,[w^1 \cdots w^{p+s-m-n}\,\omega^1 \cdots \omega^n] = F(\lambda^1, \dots, \lambda^{s-m})\,[\theta^1 \cdots \theta^p\,\tilde\omega^{m+n-s+1} \cdots \tilde\omega^n],$

where $F$ is a function of $\lambda^1, \dots, \lambda^{s-m}$. The generalized Cauchy's formula as
stated above is then an easy consequence of the relation obtained.

It may be helpful to give an example of the above general discussions. Consider
the three-dimensional Euclidean space with its group of motions. Instead
of introducing the group space we may take a fixed right-hand rectangular trihedral
$T_0$ in space and take all such trihedrals $T$ as the elements of the space.
Then the motions in space and the trihedrals $T$ are in one-to-one correspondence
such that to $T$ corresponds the motion carrying $T_0$ to $T$. As the parameters
of the group we may take the parameters of $T$. Let $P$ be the origin of $T$ and
$e_1, e_2, e_3$ the three unit vectors along the axes of $T$. Then the relative components
of the group are defined by the equations

(24)
$dP = \omega^1 e_1 + \omega^2 e_2 + \omega^3 e_3,$
$de_i = \omega_i^1 e_1 + \omega_i^2 e_2 + \omega_i^3 e_3,$
$\omega_i^j + \omega_j^i = 0, \quad (i, j = 1, 2, 3).$





The equations of structure of the group are

(25)
$(\omega^i)' = \sum_{k=1}^{3} [\omega^k \omega_k^i],$
$(\omega_i^k)' = \sum_{l=1}^{3} [\omega_i^l \omega_l^k], \quad i, k = 1, 2, 3.$

As the coset in the group space corresponding to a point we take the family of
trihedrals having the point as origin. As the coset corresponding to a straight
line we take the trihedrals whose origin is on the line and whose vector $e_3$ is along
the line. Finally, the coset corresponding to a plane is formed by the trihedrals
with origin on the plane and with the third unit vector $e_3$ perpendicular to the
plane. By these definitions, the equations of the points, lines, and planes are
respectively

(26)
$\omega^1 = 0, \quad \omega^2 = 0, \quad \omega^3 = 0;$
$\omega^1 = 0, \quad \omega^2 = 0, \quad \omega_3^1 = 0, \quad \omega_3^2 = 0;$
$\omega^3 = 0, \quad \omega_3^1 = 0, \quad \omega_3^2 = 0.$

For all these three fields of elements the measures exist, as may be verified by
using the criterion (9). They are then given by the integrals

(27) $\int [\omega^1 \omega^2 \omega^3], \qquad \int [\omega^1 \omega^2 \omega_3^1 \omega_3^2], \qquad \int [\omega^3 \omega_3^1 \omega_3^2].$

Consider now the fields of points and lines and denote them by $M$, $N$ respectively.
Then

$m = 3, \quad n = 4, \quad s = 5.$

Put

$w^1 = \omega^1, \quad w^2 = \omega^2, \quad w^3 = \omega^3.$

The relations (18) become

$\omega^1 = w^1, \quad \omega^2 = w^2.$

To obtain the formula of Crofton in this case take a surface $\Sigma$ and attach to each
point of the surface a trihedral $P v_1 v_2 v_3$ with origin at this point and with $v_3$ normal
to the surface. Choose the trihedral $P e_1 e_2 e_3$ attached to a line through $P$ such
that $e_1$ is on the plane $P v_1 v_2$. Then we have

(28)
$v_1 = -\sin\psi\, e_1 - \cos\psi \cos\varphi\, e_2 + \cos\psi \sin\varphi\, e_3,$
$v_2 = \cos\psi\, e_1 - \sin\psi \cos\varphi\, e_2 + \sin\psi \sin\varphi\, e_3,$
$v_3 = \sin\varphi\, e_2 + \cos\varphi\, e_3,$

where $\varphi$, $\psi$ are the parameters of the lines through $P$, $\varphi$ being the angle between
$e_3$ and $v_3$. From

$dP = \theta^1 v_1 + \theta^2 v_2 = w^1 e_1 + w^2 e_2 + w^3 e_3,$

we get

(29)
$\omega^1 = -\theta^1 \sin\psi + \theta^2 \cos\psi,$
$\omega^2 = -\theta^1 \cos\psi \cos\varphi - \theta^2 \sin\psi \cos\varphi.$

It follows that

(30) $[\omega^1 \omega^2 \omega_3^1 \omega_3^2] = \cos\varphi\,[\omega_3^1 \omega_3^2 \theta^1 \theta^2].$

This is a particular case of the formula (21). The classical proof of Crofton's
formula is based on this relation.

Next, take the fields of points and planes and denote them by $M$, $N$ respectively.
Then

$m = 3, \quad n = 3, \quad s = 5.$

Put

$w^1 = \omega^1, \quad w^2 = \omega^2, \quad w^3 = \omega^3.$

The set of relations (18) consists only of one equation

$\omega^3 = w^3.$

As by (29), we get

(31) $w^3 = \theta^1 \cos\psi \sin\varphi + \theta^2 \sin\psi \sin\varphi.$

Along the curve of intersection of the plane $P e_1 e_2$ with $\Sigma$, we have

$w^3 = 0$

and

$dP = w^1 e_1,$

so that $e_1$ is the tangent and $w^1$ the element of arc. Then we get

(32) $[w^1 \omega^3 \omega_3^1 \omega_3^2] = [w^1 w^3 \omega_3^1 \omega_3^2] = -\sin\varphi\,[\theta^1 \theta^2 \omega_3^1 \omega_3^2].$

From this relation we derive easily Cauchy's formula.
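The coefficients cos φ and -sin φ in the two relations above can be verified directly from the change of frame. The sketch below is an illustration: it assumes the reading of (28) in which v_3 = sin φ e_2 + cos φ e_3, chosen because it is the only reading that makes the trihedral orthonormal, and the function names are hypothetical. It expands dP = θ^1 v_1 + θ^2 v_2 along e_1, e_2, e_3 and computes the coefficient of [θ^1 θ^2] in the exterior product [w^i w^j].

```python
import math

def surface_frame(phi, psi):
    """Components of v1, v2, v3 along e1, e2, e3, following the assumed (28)."""
    v1 = (-math.sin(psi), -math.cos(psi) * math.cos(phi), math.cos(psi) * math.sin(phi))
    v2 = (math.cos(psi), -math.sin(psi) * math.cos(phi), math.sin(psi) * math.sin(phi))
    v3 = (0.0, math.sin(phi), math.cos(phi))
    return v1, v2, v3

def dot(u, v):
    """Scalar product of two vectors given by their components."""
    return sum(x * y for x, y in zip(u, v))

def wedge_coefficient(phi, psi, i, j):
    """Coefficient of [theta^1 theta^2] in [w^i w^j] (indices 1..3).

    Since w^i = theta^1 * v1[i] + theta^2 * v2[i], the coefficient is the
    2x2 minor of the components of v1 and v2 in the columns i and j.
    """
    v1, v2, _ = surface_frame(phi, psi)
    return v1[i - 1] * v2[j - 1] - v1[j - 1] * v2[i - 1]
```

With these definitions `wedge_coefficient(phi, psi, 1, 2)` equals cos φ and `wedge_coefficient(phi, psi, 1, 3)` equals -sin φ for every ψ, in agreement with the factors appearing in the formulas of Crofton and Cauchy.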

Tsing Hua University 
Kunming, China 




CONSERVATION OF SCHOLARLY JOURNALS 

The American Library Association created this last year the Committee on 
Aid to Libraries in War Areas, headed by John R. Russell, the Librarian of the 
University of Rochester. The Committee is faced with numerous serious 
problems and hopes that American scholars and scientists will be of considerable 
aid in the solution of one of these problems. 

One of the most difficult tasks in library reconstruction after the first World 
War was that of completing foreign institutional sets of American scholarly, 
scientific, and technical periodicals. The attempt to avoid a duplication of that 
situation is now the concern of the Committee. 

Many sets of journals will be broken by the financial inability of the institutions
to renew subscriptions. As far as possible they will be completed from a
stock of periodicals being purchased by the Committee. Many more will have 
been broken through mail difficulties and loss of shipments, while still other sets 
will have disappeared in the destruction of libraries. The size of the eventual 
demand is impossible to estimate, but requests received by the Committee 
already give evidence that it will be enormous. 

With an imminent paper shortage attempts are being made to collect old 
periodicals for pulp. Fearing this possible reduction in the already limited 
supply of scholarly and scientific journals, the Committee hopes to enlist the 
cooperation of subscribers to this journal in preventing the sacrifice of this type 
of material to the pulp demand. It is scarcely necessary to mention the appreciation
of foreign institutions and scholars for this activity.

Questions concerning the project or concerning the value of particular periodicals
to the project should be directed to Wayne M. Hartwell, Executive Assistant
to the Committee on Aid to Libraries in War Areas, Rush Rhees Library, University
of Rochester, Rochester, New York.





Annals of Mathematics
Vol. 43, No. 3, April, 1942 


THE CLOSURE OPERATORS OF A LATTICE 

By Morgan Ward
(Received January 29, 1940) 

I. Introduction 

1. If $\mathfrak{S}$ is a lattice of elements $A, B, \dots$, the class of all operators of $\mathfrak{S}$ (that
is, one-valued functions $\phi X = \phi(X)$ on $\mathfrak{S}$ to $\mathfrak{S}$) may be made into a lattice by
defining the union $\delta$ and cross-cut $\kappa$ of any set $\Phi$ of operators $\phi$ by 1

$\delta X = (\cdots \phi X \cdots), \qquad \kappa X = [\cdots \phi X \cdots], \qquad \phi \in \Phi.$

The union and cross-cut here are taken over all the values $\phi X$ of the operators
in $\Phi$ for any given $X$ of $\mathfrak{S}$.

It is easily verified that the operators of $\mathfrak{S}$ form a lattice in which $\phi \supset \psi$ if
and only if $\phi X \supset \psi X$ for every $X$ of $\mathfrak{S}$; furthermore this lattice is closed, modular,
or distributive according as $\mathfrak{S}$ is closed, modular or distributive. 2

The operator lattice of a lattice is a concept comparable in generality to the 
Boolean algebra of all subsets of a lattice. As in the algebra, it is certain 
distinguished sets of operators which are useful in investigating the given lattice
rather than the operator lattice itself.

One obviously important distinguished type is the linear operator. An
operator $\phi$ is said to be linear if for any subset $\mathfrak{A}$ of elements $A$ of $\mathfrak{S}$, it has one
or more of the four properties

(i) $\phi(\cdots A \cdots) = (\cdots \phi A \cdots)$, (iii) $\phi[\cdots A \cdots] = [\cdots \phi A \cdots]$,

(ii) $\phi(\cdots A \cdots) = [\cdots \phi A \cdots]$, (iv) $\phi[\cdots A \cdots] = (\cdots \phi A \cdots)$.

Here the unions and cross-cuts are taken over all the elements of $\mathfrak{A}$, and $\mathfrak{A}$ is
finite if $\mathfrak{S}$ is not closed. Lattice homomorphisms and homomorphisms with
respect to union with properties (i), (iii) and (i) respectively are familiar examples.
(Ore 1).

The linear operators and certain associated lattices are important in the
study of residuated lattices (Ward-Dilworth 1) as I plan to show in detail
elsewhere. 3


1 If $\mathfrak{S}$ is not closed, $\Phi$ is assumed to contain only a finite number of operators. A lattice
is said to be closed (or “complete” or “continuous”) if it contains the union and cross-cut
of any subset of elements in it.

2 Chain conditions in $\mathfrak{S}$ do not usually carry over to the operator-lattice.

3 The product $\phi\psi$ of two operators $\phi$ and $\psi$ defined by $\phi\psi X = \phi(\psi(X))$ immediately gives
us an associative multiplication over the operator lattice. On the other hand if $B$ is any
fixed element of a residuated lattice $\mathfrak{S}$, the operators $\mu$ and $\rho$ defined by $\mu X = BX$, $\rho X =
B : X$ have the linear properties $\mu(\cdots A \cdots) = (\cdots \mu A \cdots)$, $\rho(\cdots A \cdots) = [\cdots \rho A \cdots]$.



I have discussed elsewhere (Ward 2) a type of operator associated with a
point lattice, 4 which may be used to classify all such lattices of finite order.

2. I develop here the properties of a type of operator which is of fundamental
importance in the study of certain imbedding problems of ring theory and
semi-group theory. 5 A typical problem of this class is to imbed a system $I$ of
elements over which a commutative and associative multiplication is defined in
a residuated lattice $\mathfrak{S}$ so as to preserve the multiplication in $I$ and thus to
study the arithmetical properties of $I$. (Clifford 2, Ward-Dilworth 2). The
imbedding is effected by defining a suitable type of “ideal” (distinguished subset)
of $I$ in the Boolean algebra of its subsets. 6 A closely related problem is to
imbed a semi-ordered set in a closed lattice. (Mac Neille 1).

The “closure operators” introduced here enable us to view all these problems
from a unified standpoint, and explain why in all extant theories of ideals as
distinguished subsets, the cross-cut of two ideals is the set-theoretic cross-cut
of their elements.


II. Closure Operators 

3. Let $\mathfrak{S}$ be a closed lattice. An operator $\phi$ of $\mathfrak{S}$ is said to be a closure operator
if it satisfies the following three conditions: 7

I 1. $A \supset B$ implies that $\phi A \supset \phi B$.

I 2. $\phi \supset \iota$.

I 3. $\phi^2 = \phi$.

Here $\iota$ is the identity operator leaving every element of $\mathfrak{S}$ unchanged.

If $\mathfrak{T}$ is any set of elements $T$ of $\mathfrak{S}$, it may be proved that every closure operator
$\phi$ has the quasi-linear properties

(3.1) $\phi[\cdots \phi T \cdots] = [\cdots \phi T \cdots],$

(3.2) $\phi(\cdots T \cdots) = \phi(\cdots \phi T \cdots), \quad T \in \mathfrak{T}.$

No actual linearity is assumed.

Theorem 3.1. The cross-cut 8 of any set of closure operators is again a closure
operator.

4 A lattice is called a point lattice if every element in it save the null element is a union 
of points. Here a point is any element covering the null element. Point lattices include 
important types of projective geometries, exchange lattices, and Boolean algebras. 

5 For a discussion of these problems, the reader is referred to Clifford 1, 2 where references
are given to the work of Prüfer and others.

6 Several definitions are usually possible. See Ward-Dilworth 2. 

7 These axioms are satisfied by Kuratowski’s closure operator over a Boolean algebra 
with points. (Kuratowski 1). But they are essentially weaker, as Kuratowski's operator 
is linear with respect to union. Compare also Birkhoff 1. 

8 In general, no closure properties hold for the union and product of (closure) operators.
It may be shown that if $\phi$ and $\psi$ are operators, then $(\phi, \psi)$ is an operator if and only if $(\phi, \psi) =
\phi\psi = \psi\phi$. Commutativity is thus a necessary condition for the union $(\phi, \psi)$ to be an operator.
It is evidently a sufficient condition for the product $\phi\psi$ to be an operator.





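Proof. Let $\Phi$ be a set of closure operators $\phi$, and let $\kappa = [\cdots \phi \cdots]$ be their
cross-cut. We shall show that $\kappa$ satisfies I 1, I 2, I 3.

I 1 is satisfied. For $A \supset B$ implies $\phi A \supset \phi B$ for every $\phi \in \Phi$. Hence
$[\cdots \phi A \cdots] \supset [\cdots \phi B \cdots]$, $\kappa A \supset \kappa B$. I 2 is satisfied. For since $\phi A \supset A$
for every $\phi \in \Phi$, $[\cdots \phi A \cdots] \supset A$ or $\kappa \supset \iota$. I 3 is satisfied. For $\kappa^2 A =
[\cdots \phi\kappa A \cdots]$, $\phi \in \Phi$. Now $\phi \supset \kappa$. Hence $\phi A \supset \kappa A$, $\phi^2 A \supset \phi\kappa A$, $\phi A \supset \phi\kappa A$.
Accordingly $\kappa \supset \kappa^2$. By I 1 and I 2, $\kappa^2 \supset \kappa$. Hence $\kappa^2 = \kappa$, completing the proof.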
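Theorem 3.1 can be tried out on a small finite lattice. The sketch below is an illustration, not part of the source: it takes the lattice of all subsets of {0, 1, 2, 3}, where the division relation is set inclusion and the cross-cut is intersection, and the two operators `phi` and `psi` are hypothetical examples chosen for the test.

```python
from itertools import combinations

GROUND = frozenset(range(4))
SUBSETS = [frozenset(c) for r in range(len(GROUND) + 1)
           for c in combinations(sorted(GROUND), r)]

def is_closure(op):
    """Check conditions I 1 (monotone), I 2 (extensive), I 3 (idempotent)."""
    monotone = all(op(a) >= op(b) for a in SUBSETS for b in SUBSETS if a >= b)
    extensive = all(op(a) >= a for a in SUBSETS)
    idempotent = all(op(op(a)) == op(a) for a in SUBSETS)
    return monotone and extensive and idempotent

def cross_cut(ops):
    """Operator whose value at A is the cross-cut of the values op(A)."""
    def kappa(a):
        result = GROUND
        for op in ops:
            result = result & op(a)
        return result
    return kappa

def phi(a):
    """Hypothetical closure operator: adjoin 1 whenever 0 is present."""
    return a | {1} if 0 in a else a

def psi(a):
    """Hypothetical closure operator: adjoin every number below the maximum."""
    return frozenset(range(max(a) + 1)) if a else a
```

Calling `is_closure(cross_cut([phi, psi]))` returns `True`, as the theorem predicts for any set of closure operators.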

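4. Let $\phi$ be a given closure operator, and let $\mathfrak{S}' = \phi\mathfrak{S}$ be the set of all its
values $X' = \phi X$ in $\mathfrak{S}$. By formula (3.1) any subset $\mathfrak{T}'$ of the $X'$ is closed under
cross-cut. We may express this fact by writing

(4.1) $[\cdots T' \cdots]_{\mathfrak{S}'} = [\cdots T' \cdots]_{\mathfrak{S}}, \quad T' \in \mathfrak{T}' \subset \mathfrak{S}'.$

If $I$ is the unit element of $\mathfrak{S}$, then $I' = I$ divides all elements $A'$ of $\mathfrak{S}'$. Hence
for any subset $\mathfrak{L}'$ of elements $L'$ of $\mathfrak{S}'$, the class of all $K'$ such that $K' \supset L'$ is
non-empty. We define the union of the $L'$ to be the cross-cut of the $K'$:

(4.2) $(\cdots L' \cdots)_{\mathfrak{S}'} = [\cdots K' \cdots]_{\mathfrak{S}'}, \quad K' \supset L'$, every $L'$ of $\mathfrak{L}'$.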

We obtain by a familiar argument: 

Theorem 4.1. The set $\mathfrak{S}'$ of values of a given closure operator forms a closed
lattice within $\mathfrak{S}$ with respect to the operations of union and cross-cut defined by
(4.2) and (4.1).

To each closure operator $\phi$ we may accordingly assign a lattice $\mathfrak{S}' = \phi\mathfrak{S}$. In
particular, $\mathfrak{S} = \iota\mathfrak{S}$. We shall establish a converse result.

Let $\mathfrak{S}'$ now denote a fixed subset of $\mathfrak{S}$ closed under cross-cut and containing
the unit element $I$. We make $\mathfrak{S}'$ into a lattice within $\mathfrak{S}$ by assigning to any
subset $\mathfrak{L}'$ of elements of $\mathfrak{S}'$ as in (4.2) a union defined as the cross-cut of the
set of all multiples of the elements of $\mathfrak{L}'$.

We next define an operator $\phi$ on $\mathfrak{S}$ to $\mathfrak{S}'$ as follows: If $A$ is any element of
$\mathfrak{S}$, then $\phi A$ is the cross-cut of all elements $B'$ of $\mathfrak{S}'$ such that $B' \supset A$. Then
$\phi$ is a closure operator, for I 1, I 2, I 3 are evidently satisfied. Furthermore,
$\phi\mathfrak{S} = \mathfrak{S}'$.

We have thus established a one-to-one correspondence between the closure
operators of $\mathfrak{S}$ and subsets of $\mathfrak{S}$ closed under cross-cut and containing $I$. The
lattice $\mathfrak{S}' = \phi\mathfrak{S}$ and the operator $\phi$ will be said to belong to one another.
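The correspondence can be illustrated in the lattice of all subsets of a three-element set. The sketch below is an illustration with hypothetical names: `operator_of` builds the operator from a family closed under intersection and containing the unit element, and `values_of` recovers the family as the set of values of the operator.

```python
from itertools import combinations

GROUND = frozenset(range(3))
SUBSETS = [frozenset(c) for r in range(len(GROUND) + 1)
           for c in combinations(sorted(GROUND), r)]

def operator_of(family):
    """phi(A) is the cross-cut of all members B of the family with B containing A."""
    def phi(a):
        result = GROUND
        for b in family:
            if b >= a:
                result = result & b
        return result
    return phi

def values_of(op):
    """The set of all values op(A), A ranging over every subset."""
    return {op(a) for a in SUBSETS}

# A hypothetical family closed under intersection and containing the unit GROUND.
FAMILY = {frozenset(), frozenset({0}), frozenset({0, 1}),
          frozenset({0, 2}), GROUND}
```

Passing from the family to its operator and back returns the original family, and an operator rebuilt from its own set of values agrees with the original on every subset.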

It is also easily proved from formula (3.2) that $\mathfrak{S}'$ is a sublattice of $\mathfrak{S}$ if and
only if the closure operator belonging to $\mathfrak{S}'$ is linear with respect to union.

Theorem 4.2. The closure operators of any closed lattice $\mathfrak{S}$ themselves form a
lattice within the operator lattice of $\mathfrak{S}$.

Proof. Let $\Sigma$ denote the set of all closure operators of $\mathfrak{S}$. By Theorem
3.1, the cross-cut of any set of such operators is again a closure operator.
Furthermore the operator $\omega$ defined by $\omega A = I$, every $A$ of $\mathfrak{S}$, is obviously a
closure operator dividing every other closure operator. Hence we may define
the union of any set $\Phi$ of such operators as the cross-cut of the non-empty set
of closure operators containing every operator of $\Phi$.





We may evidently define lattice operations on the set of all subsets $\mathfrak{S}'$ of $\mathfrak{S}$
closed under cross-cut and containing $I$ by the rules

(4.3) $[\cdots \mathfrak{S}' \cdots] = [\cdots \phi \cdots]\mathfrak{S}, \qquad (\cdots \mathfrak{S}' \cdots) = (\cdots \phi \cdots)\mathfrak{S}, \qquad \phi \in \Phi, \quad \mathfrak{S}' = \phi\mathfrak{S}.$

The lattices $\mathfrak{S}'$ thus form a lattice simply isomorphic with the lattice $\Sigma$ of
closure operators. We shall return to these operations at the close of the next
section.

5. Consider an operator $\phi$ belonging to a set consisting of two elements $I$
and $T$ of $\mathfrak{S}$. It follows from the previous theorems that $\phi$ is characterized by

(5.1) $\phi A = I$ if $T \not\supset A$, $\quad \phi A = T$ if $T \supset A$, $\quad A$ any element of $\mathfrak{S}$.

We call $\phi$ the two-valued operator belonging to $T$. Since $\phi\mathfrak{S}$ is a sub-lattice
of $\mathfrak{S}$, $\phi$ is linear with respect to union, as is directly evident from (5.1). It is
easy to prove

Theorem 5.1. The ideal operator $\phi$ belonging to any set $\mathfrak{S}'$ of elements of $\mathfrak{S}$
which is closed under cross-cut and contains $I$ is the operator cross-cut of all the
two-valued operators belonging to elements of $\mathfrak{S}'$.

If $\mathfrak{T}$ is any set of elements $T$ of $\mathfrak{S}$ containing $I$, we obtain a lattice $\mathfrak{S}'$ within
$\mathfrak{S}$ containing $\mathfrak{T}$ by adjoining to $\mathfrak{T}$ the cross-cuts of all sets of its elements. $\mathfrak{S}'$
is evidently the smallest such lattice containing $\mathfrak{T}$. We call $\mathfrak{S}'$ the imbedding
lattice of $\mathfrak{T}$, and its corresponding closure operator the “imbedding operator”
of $\mathfrak{T}$. We shall use the letter $\theta$ to denote an imbedding operator.

Theorem 5.2. If $\theta$ is the imbedding operator of a set $\mathfrak{T}$ of elements of $\mathfrak{S}$ containing
$I$, then the value of $\theta$ for any element $A$ of $\mathfrak{S}$ is given by the formula

(5.2) $\theta A = [\cdots T \cdots], \quad T \supset A, \quad T \in \mathfrak{T}.$

Proof. We have $\theta A = S'$ where $S'$ lies in $\mathfrak{S}' = \theta\mathfrak{S}$. Hence $S'$ is the cross-cut
of a certain set of the $T$ in $\mathfrak{T}$. Now since $\theta A \supset A$, every such $T$ divides $A$.
But since $\theta T = T$ if $T \in \mathfrak{T}$, $T \supset A$ implies that $T \supset \theta A = S'$. Hence (5.2)
follows.
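Formula (5.2) says that the imbedding operator can be computed from the generating set alone, without first forming the imbedding lattice. The sketch below is an illustration in the lattice of subsets of {0, 1, 2, 3}; the generating set and the function names are hypothetical.

```python
from itertools import combinations

GROUND = frozenset(range(4))
SUBSETS = [frozenset(c) for r in range(len(GROUND) + 1)
           for c in combinations(sorted(GROUND), r)]

def imbedding_lattice(generators):
    """Adjoin to the generators the cross-cuts of all non-empty sets of them."""
    gens = list(generators)
    lattice = set()
    for r in range(1, len(gens) + 1):
        for group in combinations(gens, r):
            lattice.add(frozenset.intersection(*group))
    return lattice

def closure_from(collection):
    """Operator sending A to the cross-cut of the members containing A."""
    def theta(a):
        result = GROUND
        for t in collection:
            if t >= a:
                result = result & t
        return result
    return theta

# A hypothetical generating set containing the unit GROUND.
GENERATORS = [GROUND, frozenset({0, 1, 2}), frozenset({1, 2, 3}), frozenset({0, 3})]
```

The operator built from `GENERATORS` by formula (5.2) agrees on every subset with the operator built from the full imbedding lattice, although the lattice contains further elements such as {1, 2} that are not generators.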

Theorem 5.3. Let $\phi$ and $\psi$ be any two closure operators of $\mathfrak{S}$. Then $\phi \supset \psi$
if and only if the lattice belonging to $\psi$ contains the lattice belonging to $\phi$ in the
set-theoretic sense.

Proof. Assume that $\phi \supset \psi$ and let $A \in \phi\mathfrak{S}$. Then $\phi A = A$. By I 1, $\phi A \supset
\psi A$. Hence $A \supset \psi A$. Therefore by I 2, $A = \psi A$ or $A \in \psi\mathfrak{S}$. Since $\phi$ and $\psi$ are
the imbedding operators of their respective lattices, the converse follows from
Theorem 5.2.

The following corollaries are immediate: 

Corollary 5.31. Let θ be the imbedding operator belonging to any set X of
elements of © containing I, and let φ be any closure operator such that φ leaves
every element of X invariant. Then θ divides φ.





Corollary 5.32. The imbedding operator of any set is the union of all closure 
operators which leave every element of the set invariant. 

Corollary 5.33. The union operation on the lattices which belong to closure
operators defined by (4.3) is the operation of taking the set-theoretic cross-cut of
their elements.

It is this correspondence between operator union and set-theoretic cross-cut 
which makes the ideal operators of importance in imbedding problems. 

III. Applications to Imbedding Problems 

6. Let I be a set of elements a, b, ··· semi-ordered with respect to a division
relation x | y and containing a unit element i dividing every other element.
The following problem has been considered by Mac Neille (Mac Neille 1): To
construct a closed lattice ©' such that: (i) ©' contains a subset of elements
A', B', ··· which may be set in a one-to-one correspondence x ↔ X' with a, b, ···;
(ii) If a ↔ A' and b ↔ B', then

(6.1) a | b in I implies A' ⊃ B' in ©',

(6.2) A' ⊃ B' in ©' implies a | b in I.

We call such a construction an “isomorphic imbedding” of the set I. If we
do not require (6.2), we speak of a “homomorphic imbedding” of I.

We shall solve these problems by determining suitable ideal operators in the
lattice 𝔅 (Boolean algebra) of all subsets of I. In other words, we shall
determine all ideal operators φ of 𝔅 such that ©' = φ𝔅 will be a suitable lattice.

Consider first the condition (6.1). Let T = (t) be a subset of I consisting of
the single element t. Since T' = φT ⊃ T, we must have t ∈ φT. But by (6.1),
if t | y in I, φT ⊃ φ(y). Hence if t | y, y ∈ φT. Thus (6.1) implies that φT
must contain all elements y of I such that t | y.

For a homomorphic imbedding, no further conditions are imposed on the
values of φT. But if the imbedding is isomorphic and φT ⊃ φ(x), then (6.2)
requires that t | x. Hence φT must consist only of elements x of I such that t | x.

We let X denote the set of all T' = φ(t), t ∈ I for any ideal operator φ. We
call the elements of X the principal ideals of I.

It is evident from the preceding section that any ideal operator of 𝔅 leaving
every element of X invariant will solve our initial imbedding problem, and that
the simplest of these operators is the imbedding operator of the set X itself;
for its lattice θ𝔅 is the smallest lattice in the set-theoretic sense in which the
imbedding can be made in 𝔅. The isomorphism between I and X with respect
to division shows that this same minimal property of θ𝔅 will apply to any
isomorphic imbedding of I in any closed lattice whatever; within the lattice
©' there must lie a lattice simply isomorphic to θ𝔅. θ𝔅 is the lattice defined in
Mac Neille 1 by “Dedekind cuts.”

A similar situation occurs for homomorphic imbeddings. For a homomorphic
imbedding, the “principal ideals” A', B', ··· which make up the set X are not
uniquely determined by the corresponding elements a, b, ··· of I; for if a ↔ A',
A' may contain elements of I not divisible by a. But once the set X of principal
ideals is chosen, the imbedding operator of I gives the smallest lattice in which
the particular homomorphic imbedding can be performed.


7. If A is any subset of I, let A* be the subset of all elements l such that
l | k for every k in A, and let A' be the subset of all elements a such that l | a
for every l in A*. Then the operator

(7.1) A' = θA

is the isomorphic imbedding operator of the set I discussed above. This result
follows easily from Theorem 5.3. For a detailed discussion, the reader may
consult Ward-Dilworth 2 or Clifford 1, to whom this definition of θ is originally
due.⁹
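The construction (7.1) can be tried out on a small example. The sketch below assumes that I is a finite set of positive integers ordered by ordinary divisibility, with unit element 1; the helper names (`common_divisors`, `common_multiples`, `theta`) are hypothetical and introduced only for illustration.

```python
def divides(x, y):
    """x | y for positive integers."""
    return y % x == 0


def common_divisors(I, A):
    """All l in I such that l | k for every k in A."""
    return frozenset(l for l in I if all(divides(l, k) for k in A))


def common_multiples(I, L):
    """All a in I such that l | a for every l in L."""
    return frozenset(a for a in I if all(divides(l, a) for l in L))


def theta(I, A):
    """The operator of (7.1): common multiples of the common divisors of A."""
    return common_multiples(I, common_divisors(I, A))


# Hypothetical example: the divisors of 12, with unit element 1.
I = frozenset({1, 2, 3, 4, 6, 12})
print(sorted(theta(I, {4, 6})))   # elements divisible by every common divisor of 4 and 6
print(sorted(theta(I, {2})))      # the principal ideal of 2
```

Here the closure of a single element t is the set of all x with t | x, which is the principal ideal required by (6.1) and (6.2).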

California Institute of Technology. 


References 


Garrett Birkhoff 
A. H. Clifford 

C. Kuratowski 
H. M. Mac Neille . 

O. Ore 

M. Ward and R. P. Dilworth 
M. Ward 


1 Duke Math. Journal, 3 (1937) pp. 443-454.

1 Bull. Am. Math. Soc. 40 (1934) pp. 326-330. 

2 These Annals (2) 39 (1938) pp. 594-610. 

1 Topologie 1, Warsaw (1933). 

1 Trans. Am. Math. Soc. 42 (1937) pp. 416-460. 

1 These Annals, (2) 36 (1935) pp. 406-437.

1 Trans. Am. Math. Soc. vol. 45 (1939), pp. 335-354. 

2 Unpublished. 


9 The identity of this operator and Mac Neille’s operator was pointed out to me by Dr.
A. H. Clifford in a letter. The definition (7.1) is used in Ward-Dilworth 2 to imbed any
ovum (semi-group) in a residuated lattice of ideals.



Annals of Mathematics
Vol. 43, No. 2, April, 1942


UNSTABLE MINIMAL SURFACES WITH SEVERAL BOUNDARIES 1 

By Max Shiffman 

(Received August 21, 1940; revised October 1, 1941) 


Table of Contents 

Introduction. 197 

Part I. Degenerate Domains. The Spaces ℜ_k and 𝔓.
The Functional E[𝔵]

1. The Domains of Representation. 198

2. A Metric for ℜ_2. 201

3. The Metric Space ℜ_k. 204

4. The Connectivity Numbers of ℜ_k. 206

5. The Space 𝔓. 207

6. The Continuity of the Functional E[𝔵]. 208

Part II. Application to Unstable Minimal Surfaces 

7. Linear Paths of Surfaces. 211 

8. The Reducibility Condition. 213 

9. The Case of Polygonal Boundaries. 215 

10. The Variational Condition. 217 

11. The Main Theorems. Remarks. 221 

Bibliography . 222 


Introduction 

In recent years, the question of unstable minimal surfaces bounded by a
single contour has been attacked with success.² It has been shown that the
Morse critical point theory applies to minimal surfaces bounded by the contour Γ
(provided Γ satisfies certain restrictions). The contributions to the theory
have been made by the author [11], [12], by Morse and Tompkins [7], [8] and
by Courant [3].

In the present paper we shall show how to extend the theory to cover minimal
surfaces of genus zero bounded by an arbitrary number k of non-intersecting
contours Γ_1, Γ_2, ···, Γ_k.³ In a general but not precise way, the main result
may be stated thus: if the Morse theory can be shown to apply to each of the
contours Γ_1, ···, Γ_k individually, it applies to all the contours Γ_1, ···, Γ_k
together.

1 Presented to the American Mathematical Society, April 26, 1940. 

2 Concerning the Plateau problem and minimal surfaces in general, see [1], [2], [4], [5],
[10]. Numbers in brackets refer to the bibliography at the end.

3 In the meantime a paper by Morse and Tompkins [9] has appeared which considers the
case of two boundaries.




A first step in the development of the theory is the following (see [12], [7]).
Let 𝔓 denote the space of admissible potential surfaces 𝔵(u, v) with a finite
Dirichlet integral D[𝔵], where D[𝔵] = ½ ∬ (𝔵_u² + 𝔵_v²) du dv; and let 𝔓_N denote
the subspace of 𝔓 for which D[𝔵] ≤ N. The first step, used even in the
minimum theory, is that 𝔓_N be compact.
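To make the Dirichlet integral concrete, here is a minimal numerical sketch, assuming the parameter domain is the unit disc and that the partial derivatives 𝔵_u, 𝔵_v are available in closed form. The function names and the sample surface (u² − v², 2uv, 0) are hypothetical choices for illustration; the quadrature is a plain midpoint rule in polar coordinates, not a method taken from the paper.

```python
import math


def dirichlet_integral_unit_disc(partials, n_r=200, n_theta=200):
    """Approximate D[x] = (1/2) * integral of (|x_u|^2 + |x_v|^2) du dv
    over the unit disc by a midpoint rule in polar coordinates."""
    total = 0.0
    dr = 1.0 / n_r
    dtheta = 2.0 * math.pi / n_theta
    for i in range(n_r):
        r = (i + 0.5) * dr
        for j in range(n_theta):
            theta = (j + 0.5) * dtheta
            u, v = r * math.cos(theta), r * math.sin(theta)
            x_u, x_v = partials(u, v)
            integrand = sum(c * c for c in x_u) + sum(c * c for c in x_v)
            total += integrand * r * dr * dtheta  # area element r dr dtheta
    return 0.5 * total


def example_partials(u, v):
    """Partial derivatives of the harmonic vector x(u, v) = (u^2 - v^2, 2uv, 0)."""
    return (2 * u, 2 * v, 0.0), (-2 * v, 2 * u, 0.0)


D = dirichlet_integral_unit_disc(example_partials)
print(D, 2 * math.pi)  # the two numbers should be close
```

For this sample surface the integrand equals 8r², so the exact value is 2π, and membership in 𝔓_N amounts to 2π ≤ N.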

For the case of several boundaries, this requirement is not satisfied for ordinary
potential surfaces. To retain the compactness of 𝔓_N it is necessary to introduce
degenerate potential surfaces, consisting of several pieces, and their
corresponding degenerate parameter domains. These degenerate domains and
surfaces have been so selected that the discontinuities of the Dirichlet integral can
be reduced to its discontinuities for single boundaries. More precisely, suppose
that 𝔵 is an admissible potential surface defined over a domain G having k
boundaries. Let G_μ be the region of the plane bounded by the μ-th boundary
curve and intersecting G, and 𝔵_μ the potential surface defined over G_μ with
boundary values identical to 𝔵 on G_μ. Set

E[𝔵] = D_G[𝔵] − Σ_{μ=1}^{k} D_{G_μ}[𝔵_μ].

Then the degenerate domains and surfaces introduced here are such that E[𝔵]
is a continuous functional.

These two theorems, the compactness of 𝔓_N and the continuity of E[𝔵], form
the backbone of this paper. In Part II, the continuity of E[𝔵] yields very
simply the reducibility property of D[𝔵] provided that reducibility is known
for simple contours.

For two boundaries, the theory can be developed much more simply. But
the decisive step is the case k > 2. 

Part I. Degenerate Domains. The Spaces ℜ_k and 𝔓.
The Functional E[𝔵]

1. The Domains of Representation 

Let Γ_1, Γ_2, ···, Γ_k be k non-intersecting closed Jordan curves in space. On
each Γ_μ select three distinct points P_μ, Q_μ, R_μ. We shall consider potential
surfaces 𝔵(u, v), defined over certain normal domains of the w = u + iv plane,
which map the boundaries of these domains continuously and monotonically
into Γ_1, Γ_2, ···, Γ_k.

Choose, for the ordinary (i.e., non-degenerate) domains of representation in
the w-plane, circular regions G consisting of an annular ring, unit circle as outer
boundary, with k − 2 additional circular holes. Let C_μ denote the circular
boundary of G mapped by 𝔵(u, v) into Γ_μ, and let p_μ, q_μ, r_μ be three distinct
points of C_μ mapped by 𝔵(u, v) into P_μ, Q_μ, R_μ respectively. These specified
points p_μ, q_μ, r_μ, μ = 1, 2, ···, k, will be considered as part of the domain of
representation G. In considering the class of admissible potential surfaces
𝔵(u, v), not only do the boundary values of 𝔵(u, v) vary but also the domains of
representation (including the points p_μ, q_μ, r_μ).

It is possible to normalize G, by performing linear transformations and
reflections, in such a way that the following conditions are satisfied:

(a) C_1 is the unit circle;

(b) C_2 is concentric with the unit circle;

(c) p_1, q_1, r_1 lie in counterclockwise order around C_1; and

(d) for k = 2, p_1 is at the point w = 1 of the w-plane, while for k > 2 the
center of C_3 is on the positive real axis.

We shall suppose throughout that G has been normalized in this way. 

It is necessary to define the limit of a set of domains, and for compactness to
introduce degenerate domains. Our procedure is motivated by considering a
sequence 𝔵^n(u, v) of admissible potential surfaces with uniformly bounded
Dirichlet functional D[𝔵^n] ≤ N. Let G^n be the ordinary domains over which
the 𝔵^n are defined. Then obtain a limit domain G^∞ and a limit potential surface
𝔵^∞ such that all pieces which contribute a non-vanishing term to the Dirichlet
integral are retained.

The only possibilities for the degeneracy of the sequence 𝔵^n are:⁴ the circles
of G^n degenerate as n → ∞; and the boundary values of 𝔵^n are not
equicontinuous. Now, the non-equicontinuity of the boundary values of 𝔵^n on C_μ is
equivalent to the assertion that the minimum distance between the three points
p_μ^n, q_μ^n, r_μ^n has zero as a limiting value as n → ∞. Since p_μ, q_μ, r_μ are
considered as part of the domain G, all degeneracies have been transferred to the
domain. This limits the discussion of the sequence 𝔵^n, so far as degeneration
is concerned, to that of the domains G^n.

Accordingly, let G^n be a sequence of ordinary domains, and (by choosing a
subsequence if necessary) suppose that p_μ^n, q_μ^n, r_μ^n converge to definite points
of the w-plane as n → ∞. The sequence has no limit domain if any pair of
circles with radii above a positive bound approach each other as n → ∞ (this
would contradict D[𝔵^n] ≤ N), and we suppose that this is not the case. The
G^n's are said to tend to degeneracy if the 3k limit points of p_μ^n, q_μ^n, r_μ^n as n → ∞
are not all distinct.

If the domains G^n do not tend to degeneracy as n → ∞, the limit domain G^∞
is immediate: G^∞ is the circular domain whose points p_μ^∞, q_μ^∞, r_μ^∞ and circles C_μ^∞
are the limits of those of G^n.

If the sequence G^n does tend to degeneracy, one or several of the three
following types of degeneration must occur (for convenience a circle of G^n with
radius approaching 0 as n → ∞ is called a small circle; one whose radius does
not approach 0, a large circle): (a) One or more small circles tend to a point A
away from all other circles; (b) One or more small circles and a large circle C_ν^n
approach a point A, while at most one of the points p_ν^n, q_ν^n, r_ν^n on C_ν^n approaches
A as n → ∞; (c) At least two of the points p_ν^n, q_ν^n, r_ν^n on a large circle C_ν^n
approach a point A. In cases (b) and (c) let A^n be a point on the large circle
C_ν^n approaching A; for (a), let A^n be A.

4 For all the properties of the sequence 𝔵^n used in the next two pages, see [1].

In any of these three situations, describe in G^n a circle or arc of circle K^n
with A^n as center and with the radius (ρ^n)^{1/2}, where ρ^n is the maximum distance
from A^n to the small circles and points p_ν^n, q_ν^n, r_ν^n which may be approaching A.
(It can be shown that the oscillation of 𝔵^n on K^n tends to 0.) K^n splits G^n
into two regions G_1^n and G_2^n, large and small respectively. Also, in cases (b)
and (c), K^n divides C_ν^n into two arcs α_ν^n and β_ν^n, where α_ν^n contains at least two
of the points p_ν^n, q_ν^n, r_ν^n and β_ν^n at most one of these points. The arc α_ν^n is to
be counted in the limit as the circle C_ν; and β_ν^n in the limit as coordinated to a
single point of C_ν (since the oscillation of 𝔵^n on β_ν^n approaches 0 as n → ∞).
Separate G_1^n and G_2^n and normalize them both by performing linear
transformations and inversions. In this normalization, disregard the K^n; temporarily,
G_1^n and G_2^n are considered as bounded by circles.

Continue as above with the regions G_1^n and G_2^n separately, but with the
following provisos: K^n is to be disregarded, and β_ν^n is to be disregarded if it
approaches a point away from all other circles. One finally obtains the
decomposition of G^n (a subsequence of the original G^n's) into several regions H_1^n,
H_2^n, ···, H_m^n each of which degenerates no further. A passage to the limit
yields a degenerate domain G^∞ consisting of several distinct circular regions
H_1, H_2, ···, H_m; G^∞ is called the limit of this final subsequence G^n. (A
subsequence of the 𝔵^n can then be chosen to converge uniformly to a limit potential
surface 𝔵^∞ defined over the various regions which compose G^∞.)

We are now in a position to completely define degenerate domains. A
degenerate domain G consists of several distinct circular regions, each supposed
normalized. In all the regions together there are exactly k circles of type C_μ,
μ = 1, 2, ···, k, called boundary circles, and on each C_μ three distinct points
p_μ, q_μ, r_μ; there may be additional circles, denoted by c_μ, c_μ', ··· and called
point circles, which are coordinated to single points on C_μ likewise marked
c_μ, c_μ', ···. (The limit potential surface 𝔵^∞ would take constant values on
each point circle c_μ equal to its value at the corresponding point c_μ on C_μ.) The
domain is of genus zero, i.e., if the point circles c_μ, c_μ', ··· are attached to the
corresponding points c_μ, c_μ', ···, then every closed curve disconnects the
resulting surface. Finally there is no region with a single boundary and that
boundary a point circle c_μ (in such a region 𝔵(u, v) would be identically constant).

A limit of a sequence of degenerate domains is obtained as previously,
considering each region separately and taking the circles and points c_μ, c_μ', ···
into account.

Finally, a domain G is a limit of a set X of domains, degenerate or ordinary,
if X contains a sequence having G as a limit in the above sense. The totality
of domains, ordinary and degenerate, together with this definition of limit will
be denoted by ℜ_k. ℜ_k seems to be the natural totality of domains of
representation for potential functions.

Figures 1 and 2 illustrate the various kinds of domains with two boundaries. 





The degenerate domain in fig. 2(a) is the limit of a sequence of ordinary domains
(fig. 1) whose boundary values are not equicontinuous on C_2; fig. 2(b) is the
limit of a sequence of ordinary domains whose boundary values are not
equicontinuous on C_1; fig. 2(c), whose boundary values are not equicontinuous on both
C_1 and C_2; and fig. 2(d), for which the radius of C_2 approaches zero. For more
than two boundaries the degenerate domains become much more numerous and
complicated.

It is necessary to know that ℜ_k is a topologic space, and more, that it is metric.
This will be established by imbedding ℜ_k into a Euclidean space of sufficiently
high dimension. We begin with the case of two boundaries.


[Figure 1. Ordinary domains in ℜ_2. Labels in the figure: a e^{iα} = (∞, p_1, q_1, r_1);
b e^{iβ} = (0, p_2, q_2, r_2); ρ e^{iφ} = (∞, 0, p_1, p_2).]

2. A Metric for ℜ_2

Consider first the ordinary domains in ℜ_2 (the case of two boundaries) as in
fig. 1. The domains depend on 6 variable parameters which we will choose
below so as to vary continuously with the domain. The principal difficulty in
this choice is caused by the degenerate domains. In the passage to a limit from
a sequence of ordinary domains to a degenerate domain, many linear
transformations and reflections were performed; and the degenerate domains may possibly
contain points c_μ corresponding to point circles c_μ. The parameters must be
chosen to take these two possibilities into account.

The following six parameters a, α, b, β, ρ, φ are selected (angles are measured
in the interval from −π to π, excluding −π):

        (∞, p_1, q_1, r_1) = a e^{iα}      (0 < a < ∞, 0 < α < π),
(1)     (0, p_2, q_2, r_2) = b e^{iβ}      (0 < b < ∞, β ≠ 0, π),
        (∞, 0, p_1, p_2) = ρ e^{iφ}        (0 < ρ < 1),

where (w_1, w_2, w_3, w_4) means the cross ratio

        (w_1 − w_3)(w_2 − w_4) / [(w_1 − w_4)(w_2 − w_3)].

Cross ratio is chosen so as to obtain invariance under linear transformations,
and the points ∞, 0 correspond to points of type c_1, c_2 in degenerate domains.
The precise reason for the choice of the points ∞, 0 in the cross ratio expressions
is made apparent in the proof of Lemma 1 below, part (b).

[Figure 2. Degenerate domains in ℜ_2, panels (a) to (d). Labels in the figure:
a e^{iα} = (c_1, p_1, q_1, r_1); b e^{iβ} = (c_2, p_2, q_2, r_2); ρ as in diagram.]
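The invariance that motivates the use of cross ratios in (1) can be checked numerically. The sketch below assumes that "linear transformation" means a linear fractional (Möbius) map w → (aw + b)/(cw + d); the function names and the sample coefficients and points are hypothetical and serve only as an illustration.

```python
def cross_ratio(w1, w2, w3, w4):
    """(w1, w2, w3, w4) = (w1 - w3)(w2 - w4) / ((w1 - w4)(w2 - w3))."""
    return (w1 - w3) * (w2 - w4) / ((w1 - w4) * (w2 - w3))


def mobius(a, b, c, d):
    """Return the linear fractional map w -> (a w + b) / (c w + d)."""
    if a * d - b * c == 0:
        raise ValueError("degenerate transformation")
    return lambda w: (a * w + b) / (c * w + d)


# Hypothetical sample data: four distinct points and one transformation.
points = (0.3 + 0.1j, 1.0 + 0.0j, -0.5 + 0.8j, 0.2 - 0.9j)
T = mobius(2 + 1j, 1.0, 0.5j, 3.0)

before = cross_ratio(*points)
after = cross_ratio(*(T(w) for w in points))
print(abs(before - after))  # should be at rounding-error level
```

When one argument is ∞ the corresponding factors are dropped, which gives (∞, p, q, r) = (p − r)/(p − q).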





The geometric meanings of α, β, ρ, φ are indicated in fig. 1, while
a = |p_1 r_1| / |p_1 q_1| and b = |p_2 r_2| / |p_2 q_2| (ratios of chord lengths). It is
easily established geometrically that the parameters a, α, b, β, ρ, φ subject to
the inequalities in (1) (these inequalities are necessary for the
non-degeneracy and normalization of the domain) uniquely determine the
domain.
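A minimal sketch of how the six parameters of (1) could be computed for an ordinary domain with two boundaries, assuming C_1 is the unit circle and C_2 is a concentric circle, with the marked points given as complex numbers. The function name `two_boundary_parameters` and the sample points are hypothetical; the cross ratios with ∞ and 0 are written out by hand from the definition.

```python
import cmath
import math


def two_boundary_parameters(p1, q1, r1, p2, q2, r2):
    """Return (a, alpha, b, beta, rho, phi) as defined by equations (1)."""
    A = (p1 - r1) / (p1 - q1)              # (inf, p1, q1, r1)
    B = (q2 / r2) * (p2 - r2) / (p2 - q2)  # (0, p2, q2, r2)
    R = p2 / p1                            # (inf, 0, p1, p2)
    # cmath.phase returns angles in (-pi, pi], matching the stated convention.
    return (abs(A), cmath.phase(A), abs(B), cmath.phase(B), abs(R), cmath.phase(R))


def on_circle(radius, angle):
    """Point of the circle |w| = radius at the given polar angle."""
    return radius * cmath.exp(1j * angle)


# Hypothetical sample domain: p1 = 1 as in normalization (d), C_2 of radius 1/2.
p1, q1, r1 = on_circle(1, 0), on_circle(1, 2 * math.pi / 3), on_circle(1, 4 * math.pi / 3)
p2, q2, r2 = on_circle(0.5, math.pi / 4), on_circle(0.5, 3 * math.pi / 4), on_circle(0.5, 5 * math.pi / 4)

a, alpha, b, beta, rho, phi = two_boundary_parameters(p1, q1, r1, p2, q2, r2)
print(a, alpha, b, beta, rho, phi)
```

In this example a and b come out as ratios of chord lengths, ρ as the radius of C_2, and φ as the angle from p_1 to p_2, in line with the geometric meanings stated above.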

For degenerate domains, the parameters a, α, b, β, ρ, φ are defined as in fig. 2.
The undefined parameters may have any value (e.g., in fig. 2(a), φ may have
any value). For a degenerate domain, note that either a e^{iα} is real (so that
α = 0 or π, or a = 0 or ∞), or b e^{iβ} is real, or ρ = 0.

A remaining difficulty is the indeterminateness of some of the parameters in
the case of degenerate domains. A method for eliminating this difficulty is to
introduce a set of new parameters which, when defined in terms of a, α, b, β, ρ, φ,
contain factors vanishing for degenerate domains. This of course necessitates
introducing more than 6 new parameters. One such possibility is the set of
nine parameters x_ν, ν = 1, ···, 9, interpreted as the coordinates of a point x
in Euclidean space 𝔈^9, defined by


( 2 ) 


1 - p 


X 2 = p 


1 + a’ 

* + “■ ■' r-h? (♦"" + o ™«nib* 

_ T 

xg + ix 9 = p sin a sin 0 * 


1 + a 2 1 + b 2 



If a = ∞, a/(1 + a) and a/(1 + a²) mean 1 and 0 respectively; and similarly
for b. The complicated expression for x_6 + ix_7 is necessary because reflections
may occur in passing from a sequence of ordinary domains to a degenerate
domain; this is made clear in the proof of Lemma 1, part (b).

A simple examination shows that a unique point in 𝔈^9 corresponds to a single
domain in ℜ_2, degenerate or not; and to different domains in ℜ_2 correspond
different points in 𝔈^9. The set of points in 𝔈^9 corresponding to the whole of ℜ_2
is easily shown to form a closed 6-dimensional set extending to infinity (in the
x_1 direction).

Lemma 1. The one-to-one mapping of ℜ_2 onto a point set of 𝔈^9, given by (1),
fig. 2, and (2), is a topologic mapping.

Proof. It is required to show that the one-to-one correspondence preserves
the limit relation. Let G^n be a sequence of domains of ℜ_2 and x^n the
corresponding points in 𝔈^9. Since ρ^n → 1 implies x_1^n → ∞, and conversely, the
sequence G^n has no limit domain if and only if x^n has no limit point. It remains
to establish that x^n → x whenever G^n → G, where x is the point in 𝔈^9
corresponding to the domain G. This is trivial if the domains G^n do not degenerate
as n → ∞. If the sequence G^n tends to degeneracy, the possibilities are
(see §1):

(a) ρ^n → 0. By §1, the limit domain G is given in fig. 2(d), and x is therefore
the origin of 𝔈^9. The equations (2) show that x^n → x.

(b) Two of the points p_1^n, q_1^n, r_1^n converge to the same point A of the unit
circle, while p_2^n, q_2^n, r_2^n converge to distinct points. Let B be the point on the
unit circle diametrically opposite to A. The limit domain G is obtained by
describing K^n, separating G_1^n and G_2^n, and normalizing each of these regions.
Hence G is of the type shown in fig. 2(b). The normalization of G_2^n is
performed by inverting with respect to K^n, transforming and inverting so that
p_1^n, q_1^n, r_1^n go into the specified points θ_1, θ_2, θ_3. The point c_1 in H_2 is the limit
of the resulting point B in G_2^n. Since cross ratio is invariant under linear
transformations,

(B, p_1^n, q_1^n, r_1^n) → (c_1, p_1, q_1, r_1).

By noting that

(B, p_1^n, q_1^n, r_1^n) = [(B − q_1^n)/(B − r_1^n)] · [(p_1^n − r_1^n)/(p_1^n − q_1^n)]

and

(∞, p_1^n, q_1^n, r_1^n) = (p_1^n − r_1^n)/(p_1^n − q_1^n),

it is easy to show that (B, p_1^n, q_1^n, r_1^n) and (∞, p_1^n, q_1^n, r_1^n) approach the same
value. Thus,

(∞, p_1^n, q_1^n, r_1^n) → (c_1, p_1, q_1, r_1), or a^n e^{iα^n} → a e^{iα}

(where α = 0 or π, or a = 0 or ∞).

G_1^n is normalized by inverting with respect to C_2, expanding, rotating and
possibly reflecting. Hence (0, p_2^n, q_2^n, r_2^n) approaches either (∞, p_2, q_2, r_2) or
its conjugate complex, so that b^n e^{iβ^n} → b e^{±iβ}. Since clearly ρ^n → ρ, an
examination of equations (2) shows that x^n → x.

(c) A similar argument applies if two of p_2^n, q_2^n, r_2^n approach the same point
A' on C_2, while p_1^n, q_1^n, r_1^n approach distinct points. The limit domain G is of
the type in fig. 2(a).

(d) Similarly, if two of p_1^n, q_1^n, r_1^n approach A, and two of p_2^n, q_2^n, r_2^n
approach A'. The limit domain G is given in fig. 2(c).

Like results are obtained if the G^n are degenerate.

In all cases therefore, G^n → G implies that x^n → x. The lemma is established.

Thus, ℜ_2 is a metric space. The metric could be defined as follows: the
distance between two domains of ℜ_2 is the Euclidean distance between their
corresponding points in 𝔈^9.


3. The Metric Space ℜ_k

We return to the case of k boundaries and construct a metric for ℜ_k. The
parameters which will be used for any domain G of ℜ_k are the 9 parameters of
the preceding section for each pair of circular boundaries of G. Accordingly,
suppose first that G is an ordinary domain of ℜ_k, and C_μ, C_ν (μ < ν) any pair
of boundaries of G with the specified points p_μ, q_μ, r_μ, p_ν, q_ν, r_ν. Nine
coordinates x_1^{μν}, ···, x_9^{μν} may be associated to this pair of boundaries. They are
determined by equations (2) of §2, where the parameters a, α, b, β, ρ, φ are
given by

        a e^{iεα} = (s_μ, p_μ, q_μ, r_μ),
(3)     b e^{iεβ} = (s_ν, p_ν, q_ν, r_ν),
        ρ e^{iεφ} = (s_μ, s_ν, p_μ, p_ν).

Here s_μ and s_ν are the two points which are mutually inverse with respect to
both circles C_μ and C_ν, and s_μ is on the side of C_μ opposite to C_ν; also 0 < α < π
and ε = ±1. These parameters and coordinates are invariant under linear
transformations and reflections. They are the invariant generalizations of the
equations (1) of §2.

When G is a degenerate domain, the six parameters a, α, b, β, ρ, φ for the pair
C_μ, C_ν (μ < ν) of boundaries are defined as invariant generalizations of figs. 1,
2(a)-2(d). In this generalization, s_μ and s_ν replace ∞ or 0, and ε times an angle
(where ε = ±1) replaces the angle. Thus,

1) if C_μ and C_ν occur in the same region, the parameters are given by
equations (3) (see fig. 1);

2) if C_μ and a point circle c_ν are in the same region,

        a e^{iεα} = (s_μ, p_μ, q_μ, r_μ),
        b e^{iεβ} = (c_ν, p_ν, q_ν, r_ν),          (see fig. 2(a))
        ρ = |(s_μ, s_ν, p_μ, p'_ν)|,

where p'_ν is any point on the circle c_ν;

3) if a point circle c_μ and a boundary circle C_ν are in the same region, the
parameters are the invariant generalizations of fig. 2(b). (Cf. 2) above.)

4) if point circles c_μ and c_ν are in the same region, the parameters are the
generalizations of fig. 2(c). (Cf. 2) above.)

5) if neither C_μ nor any c_μ occurs in the same region as C_ν or any c_ν, then
ρ = 0 (see fig. 2(d)).

Since the domain is of genus zero exactly one of the above occurs for a given
pair μ, ν.

The parameters a, α, b, β, ρ, φ for the pair C_μ, C_ν of boundaries of G determine
the 9 coordinates x_1^{μν}, ···, x_9^{μν} by equations (2). We therefore have, for any
domain G in ℜ_k, a unique point x in the 9k(k − 1)/2 dimensional Euclidean space 𝔈
with the coordinates x_1^{μν}, ···, x_9^{μν} for all μ, ν = 1, 2, ···, k, and μ < ν. A simple
discussion shows that to different domains in ℜ_k correspond different points in 𝔈.

Lemma 2. The one-to-one correspondence given above of ℜ_k onto a point set of 𝔈
is a topologic correspondence.

Proof. The lemma follows from Lemma 1 if one notes that essentially the
process in §1 treats two particular boundaries C_μ, C_ν exactly as they would be
treated if they were alone.





Thus, ℜ_k is a metric space. Furthermore, the discussion in §1 yields the
result that the set of points in 𝔈 corresponding to the whole of ℜ_k is a closed
set extending to infinity. The fact that the 6k − 6 dimensional open set (open
in ℜ_k) of all ordinary domains is everywhere dense in ℜ_k completes the proof of

Theorem 1. ℜ_k is a 6k − 6 dimensional metric space in which bounded sets are
compact.
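The two dimensions appearing above can be tabulated with a short sketch. The parameter count in `ordinary_domain_dimension` is a heuristic that is not taken from the text: it assumes 3 real parameters per circle (center and radius), 1 per marked point on its circle, and 6 real parameters removed by normalization with linear fractional transformations. The function names are hypothetical.

```python
def ordinary_domain_dimension(k):
    """Heuristic parameter count for ordinary domains with k boundary circles."""
    circles = 3 * k          # center (2 reals) and radius (1 real) per circle
    marked_points = 3 * k    # one angle for each of p, q, r on every circle
    normalization = 6        # real dimension of the linear fractional group
    return circles + marked_points - normalization


def embedding_dimension(k):
    """Dimension of the Euclidean space: 9 coordinates per pair of boundaries."""
    return 9 * k * (k - 1) // 2


for k in range(2, 6):
    print(k, ordinary_domain_dimension(k), embedding_dimension(k))
```

For k = 2 this gives 6 parameters inside a 9-dimensional space, matching §2; the gap between the two numbers grows quickly with k.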

4. The Connectivity Numbers of ℜ_k

The connectivity numbers of ℜ_k will be in the sense of Vietoris theory,
considering only cycles and homologies which lie in bounded subsets of ℜ_k. The
Vietoris character of the cycles will be used in an essential way in the proof of
the following theorem.

Theorem 2. The connectivity numbers R_0, R_1, ···, R_n, ···, of ℜ_k have the
values:

R_0 = 1, R_1 = R_2 = ··· = R_n = ··· = 0.

Proof. The essential idea of the proof is to deform the whole space ℜ_k into
a single point, namely the domain consisting of k unit circles (which is the most
degenerate domain).

(a) Denote the invariant ρ corresponding to the boundaries C_μ, C_ν (μ < ν)
by ρ_{μν}, and set σ = minimum_{μ,ν} ρ_{μν}. Designate the set of all domains of ℜ_k
for which σ ≤ δ by F_δ. We shall first deform ℜ_k into F_δ, where δ is any positive
number.

The deformed domain G_t, 0 ≤ t ≤ 1, will be obtained from the domain G
by contracting each circle, except the unit circle, by a certain factor. It is
important in this that all the G's be normalized the following way: in each region
of G the order for determining which circle shall be the unit circle, the
concentric circle, etc. is C_1 or c_1, C_2 or c_2, ···, C_k or c_k. If G belongs to F_δ, set
G_t = G. If the σ for G is > δ, let η (< 1) be that contraction of the circles of G
which takes G into a domain G_1 for which σ = δ; define G_t as the domain
obtained from G by contracting its circles by the factor (1 − t) + tη.
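The deformation can be sketched in the simplest case. The code below assumes k = 2 with an ordinary normalized domain, so that the only invariant is ρ (the radius of C_2) and σ = ρ; in that case contracting C_2 by the factor η = δ/ρ lands exactly on σ = δ. The function names are hypothetical and the general case of several circles is not covered.

```python
def contraction_factor(sigma, delta, t):
    """Factor (1 - t) + t*eta applied at time t, with eta chosen so that
    the fully contracted domain has sigma equal to delta."""
    if sigma <= delta:
        return 1.0          # G already lies in F_delta and is left fixed
    eta = delta / sigma     # assumption: sigma scales linearly with the contraction
    return (1.0 - t) + t * eta


def deformed_rho(rho, delta, t):
    """Radius of C_2 in the deformed two-boundary domain G_t."""
    return rho * contraction_factor(rho, delta, t)


for t in (0.0, 0.5, 1.0):
    print(t, deformed_rho(0.8, 0.1, t))
```

At t = 0 the domain is unchanged and at t = 1 it lies on the boundary σ = δ of F_δ, while domains already in F_δ stay fixed for all t.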

It is necessary to show that G_t is continuous in G and t, i.e., G^n_{t_n} → G_t whenever
G^n → G, t_n → t. This is trivial if σ^n ≤ δ; if σ^n > δ, it follows by noting that
G^n can degenerate further only by the coalescence of two of p_μ^n, q_μ^n, r_μ^n whereas
contraction to form G^n_{t_n} retains the relative position of these points.

One easily notices by consulting equations (2) that the above deformation
deforms any bounded set B of ℜ_k over a bounded set B' into F_δ; i.e., for a given
bounded set B there is another bounded set B' independent of δ such that B is
deformed within B' into F_δ.

(b) Let z^m be any Vietoris m-cycle on a bounded set B. Because of (a),
z^m ~ w_δ^m, where w_δ^m is a Vietoris m-cycle on B'·F_δ and the homology takes
place over B'. Since B'·F_δ is arbitrarily close, for δ sufficiently small, to the
closed compact set B'·F_0, it follows from a basic theorem for Vietoris cycles
that z^m ~ w_0^m, where w_0^m is a Vietoris cycle on B'·F_0.⁵

5 A proof of an essentially equivalent theorem is contained in p. 20-23, Theorem 5.1, 
of [6]. 





(c) F_0 consists of the domains G for which at least one of ρ_{μν} is zero; set σ' =
minimum of the remaining ρ_{μν}. Designate the set of all domains of F_0 for which
σ' ≤ δ by F_{0,δ}. By contracting circles, as in (a), F_0 can be deformed into F_{0,δ}
for any positive δ. As in (b), z^m ~ w_0^m ~ w_{00}^m where w_{00}^m is a Vietoris m-cycle
on B''·F_{0,0}.

Continue with F_{0,0} as with F_0, etc. One finally obtains z^m ~ w^m, where w^m
is a Vietoris m-cycle on the set F consisting of those domains G of ℜ_k for which
all the ρ_{μν} are zero. But F contains a single point, namely the domain
consisting of k unit circles, bounded by C_1, C_2, ···, C_k respectively, with the points
p_μ, q_μ, r_μ in the specified places θ_1, θ_2, θ_3 of C_μ. The theorem follows
immediately.


5. The Space 𝔓


We are now prepared to discuss the space 𝔓 of potential surfaces bounded by
Γ_1, ···, Γ_k. An element of 𝔓 is a potential surface 𝔵(u, v) defined over a
domain G of ℜ_k and satisfying the following conditions: 𝔵(u, v) maps each C_μ
continuously and monotonically onto Γ_μ, and p_μ, q_μ, r_μ into the prescribed
points P_μ, Q_μ, R_μ of Γ_μ; 𝔵(u, v) takes constant values at each point circle c_μ,
equal to its value at the corresponding point c_μ on C_μ; and D_G[𝔵] is finite, where
D_G[𝔵] is the sum of the Dirichlet integrals of 𝔵, ½ ∬ (𝔵_u² + 𝔵_v²) du dv, over the
various regions which compose G.

A surface 𝔵 in 𝔓 is determined by the domain G over which it is defined and
its boundary values on each of the k circles C_μ. Since the points p_μ, q_μ, r_μ
have already been considered part of the domain G, the boundary values of 𝔵
will be defined in the following way. Map the circle C_μ onto a unit circle by a
linear transformation or reflection so that p_μ, q_μ, r_μ go into θ_1, θ_2, θ_3 respectively.
The resulting values of 𝔵 on this circle will be called the boundary values of 𝔵
on C_μ and denoted by 𝔵_μ(θ). The distance between two elements 𝔵 and 𝔶 of 𝔓
can now be defined by

|𝔵 − 𝔶| = |G − H| + Σ_{μ=1}^{k} maximum_θ |𝔵_μ(θ) − 𝔶_μ(θ)|,

where G and H are the domains of 𝔵 and 𝔶, and |G − H| is the distance between
G, H in ℜ_k.

The subspace of all elements f of $ for which fi[j] ^ J\f will be denoted by 
. To avoid confusion with the case of a single boundary r, the latter space 
will be designated by $ r ; the complete notation for the present space s # is then 

It will be convenient to use the following notation. A potential surface in ^ 
is to be designated by % or jc(w, v), its boundary values in the above sense by 
$ M (0), and the potential surface defined over the unit circle with the boundary 
values £ m ( 0) by . The part of the plane interior or exterior to C M , according 
as & is interior or exterior to C M , is denoted by G M ; the potential surface defined 



208 


MAX SHIFFMAN 


over $G_\mu$, with the same values on $C_\mu$ as $\mathfrak{x}$, by $\bar{\mathfrak{x}}_\mu$ (thus $\bar{\mathfrak{x}}_\mu$ is the linear transform
of $\mathfrak{x}_\mu$ back to $G_\mu$).

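Theorem 3. $\mathfrak{P} = \mathfrak{R}_k \times \mathfrak{P}_{\Gamma_1} \times \mathfrak{P}_{\Gamma_2} \times \cdots \times \mathfrak{P}_{\Gamma_k}$, where $\cdot \times \cdot$ signifies the
topologic product.$^6$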

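Proof. Let $\mathfrak{x}$ be any potential surface with domain $G$ and boundary values
$\mathfrak{x}_\mu(\theta)$. Let $K_\mu$ be a circle in $G$ concentric to $C_\mu$ and near it; let $A_\mu$ be the annular
ring between $C_\mu$ and $K_\mu$; and set $G' = G - \sum_\mu A_\mu$. We have

$$D_G[\mathfrak{x}] = \sum_{\mu=1}^{k} D_{A_\mu}[\mathfrak{x}] + D_{G'}[\mathfrak{x}].$$

Obviously $D_{G'}[\mathfrak{x}] < \infty$, so that $D_G[\mathfrak{x}] < \infty$ if and only if $D_{A_\mu}[\mathfrak{x}] < \infty$ for every
$\mu$. In $A_\mu$, set $\mathfrak{x}' = \mathfrak{x} - \bar{\mathfrak{x}}_\mu$. Since $\mathfrak{x}'$ has bounded derivatives near $C_\mu$ and is
zero on $C_\mu$, $D_{A_\mu}[\mathfrak{x}'] < \infty$. Hence, by the triangle inequality, $D_{A_\mu}[\mathfrak{x}] < \infty$ if
and only if $D_{A_\mu}[\bar{\mathfrak{x}}_\mu] < \infty$; and $D_{A_\mu}[\bar{\mathfrak{x}}_\mu] < \infty$ if and only if $D_{G_\mu}[\bar{\mathfrak{x}}_\mu] < \infty$. The
theorem follows by noting that $D_{G_\mu}[\bar{\mathfrak{x}}_\mu] < \infty$ is equivalent to: $\mathfrak{x}_\mu$ lies in $\mathfrak{P}_{\Gamma_\mu}$.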

Theorem 4. $\mathfrak{P}^N$ is compact and closed for every $N$.

Proof. Let $\mathfrak{x}^n(u, v)$, with domains $G^n$, be a sequence of surfaces in $\mathfrak{P}^N$.
There is a subsequence of the $G^n$'s converging to $G^\infty$. Transform each domain
$G^n$ linearly so that a given $C_\nu$ is the unit circle and the points $p_\nu$, $q_\nu$, $r_\nu$ are at
the specified places $\theta_1$, $\theta_2$, $\theta_3$. As in [1], the boundary values $\mathfrak{x}^n_\nu(\theta)$ are
equicontinuous. A final subsequence, written $\mathfrak{x}^n(u, v)$, is obtained such that $\mathfrak{x}^n \to \mathfrak{x}^\infty$.
By referring to the geometric definition of convergence given in §1, it follows
that the Dirichlet functional is lower semicontinuous. Hence $D[\mathfrak{x}^\infty] \le N$, and
the theorem is proved.

6. The Continuity of the Functional $E[\mathfrak{x}]$

Let $\mathfrak{x}$ be a potential surface with continuous boundary values $\mathfrak{x}_\mu(\theta)$, and indicate
the Dirichlet integral of $\mathfrak{x}_\mu$ taken over the unit circle by $D_0[\mathfrak{x}_\mu]$. Define the
functional $E[\mathfrak{x}]$ by

$$E[\mathfrak{x}] = D[\mathfrak{x}] - \sum_{\mu=1}^{k} D_0[\mathfrak{x}_\mu].$$

In this section, we shall prove that $E[\mathfrak{x}]$ is continuous in $\mathfrak{P}$. This will reduce
the behavior of $D[\mathfrak{x}]$ to that of $D_0[\mathfrak{x}_\mu]$.

Let $G$ be the domain of the potential surface $\mathfrak{x}$, and $G_\mu$ that region of the
plane which is bounded by $C_\mu$ and intersects $G$. Draw a circle $K_\mu$ in $G$ sufficiently
near $C_\mu$, $\mu = 1, \dots, k$. Let $G'$ be the subdomain of $G$ bounded by
the circles $K_\mu$ and any point circles $c_\mu$, and $G'_\mu$ that subdomain of $G_\mu$ bounded
by the circle $K_\mu$. Finally, let $\bar{\mathfrak{x}}_\mu$ be the potential surface defined over $G_\mu$ and
having the boundary values of $\mathfrak{x}$ on $C_\mu$; $\mathfrak{x}_\mu$ is the linear transform of $\bar{\mathfrak{x}}_\mu$ to a
unit circle.


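6 Furthermore, for any number $N$ there is a compact subset $R$ of $\mathfrak{R}_k$ and a number $M$,
while for any $R$ and $M$ there is an $N'$, such that $\mathfrak{P}^N \subset R \times \mathfrak{P}^M_{\Gamma_1} \times \cdots \times \mathfrak{P}^M_{\Gamma_k} \subset \mathfrak{P}^{N'}$. This
follows from Theorem 5 to be proved below.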





We have 


= DU] - £DoJtW = lim [Dak] - £ Doth]); 

M -rl 


and 


zWf] - £ A*[y = £ / S^ds + zf rgds - £ [ h^ds 

m m on M Jjt m on 

■ ?/„.<* - t) S dj+ ?/,.'!*• 

If $K_\mu \to C_\mu$, the first integral on the right hand side approaches zero since
$\mathfrak{x} - \bar{\mathfrak{x}}_\mu = 0$ on $C_\mu$. Hence

( 4 ) Mf] = E / t^-p^-dt + E / r^ds. 

M-i J c M on C|i «'c M on 

This is an expression for $E[\mathfrak{x}]$ in terms of boundary integrals.

Theorem 5. Let $G^n$ be a sequence of domains in $\mathfrak{R}_k$ converging to $G$, and $\mathfrak{x}^n$
potential surfaces over $G^n$ converging uniformly$^7$ to the potential surface $\mathfrak{x}$ over $G$.
Then

$$E[\mathfrak{x}^n] \to E[\mathfrak{x}].$$

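Proof. First consider the case when the $G^n$ are ordinary domains in $\mathfrak{R}_k$.
If the limit domain $G$ is also ordinary, the theorem follows from (4). For,
$\mathfrak{x}^n \to \mathfrak{x}$ and $\mathfrak{x}^n - \bar{\mathfrak{x}}^n_\mu \to \mathfrak{x} - \bar{\mathfrak{x}}_\mu$ uniformly near $C_\mu$. Since $\mathfrak{x}^n - \bar{\mathfrak{x}}^n_\mu = 0$ on $C^n_\mu$,
it can be extended by reflection across $C^n_\mu$, and $\partial(\mathfrak{x}^n - \bar{\mathfrak{x}}^n_\mu)/\partial n$ converges
uniformly to $\partial(\mathfrak{x} - \bar{\mathfrak{x}}_\mu)/\partial n$ on $C_\mu$. The second term on the right hand side of (4)
is not involved since we have assumed that $G^n$ and $G$ are ordinary domains.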

Suppose that the limit domain $G$ is degenerate. The continuity of $E[\mathfrak{x}]$ will
be established by induction on the number $k$ of boundaries. We shall distinguish
two cases: a) the origin is enclosed by a large circle or is approached by a
large circle; and b) all the circles approaching the origin are small circles.

(a) Enclose this large circle and the set of circles approaching this large circle
by a curve $K$ containing no other circles. Let $G_1^n$ be the domain bounded by
all the circles outside $K$, and $G_2^n$ the domain bounded by all the circles inside
$K$ ($G_1^n$ is a bounded domain, $G_2^n$ unbounded). Let the limits of $G^n$, $G_1^n$, $G_2^n$,
considered merely as regions of the plane, be $G^*$, $G_1^*$, $G_2^*$ respectively. Define
$\mathfrak{y}^n$ and $\mathfrak{z}^n$ over the domains $G_1^n$ and $G_2^n$ respectively as the bounded potential
surfaces with boundary values equal to those of $\mathfrak{x}^n$. Denote the limits of $\mathfrak{x}^n$,
$\mathfrak{y}^n$, $\mathfrak{z}^n$ by $\mathfrak{x}^*$, $\mathfrak{y}^*$, $\mathfrak{z}^*$; they are potential surfaces over $G^*$, $G_1^*$, $G_2^*$. We shall first
show that $D_{G^n}[\mathfrak{x}^n] - D_{G_1^n}[\mathfrak{y}^n] - D_{G_2^n}[\mathfrak{z}^n] \to D_{G^*}[\mathfrak{x}^*] - D_{G_1^*}[\mathfrak{y}^*] - D_{G_2^*}[\mathfrak{z}^*]$.

Designate the parts of $G^n$ and $G^*$ exterior to $K$ by $A^n$ and $A^*$, and interior

7 This means that the boundary values of $\mathfrak{x}^n$ in the sense of §5, page 207 converge
uniformly to the corresponding boundary values of $\mathfrak{x}$.





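to $K$ by $B^n$ and $B^*$. Let $K_{\mathrm{in}}$ and $K_{\mathrm{ex}}$ be the interior and exterior of $K$. From
$G^n = A^n + B^n$, $G_1^n = A^n + K_{\mathrm{in}}$, $G_2^n = B^n + K_{\mathrm{ex}}$, one obtains

$$D_{G^n}[\mathfrak{x}^n] - D_{G_1^n}[\mathfrak{y}^n] - D_{G_2^n}[\mathfrak{z}^n] = \{D_{A^n}[\mathfrak{x}^n] - D_{A^n}[\mathfrak{y}^n]\} - D_{K_{\mathrm{in}}}[\mathfrak{y}^n] + \{D_{B^n}[\mathfrak{x}^n] - D_{B^n}[\mathfrak{z}^n]\} - D_{K_{\mathrm{ex}}}[\mathfrak{z}^n];$$

and a similar decomposition for $D_{G^*}[\mathfrak{x}^*] - D_{G_1^*}[\mathfrak{y}^*] - D_{G_2^*}[\mathfrak{z}^*]$. Now,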

DA?} - DA?} - A»[? B - ?, ? + ?) 

= £(?"- ?) d - { - ? - ± A n h s - f x {? - f)*£±Wda 

= ZW1 - A •[»)*]; 

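$D_{K_{\mathrm{in}}}[\mathfrak{y}^n] \to D_{K_{\mathrm{in}}}[\mathfrak{y}^*]$; and like results for $D_{B^n}[\mathfrak{x}^n] - D_{B^n}[\mathfrak{z}^n]$ and $D_{K_{\mathrm{ex}}}[\mathfrak{z}^n]$. Hence

(5) $D_{G^n}[\mathfrak{x}^n] - D_{G_1^n}[\mathfrak{y}^n] - D_{G_2^n}[\mathfrak{z}^n] \to D_{G^*}[\mathfrak{x}^*] - D_{G_1^*}[\mathfrak{y}^*] - D_{G_2^*}[\mathfrak{z}^*]$.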

The domains $G_1^n$ and $G_2^n$ have $k_1$ and $k_2$ boundaries respectively, where
$k_1 + k_2 = k$, and the theorem is supposed true for domains with less than $k$
boundaries. Let $G^n$, $G_1^n$, $G_2^n$, when considered as domains of $\mathfrak{R}_k$, $\mathfrak{R}_{k_1}$, $\mathfrak{R}_{k_2}$, have
$G$, $G_1$, $G_2$ as their limit domains respectively. $G_1$ consists of $G_1^*$ and other
regions $H_{(1)}$; $G_2$ consists of $G_2^*$ and other regions $H_{(2)}$; and $G$ consists of $G^*$ and
the regions $H_{(1)}$ and $H_{(2)}$. Indicate the boundary curves of $G_1^n$ by $C_{\mu_1}$, and
those of $G_2^n$ by $C_{\mu_2}$. By the induction,

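(6) $D_{G_1^n}[\mathfrak{y}^n] - \sum_{\mu_1} D_0[\mathfrak{x}^n_{\mu_1}] \to D_{G_1^*}[\mathfrak{y}^*] + D_{H_{(1)}}[\mathfrak{x}] - \sum_{\mu_1} D_0[\mathfrak{x}_{\mu_1}]$,

(7) $D_{G_2^n}[\mathfrak{z}^n] - \sum_{\mu_2} D_0[\mathfrak{x}^n_{\mu_2}] \to D_{G_2^*}[\mathfrak{z}^*] + D_{H_{(2)}}[\mathfrak{x}] - \sum_{\mu_2} D_0[\mathfrak{x}_{\mu_2}]$.

Adding (5), (6) and (7), one obtains the desired result

(8) $D_{G^n}[\mathfrak{x}^n] - \sum_{\mu} D_0[\mathfrak{x}^n_\mu] \to D_{G^*}[\mathfrak{x}^*] + D_{H_{(1)}}[\mathfrak{x}] + D_{H_{(2)}}[\mathfrak{x}] - \sum_{\mu} D_0[\mathfrak{x}_\mu] = D_G[\mathfrak{x}] - \sum_{\mu} D_0[\mathfrak{x}_\mu]$.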


(b) In this case, when all the circles approaching the origin are small circles,
the procedure is the same as the above. The only modification is that the
region $G_2^*$ is the whole plane, $\mathfrak{z}^*$ is identically constant, and $\mathfrak{y}^* = \mathfrak{x}^*$. The
relation (5) reads $D_{G^n}[\mathfrak{x}^n] - D_{G_1^n}[\mathfrak{y}^n] - D_{G_2^n}[\mathfrak{z}^n] \to 0$, and the final result (8) is
the same.

There remains the case when the $G^n$ are themselves degenerate. One may
suppose, without loss of generality, that all the $G^n$ have the same type of
degeneracy: $G^n$ consists of the distinct regions $G_1^n, \dots, G_m^n$ where the $G_j^n$ have
the same boundary and point circles for all $n$. The above proof then applies
to each sequence $G_j^n$ separately. The theorem is completely established.





Theorem 5 shows that the discontinuities of the functional $D[\mathfrak{x}]$ are due to
the boundary values of $\mathfrak{x}$ and not to the domains, and occur in the same manner
as each $D_0[\mathfrak{x}_\mu]$. It should be noted that $E[\mathfrak{x}]$ can be so defined, by (4), as to
exist even if $D[\mathfrak{x}]$ is infinite. Theorem 5 will still apply.

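If $\mathfrak{x}$ and $\mathfrak{y}$ are potential surfaces having the same domain $G$, define the cross
$E$-functional $E[\mathfrak{x}, \mathfrak{y}]$ by

$$E[\mathfrak{x}, \mathfrak{y}] = D[\mathfrak{x}, \mathfrak{y}] - \sum_{\mu=1}^{k} D_0[\mathfrak{x}_\mu, \mathfrak{y}_\mu].$$

The bilinear formula applies:

$$E[\mathfrak{x} + \mathfrak{y}] = E[\mathfrak{x}] + 2E[\mathfrak{x}, \mathfrak{y}] + E[\mathfrak{y}].$$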

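Consequently Theorem 5 has as a corollary

Theorem 6. Let $\mathfrak{x}^n$, $\mathfrak{y}^n$ be two potential surfaces both defined over the domain
$G^n$ of $\mathfrak{R}_k$, where $n = 1, 2, \dots$. Let the sequences $\mathfrak{x}^n$, $\mathfrak{y}^n$ converge uniformly$^7$ to
the potential surfaces $\mathfrak{x}$, $\mathfrak{y}$ defined over the domain $G$. Then

$$E[\mathfrak{x}^n, \mathfrak{y}^n] \to E[\mathfrak{x}, \mathfrak{y}].$$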
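
The step from Theorem 5 to Theorem 6 is not written out in the text. A possible sketch, assuming only the bilinear formula above and the symmetry of the cross functional (the polarization identity itself is standard and is an addition here, not a display from the source):

```latex
% Polarization: subtract the bilinear formula for x - y from that for x + y.
\begin{align*}
E[\mathfrak{x}+\mathfrak{y}] &= E[\mathfrak{x}] + 2E[\mathfrak{x},\mathfrak{y}] + E[\mathfrak{y}],\\
E[\mathfrak{x}-\mathfrak{y}] &= E[\mathfrak{x}] - 2E[\mathfrak{x},\mathfrak{y}] + E[\mathfrak{y}],\\
E[\mathfrak{x},\mathfrak{y}] &= \tfrac14\bigl(E[\mathfrak{x}+\mathfrak{y}] - E[\mathfrak{x}-\mathfrak{y}]\bigr).
\end{align*}
% If x^n -> x and y^n -> y uniformly, then x^n +- y^n -> x +- y uniformly,
% so Theorem 5 applied to both sequences gives E[x^n, y^n] -> E[x, y].
```

This presumes that Theorem 5 may be applied to the sums and differences of the given surfaces, which are potential surfaces over the same domains but need not lie in the space of admissible surfaces.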

Part II. Application to Unstable Minimal Surfaces 

7. Linear Paths of Surfaces 

To obtain the Morse relations for minimal surfaces bounded by $\Gamma_1$,
$\Gamma_2, \dots, \Gamma_k$, it is necessary to establish a variational condition and a reducibility
condition. The latter requires discussing special paths of surfaces in $\mathfrak{P}$.

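Let us suppose that each $\Gamma_\mu$ is rectifiable, and select the representation $\mathfrak{g}_\mu(\theta)$,
$0 \le \theta \le 2\pi$, of $\Gamma_\mu$ in which the parameter $\theta$ is proportional to the arc length
on $\Gamma_\mu$ measured from the point $P_\mu$ and in the direction $P_\mu Q_\mu R_\mu$. Each $\mathfrak{g}_\mu(\theta)$
has the property

(9) $\left| \dfrac{\mathfrak{g}_\mu(\theta') - \mathfrak{g}_\mu(\theta'')}{\theta' - \theta''} \right| \le \dfrac{L}{2\pi}$

where $L$ is the largest of the lengths of $\Gamma_1, \dots, \Gamma_k$. Any other representation
$\mathfrak{x}_\mu(\theta)$ of $\Gamma_\mu$ is given by

$$\mathfrak{x}_\mu(\theta) = \mathfrak{g}_\mu(\lambda_\mu(\theta))$$

where $\lambda_\mu(\theta)$ is a monotonic and continuous function of $\theta$. Call $\lambda_\mu(\theta)$ the
monotonic function determined by $\mathfrak{x}_\mu(\theta)$.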
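
The bound (9) follows from the arc-length parametrization. As a brief worked check, writing $\ell_\mu$ for the length of the $\mu$-th boundary curve (a symbol introduced here for illustration only):

```latex
% The arc of the curve between parameters theta' and theta'' has length
% (ell_mu / 2 pi) |theta' - theta''|, and a chord is never longer than its arc.
\[
|\mathfrak{g}_\mu(\theta') - \mathfrak{g}_\mu(\theta'')|
\;\le\; \frac{\ell_\mu}{2\pi}\,|\theta' - \theta''|
\;\le\; \frac{L}{2\pi}\,|\theta' - \theta''| ,
\qquad L = \max_{\mu} \ell_\mu .
\]
```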

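Let $\mathfrak{x}_0$ and $\mathfrak{x}_1$ be any two potential surfaces in $\mathfrak{P}$ having the same domain $G$,
and with the boundary values (in the sense defined in §5, Part I) $\mathfrak{x}_\mu(\theta; 0)$ and
$\mathfrak{x}_\mu(\theta; 1)$ respectively. Define $\mathfrak{x}(t)$, $0 \le t \le 1$, as the potential surface with
the same domain $G$ and boundary values $\mathfrak{x}_\mu(\theta; t)$ given by

(10) $\mathfrak{x}_\mu(\theta; t) = \mathfrak{g}_\mu[(1 - t)\lambda_\mu(\theta; 0) + t\lambda_\mu(\theta; 1)]$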





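where $\lambda_\mu(\theta; 0)$ and $\lambda_\mu(\theta; 1)$ are the monotonic functions determined by $\mathfrak{x}_0$ and
$\mathfrak{x}_1$ respectively:

(11) $\mathfrak{x}_\mu(\theta; 0) = \mathfrak{g}_\mu[\lambda_\mu(\theta; 0)]$, $\quad \mathfrak{x}_\mu(\theta; 1) = \mathfrak{g}_\mu[\lambda_\mu(\theta; 1)]$.

The potential surfaces $\mathfrak{x}(t)$, $0 \le t \le 1$, form a "linear" path joining $\mathfrak{x}_0$ and $\mathfrak{x}_1$.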
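Lemma 3. For the linear path $\mathfrak{x}(t)$ constructed above,

$$\left| \frac{\mathfrak{x}(t') - \mathfrak{x}(t'')}{t' - t''} \right| \le \frac{L}{2\pi} \max_{\mu,\,\theta} |\lambda_\mu(\theta; 1) - \lambda_\mu(\theta; 0)|.$$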


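Proof. The potential surface $\dfrac{\mathfrak{x}(t') - \mathfrak{x}(t'')}{t' - t''}$ has boundary values on $C_\mu$ equal to

$$\frac{\mathfrak{x}_\mu(\theta; t') - \mathfrak{x}_\mu(\theta; t'')}{t' - t''} = \frac{\mathfrak{g}_\mu(\psi') - \mathfrak{g}_\mu(\psi'')}{\psi' - \psi''} \cdot \frac{\psi' - \psi''}{t' - t''}$$

where $\psi' = (1 - t')\lambda_\mu(\theta; 0) + t'\lambda_\mu(\theta; 1)$, and $\psi''$ is a similar expression involving
$t''$. Using (9) one obtains

$$\left| \frac{\mathfrak{x}_\mu(\theta; t') - \mathfrak{x}_\mu(\theta; t'')}{t' - t''} \right| \le \frac{L}{2\pi}\, |\lambda_\mu(\theta; 1) - \lambda_\mu(\theta; 0)|.$$

Similarly for the boundary values of $\dfrac{\mathfrak{x}(t') - \mathfrak{x}(t'')}{t' - t''}$ on any point circle $c_\mu$. The
lemma follows.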
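
The key step in this proof is that the interpolated parameter is linear in $t$. A short worked version, under the definitions of (10) and using the maximum principle for harmonic functions (the latter is an assumption about how the boundary estimate is carried to the interior; the source only says that the lemma follows):

```latex
% Difference of the interpolated parameters for two values of t:
\begin{align*}
\psi' - \psi''
 &= \bigl[(1-t')\lambda_\mu(\theta;0) + t'\lambda_\mu(\theta;1)\bigr]
  - \bigl[(1-t'')\lambda_\mu(\theta;0) + t''\lambda_\mu(\theta;1)\bigr] \\
 &= (t'-t'')\,\bigl[\lambda_\mu(\theta;1) - \lambda_\mu(\theta;0)\bigr].
\end{align*}
% Hence, by (9),
\[
\left|\frac{\mathfrak{g}_\mu(\psi') - \mathfrak{g}_\mu(\psi'')}{t'-t''}\right|
 \le \frac{L}{2\pi}\,\bigl|\lambda_\mu(\theta;1) - \lambda_\mu(\theta;0)\bigr| .
\]
% Each component of the difference quotient is harmonic, so a bound on the
% boundary values extends to the whole domain.
```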

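Theorem 7. Let $\mathfrak{x}_0^n$ and $\mathfrak{x}_1^n$, both having the domain $G^n$, $n = 1, 2, \dots$, be two
sequences of potential surfaces in $\mathfrak{P}$ converging to the same potential surface $\mathfrak{x}$.
Let $\mathfrak{x}^n(t)$ be the linear path joining $\mathfrak{x}_0^n$ and $\mathfrak{x}_1^n$. Then

$$\frac{E[\mathfrak{x}^n(t')] - E[\mathfrak{x}^n(t'')]}{t' - t''} \to 0 \quad \text{as } n \to \infty$$

uniformly in $t'$ and $t''$.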

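Proof. The desired difference quotient can be written in the form

(12) $\dfrac{E[\mathfrak{x}^n(t')] - E[\mathfrak{x}^n(t'')]}{t' - t''} = \dfrac{E[\mathfrak{x}^n(t') - \mathfrak{x}^n(t''),\ \mathfrak{x}^n(t') + \mathfrak{x}^n(t'')]}{t' - t''} = E\left[\dfrac{\mathfrak{x}^n(t') - \mathfrak{x}^n(t'')}{t' - t''},\ \mathfrak{x}^n(t') + \mathfrak{x}^n(t'')\right]$.

Now the monotonic functions $\lambda^n_\mu(\theta; 0)$ and $\lambda^n_\mu(\theta; 1)$ determined by $\mathfrak{x}_0^n$ and $\mathfrak{x}_1^n$
both converge uniformly to the monotonic function determined by $\mathfrak{x}$. Consequently
$\mathfrak{x}^n(t)$ converges to $\mathfrak{x}$, uniformly in $t$; and by Lemma 3,
$\dfrac{\mathfrak{x}^n(t') - \mathfrak{x}^n(t'')}{t' - t''}$
converges to the potential surface identically equal to zero, uniformly in $t'$
and $t''$. Theorem 6 of Part I applied to the last expression in (12) shows that
the desired difference quotient converges to $E[0, 2\mathfrak{x}] = 0$, uniformly in $t'$ and $t''$.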
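
The first equality in (12) is the difference-of-squares identity for a symmetric bilinear functional. As a brief check, with $\mathfrak{a}$ and $\mathfrak{b}$ standing for the two surfaces on the path (placeholder names used here only):

```latex
% With E[a] = E[a, a] and E[a, b] = E[b, a]:
\begin{align*}
E[\mathfrak{a}-\mathfrak{b},\ \mathfrak{a}+\mathfrak{b}]
 &= E[\mathfrak{a},\mathfrak{a}] + E[\mathfrak{a},\mathfrak{b}]
  - E[\mathfrak{b},\mathfrak{a}] - E[\mathfrak{b},\mathfrak{b}] \\
 &= E[\mathfrak{a}] - E[\mathfrak{b}] .
\end{align*}
% Dividing by t' - t'' and using linearity in the first argument gives (12).
```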





8. The Reducibility Condition 

We shall obtain reducibility on the assumption that reducibility holds for the 
case of single boundaries. Such results for single boundaries have been derived 
in rather general cases by the author, and by Morse and Tompkins. 

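Suppose that the following result has been established for each of the curves
$\Gamma_\mu$, $\mu = 1, 2, \dots, k$.

Reducibility Condition for Single Boundaries.$^8$ Let $\mathfrak{y}$ be any surface in $\mathfrak{P}_\Gamma$.
To each surface $\mathfrak{x}$ of $\mathfrak{P}_\Gamma$ there can be associated a linear path $\mathfrak{x}(t)$ joining $\mathfrak{x}$ to $\mathfrak{x}' = \mathfrak{x}(1)$,
where $\mathfrak{x}'$ depends continuously on $\mathfrak{x}$, and $\mathfrak{x}' = \mathfrak{y}$ if $\mathfrak{x} = \mathfrak{y}$. For any $\eta$ and $\gamma$ there
is an $\alpha$ independent of $\gamma$ and a $\delta$ such that, if $\mathfrak{x}$ is in the $\delta$-neighborhood of $\mathfrak{y}$, this
linear path $\mathfrak{x}(t)$ has the following properties:

1). $D[\mathfrak{x}(t)]$ exists and depends continuously on $t$ in $0 \le t \le 1$.

2). $D[\mathfrak{x}(t')] > D[\mathfrak{y}] + \eta$ and $D[\mathfrak{x}(t'')] > D[\mathfrak{y}] + \eta$ imply that
$\dfrac{\Delta D}{\Delta t} = \dfrac{D[\mathfrak{x}(t')] - D[\mathfrak{x}(t'')]}{t' - t''} \le -\alpha$.

3). $\dfrac{\Delta D}{\Delta t} \le \gamma$ for any $t'$, $t''$.

In case $\dfrac{dD}{dt}$ exists for $0 < t < 1$, Properties 2) and 3) are equivalent to:

2'). $D[\mathfrak{x}(t)] > D[\mathfrak{y}] + \eta$ implies that $\dfrac{dD}{dt} \le -\alpha$.

3'). $\dfrac{dD}{dt} \le \gamma$ for $0 < t < 1$.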

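There is an important consequence of Property 2).

Lemma 4. If $D[\mathfrak{x}(t')] \le D[\mathfrak{y}] + \eta$ for some $t'$, then $D[\mathfrak{x}(t)] \le D[\mathfrak{y}] + \eta$ for all
$t \ge t'$. If $D[\mathfrak{x}(t'')] > D[\mathfrak{y}] + \eta$ for some $t''$, then $D[\mathfrak{x}(t)] > D[\mathfrak{y}] + \eta$ for all $t \le t''$.

Proof. Let the maximum of $D[\mathfrak{x}(t)]$ for $t \ge t'$ be attained for $t = \tau$. If
$D[\mathfrak{x}(\tau)] > D[\mathfrak{y}] + \eta$, choose $\tau'$ between $t'$ and $\tau$ and so near $\tau$ that $D[\mathfrak{x}(\tau')] > D[\mathfrak{y}] + \eta$.
Then $\dfrac{D[\mathfrak{x}(\tau)] - D[\mathfrak{x}(\tau')]}{\tau - \tau'} \ge 0$, contrary to Property 2). Hence
$D[\mathfrak{x}(t)] \le D[\mathfrak{y}] + \eta$, and the first statement of the lemma is proved. The
second statement of the lemma is a consequence of the first statement.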

Therefore, the possibilities for the graph of $D[\mathfrak{x}(t)]$ as a function of $t$ in $0 \le t \le 1$
are:

a) If $D[\mathfrak{x}] \le D[\mathfrak{y}] + \eta$, then $D[\mathfrak{x}(t)]$ is always at most $D[\mathfrak{y}] + \eta$.

b) If $D[\mathfrak{x}] > D[\mathfrak{y}] + \eta$, then $D[\mathfrak{x}(t)]$ changes at a rate $\le -\alpha$ until $D[\mathfrak{y}] + \eta$
is reached (if it is reached at all) and thereafter $D[\mathfrak{x}(t)]$ remains at most $D[\mathfrak{y}] + \eta$.

This is exactly what is meant by 'reducibility'. Compare page 36 of [6], Lemma
9 and Theorem 4 of [12], and page 445 of [7].

We now return to the case of several boundaries. 


8 The reducibility condition stated here can be easily generalized but it is useless to do so.

9 All the quantities $\eta$, $\gamma$, $\alpha$, $\delta$ are positive.





Theorem 8. Suppose that the reducibility condition above has been established
for the individual rectifiable curves $\Gamma_1, \Gamma_2, \dots, \Gamma_k$. Then the same condition
holds for the case of the $k$ boundaries $\Gamma_1, \Gamma_2, \dots, \Gamma_k$ together, i.e., in the space
$\mathfrak{P}_{\Gamma_1, \Gamma_2, \dots, \Gamma_k}$.

Proof. Let $\mathfrak{y}$ be a fixed surface in $\mathfrak{P} = \mathfrak{P}_{\Gamma_1, \Gamma_2, \dots, \Gamma_k}$ with boundary values
$\mathfrak{y}_\mu$, and let $\mathfrak{x}$ with domain $G$ and boundary values $\mathfrak{x}_\mu$ be any surface in $\mathfrak{P}$. To
the potential surface $\mathfrak{x}_\mu$ with boundary $\Gamma_\mu$ there is associated the linear path
$\mathfrak{x}_\mu(t)$ determined by our hypothesis for $\Gamma_\mu$. The required linear path $\mathfrak{x}(t)$ associated
to $\mathfrak{x}$ is defined over the domain $G$ and has the boundary values $\mathfrak{x}_\mu(t)$, $\mu = 1, \dots, k$.
We have

(13) $D[\mathfrak{x}(t)] = \sum_{\mu=1}^{k} D_0[\mathfrak{x}_\mu(t)] + E[\mathfrak{x}(t)]$,

which shows that $D[\mathfrak{x}(t)]$ is a continuous function of $t$ because of Property 1) of
our hypothesis and of Theorem 5 in Part I.

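Let $\eta$ be arbitrary. Our hypothesis for $\Gamma_\mu$ determines a constant $\alpha_\mu$ belonging
to $\eta/2k$ in place of $\eta$. Set $\alpha$ = minimum $\alpha_\mu$ for $\mu = 1, \dots, k$. Choose
any $\gamma \le \alpha/2k$. There is a $\delta_\mu$ such that all the assertions of our hypothesis
(with $\eta/2k$ replacing $\eta$) apply if $\mathfrak{x}_\mu$ is in the $\delta_\mu$-neighborhood of $\mathfrak{y}_\mu$. Set $\delta'$ =
minimum $\delta_\mu$ for $\mu = 1, \dots, k$.

If $\mathfrak{x}$ is close to $\mathfrak{y}$ then $\mathfrak{x}(t)$ is likewise close to $\mathfrak{y}$ (since $\mathfrak{x}_\mu(t)$ is close to $\mathfrak{y}_\mu$ for each
$\mu$). By Theorem 5 in Part I, there is a $\delta''$ such that

(14) $|E[\mathfrak{x}(t)] - E[\mathfrak{y}]| < \eta/2$

whenever $\mathfrak{x}$ is in the $\delta''$-neighborhood of $\mathfrak{y}$. By Theorem 7, there is a $\delta'''$ such
that

(15) $\dfrac{\Delta E}{\Delta t} = \dfrac{E[\mathfrak{x}(t')] - E[\mathfrak{x}(t'')]}{t' - t''} < \gamma \le \alpha/2k$

if $\mathfrak{x}$ is in the $\delta'''$-neighborhood of $\mathfrak{y}$.

The required $\delta$ is defined by $\delta$ = minimum $\delta'$, $\delta''$, $\delta'''$. Consider only surfaces
$\mathfrak{x}$ which are in the $\delta$-neighborhood of $\mathfrak{y}$.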

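Suppose that $D[\mathfrak{x}(t')] > D[\mathfrak{y}] + \eta$ and $D[\mathfrak{x}(t'')] > D[\mathfrak{y}] + \eta$, where $t'' < t'$.
The first inequality requires that $D_0[\mathfrak{x}_\nu(t')] > D_0[\mathfrak{y}_\nu] + \eta/2k$ for some $\mu = \nu$,
by (13) and (14). Lemma 4 yields $D_0[\mathfrak{x}_\nu(t'')] > D_0[\mathfrak{y}_\nu] + \eta/2k$. Properties 2)
and 3) of our hypotheses give

(16) $\dfrac{\Delta D_0[\mathfrak{x}_\nu]}{\Delta t} = \dfrac{D_0[\mathfrak{x}_\nu(t')] - D_0[\mathfrak{x}_\nu(t'')]}{t' - t''} \le -\alpha$

and

(17) $\dfrac{\Delta D_0[\mathfrak{x}_\mu]}{\Delta t} \le \gamma \le \alpha/2k$ for all $\mu$.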





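A final calculation, using (15), (16), (17), shows that

$$\frac{\Delta D[\mathfrak{x}]}{\Delta t} = \frac{D[\mathfrak{x}(t')] - D[\mathfrak{x}(t'')]}{t' - t''} = \sum_{\mu=1}^{k} \frac{\Delta D_0[\mathfrak{x}_\mu]}{\Delta t} + \frac{\Delta E[\mathfrak{x}]}{\Delta t} \le -\alpha + \sum_{\mu \ne \nu} \frac{\alpha}{2k} + \frac{\alpha}{2k} = -\frac{\alpha}{2}.$$

This proves Property 2).

Property 3) likewise holds, by (15) and (17). But this is of no importance
in connection with reducibility. q.e.d.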
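
The arithmetic behind the final estimate can be spelled out as follows. This is a sketch under the bounds as read from (15), (16), (17): one index $\nu$ contributes at most $-\alpha$, each of the remaining $k - 1$ indices at most $\alpha/2k$, and the $E$ term at most $\alpha/2k$.

```latex
\[
\frac{\Delta D[\mathfrak{x}]}{\Delta t}
 \;\le\; -\alpha + (k-1)\,\frac{\alpha}{2k} + \frac{\alpha}{2k}
 \;=\; -\alpha + \frac{\alpha}{2}
 \;=\; -\frac{\alpha}{2} .
\]
% So Property 2) holds for the k boundaries together with alpha/2 in place of alpha.
```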

For single boundaries reducibility has been proved by the author, by Morse 
and Tompkins, and by Courant for special classes of boundary curves. We 
shall discuss each briefly. 

The author in [12] used the following type of path joining two boundary
representations $\mathfrak{x}_0(\theta)$ and $\mathfrak{x}_1(\theta)$ of the curve $\Gamma$. Let $\mathfrak{x}_0(\lambda_0(\theta)) = \mathfrak{g}(\theta)$ and
$\mathfrak{x}_1(\lambda_1(\theta)) = \mathfrak{g}(\theta)$ where $\lambda_0(\theta)$ and $\lambda_1(\theta)$ are monotonic functions of $\theta$ and $\mathfrak{g}(\theta)$ is
that representation of $\Gamma$ the parameter $\theta$ of which is proportional to the arc
length. Define $\mathfrak{x}(\varphi; t)$ by

$$\mathfrak{x}(\varphi; t) = \mathfrak{g}(\theta) \quad \text{where } \varphi = (1 - t)\lambda_0(\theta) + t\lambda_1(\theta).$$

Compare with the linear path (10), (11) used in the present paper. For this
path a property such as Theorem 7 is very difficult to prove. Accordingly, the
results of [12] cannot be used here.

Morse and Tompkins in [7] proved the reducibility condition stated above in
case $\Gamma$ has the following property:

(18) $\mathfrak{g}'(\theta)$ exists and $|\mathfrak{g}'(\theta_1) - \mathfrak{g}'(\theta_2)| \le M |\theta_1 - \theta_2|$

for all $\theta_1$, $\theta_2$. See Lemma 5.1, Lemma 5.4 and Theorem 5.1 in [7]. The hypothesis
of Theorem 8 is consequently satisfied if each $\Gamma_\mu$, $\mu = 1, \dots, k$, has
this property.

The above reducibility condition is implicitly contained in the work of Courant 
[3] for polygonal boundaries. We shall not stress this point however, since 
Courant’s method is directly applicable without the intervention of §§7, 8. We 
consider this matter in the next section. 


9. The Case of Polygonal Boundaries 

We shall show how to apply the method of Courant [3] to the case of several 
polygonal boundaries, using only the results of Part I. 

Let the vertices of P M be • etc. For a given surface 

$(w, v) of $ denote points on which are mapped into P M , Q M , , • • • by 

Pv > Qn } r M » > * * * • Map the circle by a linear transformation into a unit 

circle so that the images of , r M are the specified points 0i, 02, 0s. The 
images of , • ■ • on the unit circle will be denoted collectively by ; and 
<n 7 * 2 , • * * > <rk will be indicated collectively by a. 





Let $\mathfrak{Q}$ denote the space of elements $\{G, \sigma\}$, of domains $G$ and points $\sigma$, with
an obvious metric. The dimension of $\mathfrak{Q}$ is $3k - 6 + V$ where $V$ is the total
number of vertices in the polygons $\Gamma_1, \dots, \Gamma_k$. $\mathfrak{Q}$ has the same connectivity
numbers as $\mathfrak{R}_k$.

The space of potential surfaces j (u, v ) of $ defined over elements {G, a) of O 
will be indicated by has the same connectivity numbers as ty. 

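For a given $\{G, \sigma\}$ of $\mathfrak{Q}$ consider the problem of minimizing $D[\mathfrak{x}]$ among all
surfaces $\mathfrak{x}$ of $\mathfrak{P}^*$ defined over this $\{G, \sigma\}$. By the compactness of $\mathfrak{P}^N$ (and of
$\mathfrak{P}^{*N}$), this problem has a solution $\mathfrak{x}$. As in [3], using the linear path $\mathfrak{x}(t) =
(1 - t)\mathfrak{x}_0 + t\mathfrak{x}_1$ joining two surfaces $\mathfrak{x}_0$ and $\mathfrak{x}_1$ having the same $\{G, \sigma\}$, the solution
is unique. Denote it by $\mathfrak{x}(G, \sigma)$, and set $d(G, \sigma) = D[\mathfrak{x}(G, \sigma)]$.

Theorem 9. The surface $\mathfrak{x}(G, \sigma)$ and the quantity $d(G, \sigma)$ depend continuously
on $\{G, \sigma\}$.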
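
The uniqueness claim made just before Theorem 9 rests on the convexity of the Dirichlet integral along the straight path between two candidates. A hedged sketch of the identity involved (the source refers to [3] for the argument; the display below is a standard computation for a quadratic functional and assumes that the intermediate surfaces remain admissible for the same domain and vertex positions):

```latex
% For a quadratic functional D with associated bilinear form D[.,.]:
\[
D\bigl[(1-t)\mathfrak{x}_0 + t\,\mathfrak{x}_1\bigr]
 = (1-t)\,D[\mathfrak{x}_0] + t\,D[\mathfrak{x}_1] - t(1-t)\,D[\mathfrak{x}_0 - \mathfrak{x}_1] .
\]
% If x_0 and x_1 both attain the minimum d, then at t = 1/2
%   d <= D[(x_0 + x_1)/2] = d - (1/4) D[x_0 - x_1],
% which forces D[x_0 - x_1] = 0, so x_0 - x_1 is constant; agreement of the
% boundary values at the normalized points then gives x_0 = x_1.
```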

Proof. Let $\{G^n, \sigma^n\} \to \{G, \sigma\}$. Abbreviate $\mathfrak{x}(G, \sigma)$ by $\mathfrak{x}$, and let $\mathfrak{x}_\mu$ be the
potential surface defined over a unit circle with the boundary values determined
by $\mathfrak{x}$ on $C_\mu$. Vary the points $\sigma_\mu$ and the surface $\mathfrak{x}_\mu$ as in [3] so that the
$\sigma_\mu$ move into $\sigma^n_\mu$. The varied potential surface $\mathfrak{y}^n_\mu$ is such that

(19) $\mathfrak{y}^n_\mu \to \mathfrak{x}_\mu$ and $D_0[\mathfrak{y}^n_\mu] \to D_0[\mathfrak{x}_\mu]$ as $n \to \infty$.

Let $\mathfrak{y}^n$ be the potential surface defined over $\{G^n, \sigma^n\}$ with boundary values
on $C^n_\mu$ determined by $\mathfrak{y}^n_\mu$, $\mu = 1, \dots, k$. We have

(20) $D[\mathfrak{y}^n] = \sum_{\mu=1}^{k} D_0[\mathfrak{y}^n_\mu] + E[\mathfrak{y}^n] \to \sum_{\mu=1}^{k} D_0[\mathfrak{x}_\mu] + E[\mathfrak{x}] = D[\mathfrak{x}]$

by (19) and the continuity of the functional $E[\mathfrak{x}]$ (Theorem 5 of Part I). Because
$D[\mathfrak{y}^n] \ge d(G^n, \sigma^n)$, (20) yields

(21) $d(G, \sigma) \ge \limsup_{n \to \infty} d(G^n, \sigma^n)$.

In particular the quantities $d(G^n, \sigma^n)$ are uniformly bounded.

On the other hand, by the compactness of $\mathfrak{P}^N$, a subsequence of the surfaces
$\mathfrak{x}(G^n, \sigma^n)$ converges to a potential surface $\mathfrak{x}^\infty$ defined over $\{G, \sigma\}$. The lower
semicontinuity of the Dirichlet functional gives

(22) $d(G, \sigma) \le D[\mathfrak{x}^\infty] \le \liminf_{n \to \infty} d(G^n, \sigma^n)$.

The relations (21), (22) establish the continuity of $d(G, \sigma)$.

The equality sign holds throughout (22), so that $D[\mathfrak{x}^\infty] = d(G, \sigma)$ or $\mathfrak{x}^\infty =
\mathfrak{x}(G, \sigma)$. Thus $\mathfrak{x}(G^n, \sigma^n) \to \mathfrak{x}(G, \sigma)$, and the theorem is proved.
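
The way (21) and (22) combine can be written as a single chain. This is only a restatement of the two displayed relations; the middle inequality is the general fact that a lower limit never exceeds an upper limit.

```latex
\[
d(G,\sigma) \;\le\; D[\mathfrak{x}^\infty]
 \;\le\; \liminf_{n\to\infty} d(G^n,\sigma^n)
 \;\le\; \limsup_{n\to\infty} d(G^n,\sigma^n)
 \;\le\; d(G,\sigma) .
\]
% All terms are therefore equal: d(G^n, sigma^n) -> d(G, sigma), and
% D[x^infty] = d(G, sigma), so x^infty is the unique minimizing surface.
```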

This theorem shows first that the set of surfaces $\mathfrak{x}(G, \sigma)$ is topologically equivalent
to the space $\mathfrak{Q}$. The symbol $\mathfrak{Q}$ will henceforth designate the space of
these surfaces $\mathfrak{x}(G, \sigma)$. The theorem then asserts that $D[\mathfrak{x}]$ is continuous over $\mathfrak{Q}$.

So far as Morse theory is concerned, our discussion may be limited to the
space $\mathfrak{Q}$.





10. The Variational Condition

In this section, no restriction will be made on the contours $\Gamma_1, \dots, \Gamma_k$.

A surface $\mathfrak{x}$ in $\mathfrak{P}$ is a minimal surface if the analytic function $\varphi(w) =
(\mathfrak{x}_u - i\mathfrak{x}_v)^2 = \mathfrak{x}_u^2 - \mathfrak{x}_v^2 - 2i\mathfrak{x}_u\mathfrak{x}_v$ is identically zero in each of the regions over
which $\mathfrak{x}$ is defined. The purpose of this section is to show, when $\mathfrak{y}$ is not a minimal
surface, that a neighborhood of $\mathfrak{y}$ can be deformed so as to decrease the value
of the Dirichlet functional. If $\mathfrak{y}$ is an ordinary surface in $\mathfrak{P}$, i.e., not degenerate,
then this result is an immediate consequence of variations performed by Courant.
The difficulty arises when $\mathfrak{y}$ is degenerate; for this case a more detailed investigation
is necessary.
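
For orientation, the condition that $\varphi$ vanishes can be unpacked into real equations. In the sketch below the squares and products of the vector derivatives are understood as scalar products, which is the usual convention and an assumption about the notation here:

```latex
\[
\varphi(w) = (\mathfrak{x}_u - i\,\mathfrak{x}_v)^2
 = \bigl(\mathfrak{x}_u^2 - \mathfrak{x}_v^2\bigr) - 2i\,\mathfrak{x}_u\!\cdot\!\mathfrak{x}_v ,
 \qquad w = u + iv .
\]
% phi = 0 is therefore equivalent to the two real conditions
\[
\mathfrak{x}_u^2 = \mathfrak{x}_v^2 , \qquad \mathfrak{x}_u\!\cdot\!\mathfrak{x}_v = 0 ,
\]
% i.e. the harmonic surface is parametrized conformally.
```

This is why it suffices, in what follows, to assume that the imaginary part $-2\mathfrak{x}_u \mathfrak{x}_v$ (or, after a rotation, the real part) is not identically zero.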

Let $\mathfrak{y}$, with domain $G$, be a surface in $\mathfrak{P}$ not a minimal surface. We may suppose
without loss of generality that $-2\mathfrak{y}_u\mathfrak{y}_v$ is not identically zero in at least
one of the regions $G'$ which compose $G$.$^{10}$ Let $a$ be any value $\ge D[\mathfrak{y}]$. Let $\mathfrak{x}$,
with domain $H$, indicate a surface in $\mathfrak{P}^a$ near $\mathfrak{y}$, and let $H'$ be that region of $H$
corresponding to $G'$ and normalized analogously to $G'$. The following lemma
is a result of performing certain variations due to Courant (see [2]):

Lemma 5. Let $\mathfrak{x}$, with domain $H$, be a surface in $\mathfrak{P}^a$, and suppose that $-2\mathfrak{x}_u\mathfrak{x}_v \le
-b < 0$ in a fixed square $K$ (with sides parallel to the $u$ and $v$ axes) of side $2r$
interior to the region $H'$ of $H$. Then a deformation $\mathfrak{x}(\epsilon)$, $0 \le \epsilon \le 1$, of $\mathfrak{x}$ can be
obtained such that

$$D[\mathfrak{x}(\epsilon)] \le D[\mathfrak{x}] - \beta\epsilon$$

where $\beta$ depends on $a$, $b$, and $r$. This deformation $\mathfrak{x}(\epsilon)$ is a continuous function
of $\epsilon$, and of $\mathfrak{x}$ as long as no boundaries of $H'$ appear in the square $K$.

Proof. Deform the region $H'$ into $H'(\epsilon)$ by the transformation

(23) $U = u + \epsilon\lambda(u, v)$, $\quad V = v$,

where

(24) $\lambda(u, v) = 0$ outside the square $K$, $\quad \lambda(u, v) = \dfrac{[r^2 - (u - u_0)^2][2r - (v - v_0)]}{2r}$ inside $K$.

Here, $(u_0, v_0)$ is the center of the lower side $L$ of the square $K$ and $0 \le \epsilon \le
1/(4r)$. The function $\lambda(u, v)$ is non-negative, is discontinuous across the side
$L$, and has its first derivatives bounded in absolute value by $2r$. By coordinating
the point $u$ of the lower edge of $L$ with the point $u + \epsilon\lambda(u, v_0)$ of the upper
edge of $L$, the transformed domain becomes a Riemann domain. Define $\mathfrak{x}(\epsilon)$ over
the Riemann domain $H'(\epsilon)$ as the potential surface having the same boundary


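10 If $-2\mathfrak{x}_u\mathfrak{x}_v \equiv 0$ but $\mathfrak{x}_u^2 - \mathfrak{x}_v^2 \not\equiv 0$, rotate the $(u, v)$ coordinate system 45° and consider
the new coordinates $u'$, $v'$. We have $u' = (u + v)/\sqrt{2}$, $v' = (u - v)/\sqrt{2}$ and $\mathfrak{x}_u^2 - \mathfrak{x}_v^2 = 2\mathfrak{x}_{u'}\mathfrak{x}_{v'}$.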





values as $\mathfrak{x}$ on each boundary. One may map $H'(\epsilon)$ conformally on a circular
domain normalized analogously to $H'$, and consider $\mathfrak{x}(\epsilon)$ as a potential surface
over this circular domain.$^{11}$ In all the other regions besides $H'$ which compose
$H$, set $\mathfrak{x}(\epsilon) \equiv \mathfrak{x}$. It can be proved by use of conformal mapping methods
that $\mathfrak{x}(\epsilon)$ depends continuously on $\epsilon$, and on $\mathfrak{x}$ as long as no boundaries of $H'$
appear inside the square $K$.
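
The properties claimed for the function $\lambda$ of (24) can be verified directly. A short worked check, using only the formula for $\lambda$ inside the square $K$, where $|u - u_0| \le r$ and $0 \le v - v_0 \le 2r$:

```latex
\begin{align*}
\lambda_u &= -\frac{2(u-u_0)\,[\,2r-(v-v_0)\,]}{2r},
 & |\lambda_u| &\le \frac{2\cdot r\cdot 2r}{2r} = 2r,\\
\lambda_v &= -\frac{r^2-(u-u_0)^2}{2r},
 & |\lambda_v| &\le \frac{r^2}{2r} = \frac{r}{2} \le 2r .
\end{align*}
% lambda vanishes on the sides u = u_0 +- r and on the upper side v = v_0 + 2r,
% but equals r^2 - (u - u_0)^2 on the lower side L: hence the jump across L.
% For 0 <= eps <= 1/(4r) the Jacobian of (23) satisfies
\[
\frac{\partial(U,V)}{\partial(u,v)} = 1 + \epsilon\,\lambda_u \;\ge\; 1 - \frac{1}{4r}\cdot 2r = \frac12 > 0 ,
\]
% so the transformation is one-to-one inside K.
```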

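To compute $D[\mathfrak{x}(\epsilon)]$, introduce the surface $\mathfrak{z}(\epsilon)$ defined over the region $H'(\epsilon)$
by $\mathfrak{z}(U, V; \epsilon) = \mathfrak{x}(u, v)$ where $U$, $V$ are given in (23). Then $\mathfrak{x}(\epsilon)$ has the same
boundary values as $\mathfrak{z}(\epsilon)$ in $H'(\epsilon)$, and we have by a simple calculation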

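$$D[\mathfrak{x}(\epsilon)] \le D[\mathfrak{z}(\epsilon)] = D[\mathfrak{x}] + \frac{\epsilon}{2} \int_{u_0 - r}^{u_0 + r} \lambda(u, v_0)\,(-2\mathfrak{x}_u\mathfrak{x}_v)_{v = v_0}\, du + \epsilon^2 I$$

where $|I| \le 4r^2 D[\mathfrak{x}] \le 4r^2 a$. Since $-2\mathfrak{x}_u\mathfrak{x}_v \le -b$ in the square $K$,

$$\int_{u_0 - r}^{u_0 + r} \lambda(u, v_0)\,(-2\mathfrak{x}_u\mathfrak{x}_v)_{v = v_0}\, du \le -b \int_{u_0 - r}^{u_0 + r} [r^2 - (u - u_0)^2]\, du = -\frac{4r^3 b}{3}.$$

For $0 \le \epsilon \le \sigma'$, where $\sigma'$ is the smaller of the two numbers $1/(4r)$, $rb/(12a)$, we
obtain

$$\frac{\epsilon}{2} \int_{u_0 - r}^{u_0 + r} \lambda(u, v_0)\,(-2\mathfrak{x}_u\mathfrak{x}_v)_{v = v_0}\, du + \epsilon^2 I \le -\frac{2r^3 b}{3}\,\epsilon + \epsilon\,\frac{rb}{12a}\, 4r^2 a = -\frac{r^3 b}{3}\,\epsilon.$$

Hence, in $0 \le \epsilon \le \sigma'$, $D[\mathfrak{x}(\epsilon)] \le D[\mathfrak{x}] - \frac{1}{3} r^3 b \epsilon$. Replacing $\epsilon$ by $\sigma'\epsilon$, the lemma is
obtained with $\beta = \frac{1}{3} r^3 b \sigma'$.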
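
Two pieces of arithmetic in this estimate can be checked independently of the coefficient of the first-order term. Both follow from (24) and from the choice of the upper limit $rb/(12a)$ for $\epsilon$; the substitution $s = u - u_0$ is introduced here for convenience:

```latex
% Integral of lambda along the lower side L, where lambda = r^2 - (u - u_0)^2:
\[
\int_{u_0-r}^{u_0+r} \bigl[r^2-(u-u_0)^2\bigr]\,du
 = \int_{-r}^{r} (r^2 - s^2)\,ds
 = 2r^3 - \frac{2r^3}{3} = \frac{4r^3}{3} .
\]
% Second-order term: with |I| <= 4 r^2 a and 0 <= eps <= rb/(12a),
\[
\epsilon^2\,|I| \;\le\; \epsilon\cdot\frac{rb}{12a}\cdot 4r^2 a \;=\; \frac{r^3 b}{3}\,\epsilon .
\]
```

So the quadratic term is at most one quarter of the magnitude $\frac{4}{3} r^3 b$ of the integral above, times $\epsilon$.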

Lemma 5 will now be used to perform a piecewise deformation of a neighborhood
of $\mathfrak{y}$ in $\mathfrak{P}^a$. There is an open region in $G'$ in which $-2\mathfrak{y}_u\mathfrak{y}_v \le -2b < 0$.
If $\mathfrak{y}$ is ordinary, a square $K$ can be selected in this open region, and Lemma 5
immediately yields the required deformation. If $\mathfrak{y}$ is degenerate, Lemma 5 does
not apply since a surface near $\mathfrak{y}$ may have small circular boundaries in $K$; but
it must have less than $k$ small boundaries. Accordingly, in the region where
$-2\mathfrak{y}_u\mathfrak{y}_v \le -2b$, select $k$ squares $K_1, K_2, \dots, K_k$ of sufficiently small side $2r$
with sides parallel to the $u$ and $v$ axes, and at a distance $\ge 16r$ from each other
and from the boundaries of $G'$. Consider the $2\delta$-neighborhood of $\mathfrak{y}$ such that,
for any $\mathfrak{x}$ (with domain $H$) in this neighborhood:

1) those boundaries of $H'$ which correspond to the boundaries of $G'$ are displaced
from them by at most a distance $r$, and the other boundaries of $H'$ (these
will hereafter be called small boundaries) have a radius $\le r$;

2) if there are no boundary points of $H'$ within a $2r$ neighborhood of the
center of the square $K_j$, then $-2\mathfrak{x}_u\mathfrak{x}_v \le -b$ throughout this $K_j$. The desired

11 Compare [2]. The end points of L are actually transformed into points (and not 
merely slits). Of course it is possible to obtain a similar lemma and the variational
condition by using variations not depending on the theory of conformal mapping, as in [1], [2].
But it would then be necessary to distinguish two cases: varying boundary values and 
varying the domain. 





neighborhood of $\mathfrak{y}$ in $\mathfrak{P}^a$ which will be deformed is the closed $\delta$-neighborhood
$M_1 = N_1$ in $\mathfrak{P}^a$.

The piecewise deformation of $M_1$ will be obtained by applying Lemma 5 successively
for each square $K_j$. For this purpose, limit $\epsilon$ to the range $0 \le \epsilon \le \sigma$
where $\sigma$ is so small that:

3) the surface $\mathfrak{x}(\epsilon)$ of Lemma 5 for each square $K_j$ is displaced from $\mathfrak{x}$ at
most a distance $\delta/k$;

4) the boundaries of $H'(\epsilon)$ are displaced from the corresponding boundaries
of $H'$ at most a distance $r/k$.

It follows from these conditions that any surface $\mathfrak{z}$ obtained by a $k$-fold repeated
application of all these deformations satisfies 3), 4) with $\delta$, $r$ replacing $\delta/k$, $r/k$.
Because of this, conditions 1), 2) likewise apply to $\mathfrak{z}$. Replace $\epsilon$ by $\sigma\epsilon$, and $\sigma\beta$
by $\beta$, so that Lemma 5 and the above apply for $0 \le \epsilon \le 1$.

Let A1 be that subset of Mi consisting of those surfaces £ interior to Mi 
whose region IV contains boundaries at a distance < 3r from the center of Ki . 
Indicate the closure of Ai by Ai , and set Bi = Mi — Ai. If £ is any surface 
in Ai , it remains fixed in the deformation; if £ is in Pi, construct the deforma¬ 
tion £(c) of Lemma 5 for the square Ki . Indicate the boundary operator by 
©1 (boundary as subset of $ a ), the deformation operator by 35 i, and the final 
image y(l) by gi. Then 


(25) 


| © 2 )i Bi = Ei — 35 i ©1 Ei — 1S1B1, 
| or Ei ^ ®i©i Ei + $$1 Ei . 


These relations and the operators © 1 ,2)i, §1 are understood in the sense that 
they apply in a natural way to any Vietoris chain on Ei . In view of Lemma 5 
all the surfaces involved in (25) lie on , and giSi lies on tya-p ; the homology’ 
and all subsequent homologies, take place over % . It follows from (25) that 1 ^ 


(26) 


Mi ^ Ai + 2)i ©1 Ei + i$iBj = M 2 , 

Va ~ (% - Ml) + M 2 = P2. 


The surfaces in M 2 may be designated by /(£, t) in place of £(e), where 
0 ^ t ^ 1 if £ belongs to © 1 P 1 , t = 0 if £ belongs to Ai , and t = 1 if £ is in 
the interior of Bi . The surfaces in P 2 consist of M 2 and /(£, 0) where £ belongs 
to $0 — Mi . The surface £, from which /(£, t) was obtained, is called the 
pre-image of the surface /(£, t). The set P 2 may be considered as a new space 
in which distance between/(£i, ti) and/(£ 2 , fe) is defined by | £1 — £ 21 + | k — U |, 
where | £1 — £ 21 is the distance between £ 1 , £ 2 in % . 

Suppose that Mj and P 3 (j ^ k) have been constructed, and consist of sur¬ 
faces in % designated by /(£, < 1 , • • • , £/-i) where 0 S t I lor^ = 0 ort = 1 
according to the situation of £ in certain subsets of • Let Aj be the subset 


11 Any Vietoris chain in Mi is homologous by subdivision to a Vietoris chain part of which 
lies wholly in A \and the other part wholly in Bi . The homology Mi * M* is a consequence. 
Similarly for % P I • 





of Mj consisting of all the surfaces /(j, <!,•••, t,_i) in Mj where the region H' 
for | contains boundaries at a distance < 3 r from the center of the square Kj; 
set Bj = Mj — Aj . If /(j, k , • • • , t,_i) is any surface in A ,-, it remains fixed. 
If /(?> < 1 , • • • , </-i) is in Bj , deform /(j, k, • • • , <,_i), considered as a surface 
in , according to Lemma 5 for the square Kj ; Lemma 5 applies in view of 
conditions 3), 4) and 1), 2) above. Indicate the deformed surface by 
/(?> k , • • • , <,-i, //) where 0 ^ ^ 1. As in (25), (26), 


(271 1 ~ + ^ 5 ’ + & = ’ 

1 Pi - - Mi) + M,- +1 « P )+ i, 

where closure A,- or Bj , boundary S3, and image 5,- are taken as subsets of P,-. 
All the relations (27) for j = 1 , 2 , • • • , k yield 


(28) 


| Mi ^ il/*+i, 

{ ^ Pfc+l. 


It is clear from Lemma 5 that if $\mathfrak{z} = f(\mathfrak{x}, t_1, t_2, \dots, t_k)$ is any surface in
$M_{k+1}$, then

(29) $D[\mathfrak{z}] \le D[\mathfrak{x}] - \beta \sum_{\mu=1}^{k} t_\mu$.

Furthermore, we have

Lemma 6. Let $\mathfrak{x}$ be any surface in the interior of $M_1$, and $\mathfrak{z} = f(\mathfrak{x}, t_1, t_2, \dots, t_k)$
any surface in $M_{k+1}$ with $\mathfrak{x}$ as pre-image. Then $t_\nu = 1$ for at least one $t_\nu$ of
$t_1, t_2, \dots, t_k$.

Proof. If the lemma were false, there would exist a $ = /($, 4 , 4 , • • • , 4) 
in for which £ is interior to and 4 5 * 1, y = 1, 2, • • • , fc. Since 4 3 ^ 1, 
it follows that the surface /($, 4 , • • • , 4-i) belongs to A k . Hence there are 
surfaces /($', 4 , • * • , 4-i) belonging to A k which are arbitrarily near 
/(f, 4 , • * • , 4-i); f', being near f, is still interior to M\ and t [, • • • , ^-1 are 5 ^ 1. 
In particular, arbitrarily near the region H r for f are regions H[ for $' which 
contain boundaries at a distance <3 r from the center of if*. 

Since t k ~i ^ 1, the surface/(y', t [, • • • , ^_ 2 ) belongs to A k - 1 . As previously, 
there are surfaces /(y", • • • , 2 ) arbitrarily near /(?', <(,*•*, ^_ 2 ) which 

belong to . In particular, arbitrarily near are regions #2 (for y") which 
contain boundaries at a distance < 3r from the center of K k - 1 . Because H[ 
has a similar property for the square K k , it follows that Hi contains boundaries 
at a distance < 3r from the centers of Kk~ 1 and of K k . Continuing in this way, 
one finally obtains regions H k (which are regions for surfaces interior to Mi) 
with boundaries at a distance < 3r from each of the centers of Ki , K *, • • • , K k . 
But this contradicts the facts that Hu contains less than k small circles, each 





small circle has a radius $\le r$ (see 1), 2) above), and the squares $K_1, K_2, \dots, K_k$
have a distance $\ge 16r$ from each other. The lemma is established.

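The results, (28), (29) and Lemma 6, of this piecewise deformation are summarized
in

Theorem 10. Let $\mathfrak{y}$ be a surface in $\mathfrak{P}$ not a minimal surface, and $a$ any
value $\ge D[\mathfrak{y}]$. There are two positive constants $\delta$ and $\beta$, and a piecewise deformation
of $\mathfrak{P}^a$ in itself which yields

$$\mathfrak{P}^a \sim P$$

(the homology taking place in $\mathfrak{P}^a$ and applying to any Vietoris cycle on $\mathfrak{P}^a$). Here,
$P$ is a subspace of $\mathfrak{P}^a$ consisting of surfaces $\mathfrak{z}$ of the form $\mathfrak{z} = f(\mathfrak{x}, t_1, t_2, \dots, t_k)$,
where $t_\mu$, $\mu = 1, 2, \dots, k$, takes the value 0, or values between 0 and 1, or the
value 1 according to the situation of $\mathfrak{x}$ in $\mathfrak{P}^a$, and $f(\mathfrak{x}, t_1, \dots, t_k)$ has the following
properties:

1) $f(\mathfrak{x}, t_1, \dots, t_k)$ is continuous in $\mathfrak{x}, t_1, \dots, t_k$; and $f(\mathfrak{x}, 0, 0, \dots, 0) = \mathfrak{x}$.

2) If $|\mathfrak{x} - \mathfrak{y}| > \delta$, then $t_1, \dots, t_k$ take only the value 0.

3) If $|\mathfrak{x} - \mathfrak{y}| = \delta$, $D[\mathfrak{z}] \le D[\mathfrak{x}] - \beta \sum_{\mu=1}^{k} t_\mu$.

4) If $|\mathfrak{x} - \mathfrak{y}| < \delta$, then $D[\mathfrak{z}] \le D[\mathfrak{x}] - \beta$.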

The above theorem applies automatically to andO as well as Through¬ 
out theorem 10 replace and P by Q and Q. 

11. The Main Theorems. Remarks 

On the basis of the reducibility condition (Theorem 8 or Theorem 9) and the
variational condition (Theorem 10), it is easy to prove that on each $k$-cap with
cap limit $a$ there is a minimal surface $\mathfrak{x}$ for which $D[\mathfrak{x}] = a$. For the meaning of
these terms, and such a proof, see [6] and Theorem 6 of [12]. This establishes
the validity of the Morse theory.

Main Theorem I. Let $\Gamma_1, \Gamma_2, \dots, \Gamma_k$ be $k$ non-intersecting closed rectifiable
Jordan curves in space. Suppose that the reducibility condition of §8, page 213
has been established for each curve individually. Then the Morse theory applies
to the minimal surfaces (degenerate as well as ordinary) bounded by $\Gamma_1, \dots, \Gamma_k$.

There remains the question of determining the connectivity numbers of 
where only those Vietoris cycles are considered which lie on for sufficiently 
large N. This is reduced by Theorems 2, 3 of Part I to the corresponding ques¬ 
tion for . The connectivity numbers of $ r have been determined for rather 
general classes of curves $\Gamma$ in [12], [7], [8]. They are: $R_0 = 1$, $R_1 = R_2 = \cdots = R_n = \cdots = 0$.
Hence in these cases the connectivity numbers of are likewise
$R_0 = 1$, $R_1 = R_2 = \cdots = R_n = \cdots = 0$, by Theorems 2, 3.

Main Theorem II. Let $\Gamma_1, \Gamma_2, \dots, \Gamma_k$ be $k$ non-intersecting simple closed
polygons in space. Then the Morse theory applies to the minimal surfaces (degenerate
as well as ordinary) bounded by $\Gamma_1, \dots, \Gamma_k$. In particular, if $M_n$ is





the sum of the $n$th type numbers of all blocs of minimal surfaces bounded by $\Gamma_1$,
$\Gamma_2, \dots, \Gamma_k$, and if each $M_n$ is finite, then

$$
\begin{aligned}
M_0 &\ge 1, \\
M_1 - M_0 &\ge -1, \\
&\;\;\vdots \\
M_n - M_{n-1} + \cdots + (-1)^n M_0 &\ge (-1)^n, \\
&\;\;\vdots
\end{aligned}
\tag{30}
$$

Also, $M_n = 0$ for all $n > 3k - 6 + V$, where $V$ is the total number of vertices
in all the polygons $\Gamma_\mu$, $\mu = 1, \dots, k$.

Main Theorem II is a result of Part I, and §§9, 10. No use of §§7, 8 need 
be made. 
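
The inequalities (30) can be checked mechanically for any proposed list of type numbers. The following Python sketch is only an illustration: the function name is ours, and it assumes the connectivity numbers $R_0 = 1$, $R_n = 0$ ($n \ge 1$) quoted above, so that the right-hand sides are $(-1)^n$.

```python
def satisfies_inequalities_30(type_numbers):
    """Check M_n - M_{n-1} + ... + (-1)^n M_0 >= (-1)^n for every n supplied.

    type_numbers[n] plays the role of M_n.  The right-hand sides assume the
    connectivity numbers R_0 = 1 and R_n = 0 for n >= 1.
    """
    alternating_sum = 0
    for n, m_n in enumerate(type_numbers):
        # S_n = M_n - S_{n-1}, starting from S_{-1} = 0.
        alternating_sum = m_n - alternating_sum
        if alternating_sum < (-1) ** n:
            return False
    return True


one_minimum = satisfies_inequalities_30([1])               # True
two_minima_one_saddle = satisfies_inequalities_30([2, 1])  # True
two_minima_no_saddle = satisfies_inequalities_30([2, 0])   # False
```

The last case fails because $M_1 - M_0 = -2 < -1$: two surfaces of type 0 force at least one of type 1. Only the indices actually supplied are tested, so vanishing higher type numbers have to be listed explicitly if their inequalities are to be checked as well.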

An application of the Main Theorem I is to the case discussed by Morse and
Tompkins in [7]. If each $\Gamma_i$ has the property (18), page 215, the Morse theory
and the inequalities (30) apply.

Since degenerate as well as ordinary minimal surfaces are included in the
main theorems above there are many more possibilities than in the case of one
boundary. This leads to problems of the following kind. Suppose that a
degenerate minimal surface $\mathfrak{x}$ consists of two pieces, one $\mathfrak{x}'$ bounded by a set
$(\Gamma')$ of the boundaries and the other $\mathfrak{x}''$ bounded by the remaining set $(\Gamma'')$.
What is the relation, in the form of inequalities, between the Morse type of $\mathfrak{x}$
in the space $\mathfrak{P}^{(\Gamma)}$ and the Morse types of $\mathfrak{x}'$ in $\mathfrak{P}^{(\Gamma')}$ and of $\mathfrak{x}''$ in $\mathfrak{P}^{(\Gamma'')}$?

College of the City of New York 


Bibliography 

1. Courant, R., Plateau’s problem and Dirichlet’s principle, Annals of Math., 38 (1937), pp. 679-724.

2. Courant, R., The existence of minimal surfaces of given topological structure under prescribed boundary conditions, Acta Math., 72 (1940), pp. 51-98.

3. Courant, R., Critical points and unstable minimal surfaces, Proc. Nat. Acad. Sci., 27 (1941), pp. 51-57.

4. Douglas, J., Solution of the problem of Plateau, Trans. Amer. Math. Soc., 33 (1931), pp. 263-321.

5. Douglas, J., Minimal surfaces of higher topological structure, Annals of Math., 40 (1939), pp. 205-298.

6. Morse, M., Functional topology and abstract variational theory, Mémorial des Sci. Math., vol. 92 (1939).

7. Morse, M., and Tompkins, C., The existence of minimal surfaces of general critical type, Annals of Math., 40 (1939), pp. 443-472.

8. Morse, M., and Tompkins, C., Minimal surfaces not of minimum type by a new mode of approximation, Annals of Math., 42 (1941), pp. 62-72.

9. Morse, M., and Tompkins, C., Unstable minimal surfaces of higher topological structure, Duke Math. Journ., 8 (1941), pp. 350-375.

10. Radó, T., On the problem of Plateau, Ergeb. der Math., vol. 2 (1933).

11. Shiffman, M., Bull. Amer. Math. Soc., 44 (1938), p. 637, abstract of a paper read to the society in Sept., 1938.

12. Shiffman, M., The problem of Plateau for non-relative minima, Annals of Math., 40 (1939), pp. 834-854.



Annals of Mathematics
Vol. 43, No. 2, April, 1942


ON THEORIES WITH A COMBINATORIAL DEFINITION OF 

“EQUIVALENCE” 

By M. H. A. Newman 
(Received June 23, 1941) 

1 

The name “combinatorial theory” is often given to branches of mathematics 
in which the central concept is an equivalence relation defined by means of 
certain “allowed transformations” or “moves.” A class of objects is given, and 
it is declared of certain pairs of them that one is obtained from the other by a 
“move”; and two objects are regarded as “equivalent” if, and only if, one is 
obtainable from the other by a series of moves. For example, in the theory of
free groups the objects are words made from an alphabet $a, b, \dots, a^{-1}, b^{-1}, \dots$,
and a move is the insertion or removal of a consecutive pair of letters $xx^{-1}$ or
$x^{-1}x$. In combinatorial topology the objects are complexes, and the allowed
moves are “breaking an edge” by the insertion of a new vertex, or the reverse
of this process.¹ In Church’s “conversion calculus”² the rules II and III are
“moves” of this kind.

In many such theories the moves fall naturally into two classes, which may be 
called “positive” and “negative.” Thus in the free group the cancelling of a 
pair of letters may be called a positive move, the insertion negative; in topology 
the breaking of an edge, in the conversion calculus the application of Rule II 
(elimination of a $\lambda$), may be taken as the positive moves. In theories that have
this dichotomy it is always important to discover whether there is what may be 
called a “theorem of confluence,” namely, whether if A and B are “equivalent” 
it follows that there exists a third object, C, derivable both from A and from B 
by positive moves only. A closely connected problem is the search for “end- 
forms,” or “normal forms,” i.e. objects which admit no positive move. It is 
obvious that in a theory in which the confluence theorem holds no equivalence 
class can contain more than one end-form, but there remains the question 
whether in such a class any random series of positive moves must terminate at 
the end-form, or whether infinite series of moves may also exist. 

The purpose of this paper is to make a start on a general theory of “sets of 
moves” by obtaining some conditions under which the answers to both the above 
questions are favorable. The results are essentially about “partially-ordered” 
systems, i.e. sets in which there is a transitive relation >, and sufficient condi¬ 
tions are given for every two elements to have a lower bound (i.e. for the set to 
be “directed”) if it is known that every two “sufficiently near” elements have a 
lower bound. What further conditions are required for the existence of a 
greatest lower bound is not relevant to the present purpose, and is reserved for a 
later discussion. 

¹ See Alexander [1] and Newman [1].

² See Church [1] and references there given.






As an application the normal form theorem of Church and Rosser [1] in the 
conversion calculus is derived. 


2 

We are concerned with two kinds of entities, “objects” and the “moves” per¬ 
formed on them, and each move is associated with two objects, “initial” and 
“final.” We are therefore dealing essentially with indexed 1-complexes (in 
which, therefore, a positive sense is assigned in each 1-cell), the vertices being 
the “objects,” and the positive 1-cells the “moves.” It will be convenient to 
make use of this topological terminology.³ The incidence relations are in no
way restricted: there may be many cells with the same vertices, and the initial 
and final vertices of a cell may coincide. In diagrams the positive 1-cells slope 
down the paper, and some of the terms used are chosen accordingly. 

Vertices are denoted by italic letters, cells (the single word is used from now on
for “positive 1-cell”) by the letters $\xi$, $\eta$, $\zeta$, $\omega$ with various suffixes. “$x \mu y$”
means “there is a cell with initial vertex $x$ and final vertex $y$.” An ordered set
of cells $\xi_1, \xi_2, \dots, \xi_k$ form a path $\pi$ if there are vertices $x_0, x_1, \dots, x_k$ such
that $x_{i-1}$ and $x_i$ are the vertices of $\xi_i$ for $1 \le i \le k$. The cell $\xi_i$ is direct or
reversed in $\pi$ according as it runs from $x_{i-1}$ to $x_i$ or from $x_i$ to $x_{i-1}$, and the path
is denoted by $\epsilon_1 \xi_1 + \epsilon_2 \xi_2 + \cdots + \epsilon_k \xi_k$, where $\epsilon_i$ is $\pm 1$ as $\xi_i$ is direct or reversed.

If there are no reversed cells, $\pi$ is a descending path. It is convenient to regard
a single vertex, $x$, as a “null path” with $x$ as initial and final vertex. A vertex
which is not the initial vertex of any cell is a minimal vertex, or end.

If there is at least one non-null descending path from $x$ to $y$ we write $x > y$.
$z$ is a lower (upper) bound of $x$ and $y$ if $x \ge z$ and $y \ge z$ (if $z \ge x$ and $z \ge y$).
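
These definitions translate directly into a small data structure. The sketch below is an illustration only: it assumes a finite complex given as a list of (initial vertex, final vertex) pairs, one pair per positive 1-cell, and all function names are ours.

```python
from collections import defaultdict


def descendants(cells, x):
    """Vertices reachable from x by a descending path, the null path included."""
    successors = defaultdict(set)
    for initial, final in cells:
        successors[initial].add(final)
    seen, stack = {x}, [x]
    while stack:
        vertex = stack.pop()
        for nxt in successors[vertex]:
            if nxt not in seen:
                seen.add(nxt)
                stack.append(nxt)
    return seen


def lower_bounds(cells, x, y):
    """All z that can be reached by descending paths from both x and y."""
    return descendants(cells, x) & descendants(cells, y)


def ends(cells):
    """Minimal vertices: vertices that are not the initial vertex of any cell."""
    vertices = {v for cell in cells for v in cell}
    initials = {initial for initial, _ in cells}
    return vertices - initials


# A "diamond": two cells leave a, and both branches meet again at b.
diamond = [("a", "x"), ("a", "y"), ("x", "b"), ("y", "b")]
common = lower_bounds(diamond, "x", "y")   # {"b"}
minimal = ends(diamond)                    # {"b"}
```

Several cells with the same pair of vertices, or a cell whose two vertices coincide, are permitted by this representation, in line with the unrestricted incidence relations of the text.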

3 

Expressed in this terminology the confluence property is 

(A) If $x_1$ and $x_2$ are connected by a path in the indexed complex $\Sigma$ they have a
lower bound.

By a simple induction on the number of cells in a path from $x_1$ to $x_2$ this property
can be deduced from the following special case of it:

(B) If $x_1$ and $x_2$ have an upper bound they have also a lower bound.

This in its turn is easily deduced from the still more special form (C):

(C) If $a \mu x_1$ and $a > x_2$, $x_1$ and $x_2$ have a lower bound.

The transition from (B) to (C) is a step towards localizing the property, and
the theorems that will be proved in this paper give conditions in which the
localization may be completed, i.e. in which (A) may be inferred from the following
condition (holding for all $a$, $x_1$ and $x_2$):

(D) If $a \mu x_1$ and $a \mu x_2$, $x_1$ and $x_2$ have a lower bound.

Note. The cell and vertex terminology, although the most convenient for 

³ The notions that arise are closely related to those of the theory of partially ordered sets,
but usually not identical. Except in the case of identity the terms of that theory are 
therefore avoided. 




our purpose, may suggest that “$x \mu y$” implies that $y$ is a “next” vertex below $x$.
Actually the force of $\mu$ is that $y$ is an element satisfying $y < x$, and lying in a
certain neighborhood of $x$. For example, all the conditions (A) to (D) are
satisfied if the vertices are taken to be the points of a vertical plane, and the 
positive 1-cells the downward sloping directed segments of length less than 1. 

4 

The simplest way of strengthening (D) so that it implies (A), is to require
that paths descending from $x_1$ and $x_2$ to their lower bound shall each contain one
cell; or, in terms of moves, that if two moves are possible on an object $X$, they
can also be performed one after the other, and give the same result in either
order.

Theorem 1. Let $\Sigma$ be such that if $a \mu x$ and $a \mu y$, and $x \ne y$, there exists $b$ such
that $x \mu b$ and $y \mu b$. Then property (A) holds.

Let “$x \nu y$” denote “$x \mu y$ or $x = y$.” We prove that if $a \mu x$ and $a \mu y_1 \mu y_2 \mu \cdots \mu y_k$,
there is a $b_k$ such that $x \nu b_1 \nu b_2 \nu \cdots \nu b_k$ and $y_k \nu b_k$, a stronger form of (C).
Suppose this proved for $k - 1$ (the case $k = 1$ following immediately from the
datum), and let $x \nu b_1 \nu b_2 \nu \cdots \nu b_{k-1}$, and $y_{k-1} \nu b_{k-1}$. If $y_{k-1} = b_{k-1}$ take $b_k$ to
be $y_k$. If $y_{k-1} \mu b_{k-1}$, since also $y_{k-1} \mu y_k$ there exists a $b_k$ such that $b_{k-1} \nu b_k$ and
$y_k \nu b_k$; and this completes the induction.

Corollary 1.1. The theorem remains true if “$x \nu b$ and $y \nu b$” is substituted for
“$x \mu b$ and $y \mu b$” in the enunciation. (No change is needed in the proof.)

This almost trivial result is sufficient to settle many of the more familiar
theorems of the kind that we are considering. In the “word groups” already
referred to, a move is to be regarded as completely determined by the initial
and final words, (so that e.g. $x x^{-1} x \to x$ is regarded as the same move whether
the first or last two letters are cancelled). Hence two pairs $x x^{-1}$ and $y y^{-1}$
(where $x$ and $y$ may be of the form $z^{-1}$) in the same word $W$, that give rise to
different possible moves on $W$, have no common letter and give the same result
if cancelled in either order. Since every series of positive moves (cancellations)
terminates it follows that all such series starting from a given word $W$ lead to a
common end-form.
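
The word-group case is easy to try out. The following Python sketch is ours and makes one notational assumption: a lower-case letter stands for a generator and the matching upper-case letter for its inverse. A move is identified with its result, as in the text, and the cancellations are applied in random order.

```python
import random


def positive_moves(word):
    """Words obtained from `word` by cancelling one adjacent pair x x^-1 or x^-1 x.

    The result is a set, so two cancellations that give the same word count
    as one and the same move.
    """
    results = set()
    for i in range(len(word) - 1):
        p, q = word[i], word[i + 1]
        if p != q and p.lower() == q.lower():
            results.add(word[:i] + word[i + 2:])
    return results


def random_descent(word, rng):
    """Apply randomly chosen positive moves until none is possible.

    Returns the end-form together with the number of moves used.
    """
    steps = 0
    while True:
        moves = sorted(positive_moves(word))
        if not moves:
            return word, steps
        word = rng.choice(moves)
        steps += 1


rng = random.Random(0)
outcomes = {random_descent("baAbBAc", rng) for _ in range(20)}
print(outcomes)  # {('bAc', 2)}
```

Every run ends at the same end-form, and here also after the same number of cancellations, since each move shortens the word by two letters.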

Theorems of the Jordan-Hölder type also belong to this category. The kernel
of these theorems is a theorem on modular lattices (say with the partial ordering
$>$ and the operations $\vee$ and $\wedge$). If $X$, $Y$, $Z$ are consecutive elements in a
descending chain, $S$, in such a lattice let the chain $S'$ obtained by substituting
$Y'$ for $Y$ be said to be directly related to $S$ ($S'$ dr $S$) if $X = Y \vee Y'$ and
$Z = Y \wedge Y'$; and $S'$ shall be related to $S$ if it is obtainable from $S$ by a succession
of such steps. The theorem in question is then that from any two finite
descending chains, $S$ and $S'$, from $A$ to $B$, a pair of related chains $S_1$ and $S_1'$,
can be obtained by the insertion of a finite number of additional terms in $S$ and $S'$
respectively. This is evidently a “confluence” theorem. To apply Theorem 1 we take
as a typical vertex of $\Sigma$ the class $[S]$ of all chains related to a chain $S$, and as a
positive 1-cell the ordered pairs of classes $[S_1]$, $[S_2]$, where $S_2$ is obtained from $S_1$




by the insertion of one additional term, say $P$, between $X$ and $Y$. Then if
$S_1'$ dr $S_1$, the insertion of a suitable term in $S_1'$ gives a chain $S_2'$ related to $S_2$;⁴
and hence more generally any member of $[S_1]$ can be made into a chain related
to $S_2$, by the insertion of one suitable term. Two successive “positive moves”
on $[S_1]$ can therefore be represented by two successive insertions of new elements
in the same chain $S$, and evidently the order in which they are inserted does
not affect the result. The system therefore fulfils the conditions of Theorem 1.
But any two chains descending from $A$ to $B$ have an “upper bound” in $\Sigma$,
namely the class $[AB]$. Therefore they have a “lower bound,” and this is the
required result.


5

In these examples it is obvious that if an end-form exists it is reached by ran¬ 
dom descent. This is necessarily so in all systems with non-interference of 
moves: 

Theorem 2. Under the conditions of Theorem 1, if there is a descending path
of $k$ cells from $a$ to an end $e$, no descending path from $a$ contains more than $k$ cells.

If $k = 1$, $\Sigma$ cannot contain a cell $ay$ with $y \ne e$, since if it does $b$ exists such
that $y \mu b$ and $e \mu b$, and $e$ is not an end. In the general case let $\pi$ be a descending
path $\xi_1 + \xi_2 + \cdots + \xi_k$ joining $a$ to $e$, and let $\eta_1 + \eta_2 + \cdots + \eta_j$ be any
descending path from $a$. Let $\xi_1$ and $\eta_1$ be cells $ax$ and $ay$. If $x = y$ it follows
immediately from an induction that $j \le k$. If not, let the cells $\zeta$ and $\omega$ descend
from $x$ and $y$ to the common vertex $w$. By Theorem 1 there is a descending path
$\sigma$ from $w$ to a vertex $\le e$, i.e., since $e$ is an end, to $e$ itself. Since $\xi_2 + \cdots + \xi_k$
has $k - 1$ cells, $\zeta + \sigma$ has, by an inductive hypothesis, at most $k - 1$ cells;
therefore $\omega + \sigma$, and finally also $\eta_2 + \cdots + \eta_j$, have at most $k - 1$ cells,
i.e. $j \le k$.

Corollary 2.1. Every descending path from $a$ is part of a descending path of $k$
cells from $a$ to $e$ (i.e. there is “random descent” to $e$).

That Theorem 2 and Corollary 2.1 fail if the condition is weakened as in 
Corollary 1.1 is shown by the example in Fig. 1, (positive cells slope downward). 
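
The difference between the two hypotheses can be seen on very small complexes. The Python sketch below is ours; the three-vertex complex in it is an assumed example chosen for illustration and is not claimed to be the one drawn in Fig. 1. A complex is again a list of (initial vertex, final vertex) pairs, and the function assumes there are no closed descending paths.

```python
def maximal_path_lengths(cells, start):
    """Lengths of all maximal descending paths from `start`.

    Assumes a finite complex without closed descending paths; otherwise the
    recursion would not terminate.
    """
    successors = {}
    for initial, final in cells:
        successors.setdefault(initial, []).append(final)

    def walk(vertex):
        if vertex not in successors:      # an end: only the null path remains
            return {0}
        return {1 + n for nxt in successors[vertex] for n in walk(nxt)}

    return walk(start)


# Condition of Theorem 1: the two cells at a are completed by single cells.
diamond = [("a", "x"), ("a", "y"), ("x", "b"), ("y", "b")]

# Weakened condition of Corollary 1.1: from x and y the common vertex y is
# reached by one cell and by the null path respectively.
weakened = [("a", "x"), ("a", "y"), ("x", "y")]

diamond_lengths = maximal_path_lengths(diamond, "a")    # {2}
weakened_lengths = maximal_path_lengths(weakened, "a")  # {1, 2}
```

In the second complex the end $y$ is reached from $a$ by one cell, yet another descending path from $a$ has two cells, so the conclusion of Theorem 2 no longer holds.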

The main criteria for “confluence” are established in Theorems 3, 4, 5, and 9, 
all of which are independent. It is Theorems 5 and 9 that are used in the 
application to the conversion calculus. 

Theorem 3. In an indexed complex in which all descending paths are finite , 
(D) implies (A). 

(Note that in such a complex “>” is a proper ordering, since if x > x an 
infinite descending path is obtained by going round and round the re-entrant 
path from x to x.) 
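
For a finite complex the statement can be tested by exhaustive search. The Python sketch below is ours and purely illustrative: it checks properties (D) and (A) directly from their definitions and, since the complex is finite, tests finiteness of descending paths as the absence of closed descending paths. The second example, with a closed descending path, is an assumed illustration and is not taken from the figures of the paper.

```python
from collections import defaultdict
from itertools import combinations


def successors(cells):
    succ = defaultdict(set)
    for initial, final in cells:
        succ[initial].add(final)
    return succ


def reachable(neighbours, start):
    """Vertices reachable from `start` along `neighbours`, `start` included."""
    seen, stack = {start}, [start]
    while stack:
        vertex = stack.pop()
        for nxt in neighbours.get(vertex, ()):
            if nxt not in seen:
                seen.add(nxt)
                stack.append(nxt)
    return seen


def property_D(cells):
    """Any two final vertices of cells with a common initial vertex have a lower bound."""
    succ = successors(cells)
    for a in list(succ):
        for x1, x2 in combinations(sorted(succ[a]), 2):
            if not reachable(succ, x1) & reachable(succ, x2):
                return False
    return True


def property_A(cells):
    """Any two vertices connected by a path (cells taken in either sense) have a lower bound."""
    succ = successors(cells)
    either_sense = defaultdict(set)
    for initial, final in cells:
        either_sense[initial].add(final)
        either_sense[final].add(initial)
    for x1, x2 in combinations(sorted(either_sense), 2):
        connected = x2 in reachable(either_sense, x1)
        if connected and not reachable(succ, x1) & reachable(succ, x2):
            return False
    return True


def all_descending_paths_finite(cells):
    """In a finite complex: true exactly when no descending path returns to its start."""
    succ = successors(cells)
    return all(v not in reachable(succ, w) for v in list(succ) for w in succ[v])


finite = [("a", "x"), ("a", "y"), ("x", "u"), ("y", "v"), ("u", "w"), ("v", "w")]
looping = [("b", "a"), ("b", "c"), ("c", "b"), ("c", "d")]

finite_report = (property_D(finite), all_descending_paths_finite(finite), property_A(finite))
looping_report = (property_D(looping), all_descending_paths_finite(looping), property_A(looping))
# finite_report  == (True, True, True)
# looping_report == (True, False, False)
```

In the second complex (D) holds, but the closed path between $b$ and $c$ gives an infinite descending path, and the two ends $a$ and $d$ are connected without having a lower bound.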


⁴ Namely, if $X$ and $Y$ are in $S_1'$, insert $P$ itself; if $XYZ$ and $XY'Z$ are consecutive terms
of $S_1$ and $S_1'$ respectively, insert $P' = Y' \wedge P$ in $S_1'$; if $UXY$ and $UX'Y$, insert
$P'' = X' \vee P$. It is easily shewn that in the second case $XPYZ$ is related to $XY'P'Z$, in the third
$UXPY$ to $UP''X'Y$. Cf. Birkhoff [1] p. 37.





The symbol $[\xi]_k$ is used as an abbreviation for $\xi_1 + \xi_2 + \cdots + \xi_k$. It is
convenient to allow the value $k = 0$, $[\xi]_0$ being a null path.

A peak of a path is the common vertex of a successive pair of cells $-\xi + \eta$,
(“up” before “down”).

Let $[\xi]_j$ and $[\eta]_k$ be paths descending from a vertex $a$ to vertices $b$ and $c$ respectively.
Let $\pi_1$ be the path $-[\xi]_j + [\eta]_k$, and let it be assumed that paths
$\pi_2, \pi_3, \dots, \pi_r$, each leading from $b$ to $c$, have been defined. Let $\Sigma_r$ be the
(finite) indexed subcomplex of $\Sigma$ formed by all the cells occurring in the paths
$\pi_1, \dots, \pi_r$. The depth in $\Sigma_r$ of a vertex $x$ is defined to be the maximum
possible number of cells in a descending path from $a$ to $x$ in $\Sigma_r$, (or 0 if there
is no such path). Thus the depth of any vertex in $\Sigma_{r+1}$ is not less than its
depth in $\Sigma_r$.

If $\pi_r$ contains no peak, $\pi_{r+1}$ is not defined. If it contains at least one peak,
choose one, say $y$, of minimum depth in $\Sigma_r$ among peaks of $\pi_r$. Let the vertices
immediately preceding and succeeding $y$ on $\pi_r$ be $u$ and $v$, the (positive) cells
$yu$ and $yv$ being $\omega$ and $\omega'$ respectively. There exist, by (D), paths $\sigma$ and $\tau$,
(either or both of which may be null), descending from $u$ and $v$ to a common
vertex $w$, and $\pi_{r+1}$ is formed from $\pi_r$ by substituting $\sigma - \tau$ for $-\omega + \omega'$. The
effect is to replace the peak at $y$ by at most two new ones, of depths in $\Sigma_{r+1}$
at least 1 greater than that of $y$ in $\Sigma_r$ (or zero). By a simple induction it
follows that $\pi_r$ has at most $r$ peaks; and if we make the inductive hypothesis
that the peaks of $\pi_{2^n}$ are of depth at least $n$ in $\Sigma_{2^n}$ it follows that, if $r \le 2^n$,
at most $2^n - r$ peaks of $\pi_{2^n + r}$ are of depth $n$ or less in $\Sigma_{2^n + r}$. Thus the induction
is complete and it is proved that if $r \ge 2^n$ all peaks of $\pi_r$ are of depth at least $n$.

If $[\zeta]_m$ is a descending path of maximum length in $\Sigma_r$ from $a$ to a given vertex $z$,
where $r \ge 2^m$, $\zeta_i$ belongs to $\Sigma_{2^i}$ for $i = 1, 2, \dots, m$. Suppose that, for a certain $i$,
$\Sigma_j$ is the first of the $\Sigma$’s to contain $\zeta_i$, where $j > 2^i$. Then $[\zeta]_i$ is a descending
path in $\Sigma_j$ of maximum length to its final vertex, $z_i$, since any longer one could




be used as part of a longer descending path to $z$ in $\Sigma_r$. Thus the depth of $z_i$
in $\Sigma_j$ is $i$. Since $\zeta_i$ belongs to $\Sigma_j$ but to no earlier $\Sigma$, it is a cell of one of the
descending paths that eliminate a peak, $y$, in the formation of $\pi_j$ from $\pi_{j-1}$.
By the result of the preceding paragraph, if the depth of $y$ in $\Sigma_{j-1}$ is $p$, $j - 1 < 2^{p+1}$.
But the depth of $z_i$ in $\Sigma_j$ exceeds that of $y$ in $\Sigma_{j-1}$ by at least 2, i.e. $i \ge p + 2$.
Therefore $j - 1 < 2^{i-1}$, $j < 1 + 2^{i-1} \le 2^i$, contrary to the hypothesis.

The series of paths $\pi_1, \pi_2, \dots$ terminates. If not choose, for each $n$, a
maximal descending path, $\sigma_n$, in $\Sigma_{2^n}$ from $a$ to a peak of $\pi_{2^n}$. Since the first
cell of each of these paths is in the finite complex $\Sigma_2$ there is at least one cell,
$\omega_1$, which is the first cell of $\sigma_n$ for an infinity of $n$. Since the second cell of each
of this infinite subsequence is in $\Sigma_4$ there is at least one cell, $\omega_2$, such that
$\omega_1 + \omega_2$ is the beginning of an infinity of the $\sigma_n$. Continuing in this way we
obtain an infinite descending path $\omega_1 + \omega_2 + \cdots$ in $\Sigma$, contrary to its given
property.

Thus the series of paths $\pi_r$ from $b$ to $c$ terminates in a path $\pi_q$, which, since
it has no peak, must descend or ascend directly from $b$ to $c$, or else descend
from $b$ to a vertex $w$ and then rise to $c$.

The finiteness condition imposed on descending paths in Theorem 3 cannot
be replaced by the corresponding “completeness” condition, that every descending
chain of vertices has a lower bound in $\Sigma$. This is shown by the complex
in Fig. 3, in which the vertices $c$ and $d$ are lower bounds of all sets of vertices
not containing either of them; but $c$ and $d$ have themselves no lower bound.

6. Topology of $\Sigma$

The complex $\Sigma$ can be made into a 2-complex, $\Sigma^2$, by adding a 2-cell bounded
by each of the 1-cycles $\omega + \sigma - \tau - \omega'$ occurring in the proof of Theorem 3
(one for each $\pi_r$). Every component of $\Sigma^2$ is simply connected. Any two paths,
$\pi$ and $\pi'$, connecting vertices $a_1$ and $b_1$ are deformable, by the method of Theorem 3,
into paths $\sigma_1 - \tau_1$ and $\sigma_1' - \tau_1'$ respectively, where $\sigma_1$ and $\tau_1$ descend to a
vertex $a_2$, $\sigma_1'$ and $\tau_1'$ to $b_2$; and if $\simeq$ stands for “is deformable into,”
$-\tau_1 + \tau_1' \simeq \sigma_2 - \tau_2$, and $-\sigma_1 + \sigma_1' \simeq \sigma_2' - \tau_2'$, where $\sigma_2$, $\tau_2$, $\sigma_2'$, $\tau_2'$ are descending paths,
the first two to $a_3$, the second two to $b_3$. In this way paths $\sigma_n$ and $\tau_n$ descending
to $a_{n+1}$, and $\sigma_n'$ and $\tau_n'$ to $b_{n+1}$, are defined for every $n$. If an infinity of different
paths descending from $a_1$ could be made from the $\sigma_i$, $\tau_i$, $\sigma_i'$ and $\tau_i'$, an infinity
of them would necessarily contain one or other of $\sigma_1$, $\sigma_1'$, say $\sigma_1$; and of these
an infinity would contain one or other of $\sigma_2$, $\sigma_2'$, say $\sigma_2$; and so on. The
descending path $\sigma_1 + \sigma_2 + \cdots$ so constructed would have an infinity of different
paths as subsets, and would therefore be infinite, contrary to the postulated
property of $\Sigma$. The number of different paths must therefore be finite.

It follows that for some $m$, $\sigma_m = \tau_m = \sigma_m' = \tau_m' = 0$; i.e.

$$\pi - \pi' \simeq \sigma_1 - \tau_1 + \tau_1' - \sigma_1' \simeq \cdots \simeq \sigma_m - \tau_m + \tau_m' - \sigma_m' = 0.$$





7 

To establish our second criterion, we suppose that $\Sigma$ is the sum of two subcomplexes,⁵
$L$ and $R$, and shall use the terms “$L$-cell,” “$R$-path,” etc., in an
obvious sense. “$x \lambda y$” and “$x \rho y$” shall mean that there is a cell $xy$ in $L$ or $R$
respectively, and $xLy$ and $xRy$ that $x > y$ in $L$ or $R$. (In diagrams the positive
$L$- and $R$-cells will slope down towards the left and right respectively.) We
denote by Q the following property of $\Sigma$.

(Q) If $x \lambda y$ and $x \rho z$ there exists a vertex $w$ such that $zLw$, and either $y = w$ or $yRw$.
(We require $zLw$, which is not necessarily implied by $z = w$. The possibility
that $y = z$ is not excluded.)

Theorem 4. If, in a complex with the property Q, all $L$-paths are finite, then
if $xLy$ and $xRz$ there exists a vertex $w$ such that $zLw$ and either $y = w$ or $yRw$.
If all $L$-paths are finite, then, in property (Q), $z \ne w$.

It is sufficient to prove the theorem when $x \lambda y$, the general case then following
by induction. Let $\eta$ be an $L$-cell from $x$ to $y$, and $[\zeta]_k$ an $R$-path from $x$ to $z$.
Let $\pi_1$ be $-\eta + [\zeta]_k$, and suppose, inductively, that a path $\pi_r$ from $y$ to $z$ has
already been defined.

If $\pi_r$ has no peak $\pi_{r+1}$ is not defined; otherwise let $v_0$ be the last peak on $\pi_r$,
from $y$ towards $z$. We assume, inductively, that in proceeding from $y$ towards $z$
the direct (“downward”) cells of $\pi_r$ are in $R$ and the reversed cells in $L$, an
assumption evidently satisfied for $r = 1$. The part $v_0 \cdots z$ of $\pi_r$ is of the form
$[\omega]_q - \sigma$ where $[\omega]_q$ is a descending $R$-path and $\sigma$ a descending $L$-path. $\sigma$ may
be null, but $q \ne 0$ since $v_0$ is a peak. Let $\xi_1$ be the predecessor of $\omega_1$ in $\pi_r$,

⁵ This always means “indexed subcomplex,” the positive direction in each 1-cell agreeing
with that in $\Sigma$.





(and therefore an $L$-cell). Assuming inductively that $\xi_m$ is defined, for some
$m \le q$, as an $L$-cell with the same initial vertex as $\omega_m$, let $\sigma_m$ and $\sigma_m'$ be the $R$-
and $L$-paths which, by (Q), descend from the final vertices of $\xi_m$ and $\omega_m$ to a
common vertex. Then $\sigma_m'$ is not null, and we define $\xi_{m+1}$ to be its first cell:
say $\sigma_m' = \xi_{m+1} + \tau_m$. The path $\pi_{r+1}$ is now defined to be the result of substituting
$\sigma_1 - \tau_1 + \sigma_2 - \tau_2 + \cdots + \sigma_q - \sigma_q'$ for $-\xi_1 + [\omega]_q$. It evidently has
the property that reversed cells are in $L$ and direct cells in $R$, and the inductive
definition of $\pi_r$ is therefore completed.

If $v_q$ is the final vertex of $\omega_q$, and $-\sigma$ is the portion $v_q \cdots z$ of $\pi_r$, $\sigma$ is a
descending $L$-path from $z$. The corresponding portion of $-\pi_{r+1}$ is $\sigma + \sigma_q'$,
with at least one more cell. Since all $L$-paths are finite it follows that the process
of constructing paths $\pi_r$ terminates after a certain number, $k$, of steps, i.e. $\pi_k$
has no peaks and is therefore a descending (possibly null) $R$-path from $y$ to a
vertex $w$, followed by an ascending (non-null) $L$-path from $w$ to $z$. Thus $yRw$
or $y = w$, and $zLw$.

Corollary 4.1. If (Q) is strengthened by excluding the possibility $y = w$,
Theorem 4 may be strengthened in the same way. (Obvious from the method of
proof.)

Corollary 4.2. A descending $L$-path and a descending $R$-path have at most
one common vertex. If the two paths have their initial and final vertices, $a$ and $b$,
in common, i.e. if $aLb$ and $aRb$, there is a vertex $c$ such that $bLc$ and $bRc$ (the
alternative $b = c$ being impossible in this case); and a vertex $d$ such that $cLd$
and $cRd$; and so on. The path $a \cdots b \cdots c \cdots d \cdots$ is an infinite descending
$L$-path.

In particular a cell cannot be both an $L$- and an $R$-cell. This does not mean
that the condition (Q) could be weakened in Theorem 4 by adding “if $y \ne z$”
at the beginning. That this would make the theorem untrue is shown by the
example in Fig. 3, where segments sloping down towards the left and right
belong to $L$ and $R$ respectively, and the cells marked b<c are in both $L$ and $R$.




The condition (Q) is satisfied for pairs with $y \ne z$, and no descending $L$-path has
more than two cells; but $c$ and $d$ have no lower bound.

Theorem 4 also fails if the alternative “$z = w$” is allowed in (Q). This is
seen by omitting the vertices in Fig. 3 so that each aiC becomes a single $R$-cell
(but not now an $L$-cell).


8 

We return to the consideration of indexed 1-complexes in general. A set of
(positive) cells, $E_x$, is at $x$ if $x$ is the initial vertex of every member of $E_x$.
(The null-set is at every vertex.) We suppose that if $\xi$ is a cell $xz$, each cell $\eta$
at $x$ has a finite set of cells at $z$, called $\eta \mid \xi$, assigned to it as its $\xi$-derivate.⁶
The $\xi$-derivate, $E_x \mid \xi$, of a set $E_x$ is the logical sum of the $\xi$-derivates of members
of $E_x$. (An appearance of the symbol $E_x \mid \xi$ implies that $\xi$ is at $x$.) If $\pi$ is a
descending path from $x$, $E_x \mid \pi$, the derivate of $E_x$ by continuation along $\pi$, is
defined inductively by the equation

$$E_x \mid (\pi + \xi) = (E_x \mid \pi) \mid \xi;$$

and if $\pi$ is null, $E_x \mid \pi = E_x$. We usually write $E \mid (\pi + \xi)$ without brackets:
$E \mid \pi + \xi$. The path $[\xi]_m$ is a development of $E_x$ if, for $1 \le i \le m$, $\xi_i \in E_x \mid [\xi]_{i-1}$.
The development is complete if $E_x \mid [\xi]_m = 0$, partial if not.
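
The definitions of this section can be tried out on a toy model. The Python sketch below is ours and is not the illustrative example of §13: it assumes that a vertex is a finite set, that a cell at a vertex removes one element from it, and that the derivate of one removal by another is the same removal performed at the new vertex (or the null set when the two removals coincide).

```python
def final_vertex(cell):
    """A cell is a pair (vertex, element); performing it removes the element."""
    vertex, element = cell
    return vertex - {element}


def derivate(eta, xi):
    """The xi-derivate of eta, for two cells eta and xi at the same vertex."""
    (_, e), (_, f) = eta, xi
    if e == f:
        return set()                       # a cell has null derivate by itself
    return {(final_vertex(xi), e)}         # the same removal, one vertex lower


def derivate_set(E, xi):
    """Logical sum of the xi-derivates of the members of E."""
    return set().union(*(derivate(eta, xi) for eta in E))


def continue_along(E, path):
    """Derivate of E by continuation along a descending path (a list of cells)."""
    for xi in path:
        E = derivate_set(E, xi)
    return E


def is_development(E, path):
    """Each cell of the path must belong to the derivate of E along the earlier cells."""
    for xi in path:
        if xi not in E:
            return False
        E = derivate_set(E, xi)
    return True


def is_complete_development(E, path):
    return is_development(E, path) and not continue_along(E, path)


x = frozenset("pqr")
E = {(x, "p"), (x, "q")}
path1 = [(x, "p"), (x - {"p"}, "q")]
path2 = [(x, "q"), (x - {"q"}, "p")]
both_complete = is_complete_development(E, path1) and is_complete_development(E, path2)
same_end = final_vertex(path1[-1]) == final_vertex(path2[-1])
```

In this model both orders of removal are complete developments of the two-member set `E`, each with two cells, and they end at the same vertex.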

The letters CD are used as an abbreviation for “complete development.” 
We postulate the following conditions on the derivates: 

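($\Delta_1$) $\eta \mid \xi$ is null if, and only if, $\eta = \xi$;

($\Delta_2$) if $\eta \ne \zeta$, $(\eta \mid \xi) \cap (\zeta \mid \xi) = 0$;

($\Delta_3$) if $\eta$ and $\zeta$ are distinct cells at $x$, there exist developments $\kappa_\eta$ and $\kappa_\zeta$ of $\eta \mid \zeta$
and $\zeta \mid \eta$ respectively, with a common final vertex $w$.

($\Delta_4$) with the notation of ($\Delta_3$), $\xi \mid (\eta + \kappa_\zeta) = \xi \mid (\zeta + \kappa_\eta)$, for any $\xi$ at $x$.

It follows from ($\Delta_4$), by summation, that the derivates of any set $E_x$ by continuation
along $\eta + \kappa_\zeta$ and $\zeta + \kappa_\eta$ are the same. A further consequence is that
$\kappa_\eta$ and $\kappa_\zeta$ are complete developments of $\eta \mid \zeta$ and $\zeta \mid \eta$ respectively. For

$$(\eta \mid \zeta) \mid \kappa_\eta = \eta \mid \zeta + \kappa_\eta = \eta \mid \eta + \kappa_\zeta = 0.$$

From ($\Delta_2$) it follows by induction on the length of $\pi$ that if $E_x^1 \cap E_x^2$ is null,
$(E_x^1 \mid \pi) \cap (E_x^2 \mid \pi)$ is also null.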

Lemma 1. If $\pi$ is a development of $E_x^2$, and $E_x^1 \mid \pi \subseteq E_x^2 \mid \pi$, then $E_x^1 \subseteq E_x^2$.
Let $\pi$ be $[\xi]_m$. Let $j$ be the least integer such that $E_x^1 \mid [\xi]_j \subseteq E_x^2 \mid [\xi]_j$. If the
lemma is false $j \ge 1$, and $E_x^1 \mid [\xi]_{j-1}$ contains a cell $\zeta$ not in $E_x^2 \mid [\xi]_{j-1}$. Hence
$\zeta \ne \xi_j$ and $\zeta \mid \xi_j$ is a non-null subset of $E_x^1 \mid [\xi]_j$ not contained in $E_x^2 \mid [\xi]_j$,
contrary to the hypothesis.

In particular if $E_x^1 \mid \pi = 0$, $E_x^1 \subseteq E_x^2$.

It is assumed, further, that a relation J holds between certain of the pairs of 


⁶ For an illustrative example of derivates see §13.





cells at a vertex, and a set $E_a$ is defined to be a J-set if $\xi J \eta$ and $\eta J \xi$ for every
distinct pair $\xi$, $\eta$ in $E_a$. (Thus all sets with less than two members are J-sets.)

($J_1$) If $\xi J \eta$, $\xi \mid \eta$ has precisely one member.

($J_2$) If $\eta_1 \in \xi_1 \mid \zeta$ and $\eta_2 \in \xi_2 \mid \zeta$, and if $\xi_1 J \xi_2$ or $\xi_1 = \xi_2$, then $\eta_1 J \eta_2$ or $\eta_1 = \eta_2$.
It follows from $J_2$ that if $E$ is a J-set, $E \mid \zeta$, and more generally $E \mid \pi$, is a J-set.
From $J_1$ it follows that for no $\xi$ does $\xi J \xi$.

It is now agreed that a set denoted by E , E a , etc., shall be finite. (A CD of 
any set is finite by definition.) 

Lemma 2. If the J-set $E$ has $k$ members, all CD’s of $E$ have $k$ cells and the same
final vertex, and all partial developments are parts of CD’s.

If $\eta$ and $\xi$ are in $E$, $\eta \mid \xi$ has one member if $\eta \ne \xi$, and none if $\eta = \xi$. Thus
$E \mid \xi$ is a J-set with $k - 1$ members, and the development comes to an end after
$k$ steps.

Let $\eta + \sigma$ and $\zeta + \tau$ be any two CD’s of $E$, ending at $y$ and $z$ respectively;
and let $\eta \ne \zeta$. By $J_1$ and $\Delta_3$ the sets $\eta \mid \zeta$ and $\zeta \mid \eta$ are single cells, $\eta'$ and $\zeta'$,
with a common final vertex, $w$. By $\Delta_4$, $E \mid \eta + \zeta' = E \mid \zeta + \eta'$, a set with
$k - 2$ members. Let $\pi$ be a CD of this set, ending at $u$. By an inductive
hypothesis $\zeta' + \pi$ and $\sigma$, being CD’s of $E \mid \eta$, a J-set with $k - 1$ members, have
the same final vertex: $u = y$. Similarly $u = z$, and so $y = z$.

Lemma 3. If $E_x$ is a J-set, and $E_x^1$ any set at the same vertex $x$, all derivates
of $E_x^1$ by continuation along CD’s of $E_x$ are identical.

With the notation of the previous lemma,

$$
\begin{aligned}
E_x^1 \mid \eta + \sigma &= E_x^1 \mid \eta + \zeta' + \pi, && \text{(inductive hypothesis)}, \\
&= E_x^1 \mid \zeta + \eta' + \pi, && (\Delta_4), \\
&= E_x^1 \mid \zeta + \tau, && \text{(inductive hypothesis)}.
\end{aligned}
$$

9 

If $E_x^1$ is a J-set, $E_x \mid E_x^1$ denotes the continuation of $E_x$ along a CD of $E_x^1$.
By Lemma 3 it is independent of the CD chosen. $E_x \mid E_x^1 + E_x^2 + \cdots + E_x^k$ is
defined inductively to be $(E_x \mid E_x^1 + \cdots + E_x^{k-1}) \mid (E_x^k \mid E_x^1 + \cdots + E_x^{k-1})$.
Thus if $[\xi]_k$ is a CD of $E_x^1$, $E_x \mid E_x^1$ and $E_x \mid [\xi]_k$ are alternative notations for the
same set.

We now come to the main results of the paper. All the conditions $\Delta$ and $J$
are purely local, and involve only a fixed number of given cells.

Theorem 5. Let $\pi_1$ and $\pi_2$ be paths in a 1-complex with the properties $J_1$, $J_2$ and
$\Delta_1$-$\Delta_4$ descending from $a$ to vertices $b$ and $c$. Then there exist paths $\pi_3$ and $\pi_4$
descending from $b$ and $c$ to a common vertex $d$, such that if $E_a$ is a set at $a$,

$$E_a \mid \pi_1 + \pi_3 = E_a \mid \pi_2 + \pi_4.$$

We first prove the following special case.

Lemma 5.1. If $E_a^1$ and $E_a^2$ are J-sets, the CD’s of $E_a^1 \mid E_a^2$ and $E_a^2 \mid E_a^1$ have the
same final vertex, and if $E_a$ is any set at $a$, $E_a \mid E_a^1 + E_a^2 = E_a \mid E_a^2 + E_a^1$.




(Since all sets called “$E$”, “$E^1$”, etc., in the following proof are at $a$, the suffix
$a$ will be omitted.)

Case 1.⁷ $E^1 \cap E^2 = 0$. Let $n(E)$ denote the number of elements in $E$. We
proceed by induction on $m = n(E^1 \mid E^2) + n(E^2 \mid E^1)$. Excluding the trivial
case where one set $E^i$ is null, the minimum possible value of $m$ is 2. This
minimum is only attained if $n(E^1) = n(E^2) = 1$, and Lemma 5.1 then follows
from $\Delta_3$ and $\Delta_4$. We may therefore assume that $m > 2$, and also that $n(E^1) > 1$
or $n(E^2) > 1$, say $E^1 = E^3 \cup \eta$, where $E^3 \ne 0$.

The proof depends on the fact that if $\xi$ is not in $E$, $n(E \mid \xi) \ge n(E)$; and hence
if $E^p$ and $E^q$ are J-sets satisfying $E^p \cap E^q = 0$, and $E^r \subset E^q$, then
$n(E^p \mid E^r) \le n(E^p \mid E^q)$. Thus $n(E^2 \mid E^3) \le n(E^2 \mid E^1)$ and $n(E^3 \mid E^2) < n(E^1 \mid E^2)$.
Therefore, by the inductive hypothesis, CD’s of $E^2 \mid E^3$ and $E^3 \mid E^2$ have the same
final vertex, $z$. Since $E^1$ is a J-set, $\eta \mid E^3$ is a single cell, $\xi$, and

$$
\begin{aligned}
\xi \mid (E^2 \mid E^3) = \eta \mid E^3 + E^2 &= \eta \mid E^2 + E^3 && \text{(by the inductive hypothesis)} \\
&= E^1 \mid E^2 + E^3,
\end{aligned}
$$

since $E^3 \mid E^2 + E^3 = E^3 \mid E^3 + E^2 = 0$. Thus there is a CD of $E^1 \mid E^2$ consisting
of a CD of $E^3 \mid E^2$ followed by a CD of $\xi \mid (E^2 \mid E^3)$. Since $E^3 \mid E^2$ is not
null it follows that $n(\xi \mid (E^2 \mid E^3)) < n(E^1 \mid E^2)$, and since also $(E^2 \mid E^3) \mid \xi = E^2 \mid E^1$
the inductive hypothesis may be applied to the sets $\xi$ and $E^2 \mid E^3$ at $z$.
The final vertices of CD’s of $(E^2 \mid E^3) \mid \xi$, i.e. $E^2 \mid E^1$, and of $\xi \mid (E^2 \mid E^3)$ are
therefore identical, and the latter set has been seen to be the end portion of a
CD of $E^1 \mid E^2$. The first part of the induction is therefore complete. If $E$ is
any set at $a$,

$$
\begin{aligned}
E \mid E^2 + E^1 &= E \mid E^2 + E^3 + \eta, \\
&= E \mid E^3 + E^2 + \eta, && \text{(by the inductive hypothesis applied to } E^2 \text{ and } E^3\text{)} \\
&= E \mid E^3 + \eta + E^2, && \text{(by the inductive hypothesis applied to } \xi \text{ and } E^2 \mid E^3\text{)} \\
&= E \mid E^1 + E^2.
\end{aligned}
$$

Case 2. As Case 1 save that $E^1 \cap E^2 \neq 0$. Let $E^i = E^0 \cup E^{i+2}$, for $i = 1, 2$, where $E^3 \cap E^4 = 0$. By Case 1, applied to $E^4 \mid E^0$ and $E^3 \mid E^0$, $E^4 \mid E^0 + E^3$ and $E^3 \mid E^0 + E^4$ have the same final vertex, and since $E^0 \mid E^0 = 0$ these two sets are $E^2 \mid E^0 + E^3$ and $E^1 \mid E^0 + E^4$, i.e. $E^2 \mid E^1$ and $E^1 \mid E^2$.

In the general case, to which we now turn, the result may be stated more explicitly as follows, taking $\pi_1$ and $\pi_2$ to be $[\eta]_j$ and $[\zeta]_k$.

Lemma 5.2. If $[\eta]_j$ and $[\zeta]_k$ are any paths descending from $a$ to $b$ and $c$, there exist paths $\sigma_{rs}$ and $\tau_{rs}$ (possibly null, $r = 1, \cdots, j+1$, $s = 1, \cdots, k+1$) such that

(1) $\eta_s = \sigma_{1s}$, $\zeta_r = \tau_{r1}$,

(2) $\sigma_{r+1,s}$ and $\tau_{r,s+1}$ have the same final vertex,

7 This proof of Case 1 was suggested by Dr. J. H. C. Whitehead, in place of one based on 
Theorem 4. 



(3) $\sigma_{rs}$ is a CD of $E^1_{rs}$, $= \eta_s \mid \tau_{1s} + \tau_{2s} + \cdots + \tau_{r-1,s}$, and $\tau_{rs}$ of $E^2_{rs}$, $= \zeta_r \mid \sigma_{r1} + \cdots + \sigma_{r,s-1}$,

(4) for any $E_a$, $E_a \mid \pi_1 + \tau_{1,k+1} + \cdots + \tau_{j,k+1} = E_a \mid \pi_2 + \sigma_{j+1,1} + \cdots + \sigma_{j+1,k}$.

Starting from the two given paths $[\eta]_j$ and $[\zeta]_k$ we add, one by one, the pairs of paths $\sigma_{r+1,s}$ and $\tau_{r,s+1}$ for the couples $(r, s)$ in the standard “triangular” order $(1, 1), (1, 2), (2, 1), (1, 3), \cdots$. When the time comes for $\sigma_{r+1,s}$ and $\tau_{r,s+1}$ to be added, the paths $\sigma_{rs}$ and $\tau_{rs}$, descending from a vertex $x_{rs}$, and corresponding to the earlier couples $(r, s-1)$ and $(r-1, s)$, have already been constructed as CD's of $E^1_{rs}$ and $E^2_{rs}$. Hence by the cases of Theorem 5 already settled, CD's of $E^1_{rs} \mid \tau_{rs}$ and $E^2_{rs} \mid \sigma_{rs}$, i.e. of $E^1_{r+1,s}$ and $E^2_{r,s+1}$, meet at a common vertex. These CD's are $\sigma_{r+1,s}$ and $\tau_{r,s+1}$; the induction is complete. (In the limiting cases $r = 1$ and $s = 1$ the single cells $\eta_s$ and $\zeta_r$ play the parts of $E^1$ and $E^2$ in the earlier cases.)
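
The “triangular” order of the couples $(r, s)$ can be made concrete with a short sketch. The Python below is an illustration only and is not part of the original argument; the names `triangular_order` and `predecessors_come_first` are hypothetical. It assumes that the order lists the couples one anti-diagonal $r + s = \text{const}$ at a time, which agrees with the four couples written out in the text, and it checks the one property the induction relies on: the couples $(r, s-1)$ and $(r-1, s)$ are always listed before $(r, s)$.

```python
from itertools import count, islice


def triangular_order():
    """Yield couples (r, s) with r, s >= 1, one anti-diagonal r + s = const at a time."""
    for total in count(2):
        for r in range(1, total):
            yield (r, total - r)


def predecessors_come_first(n):
    """Check, for the first n couples, that (r, s-1) and (r-1, s) were listed earlier."""
    seen = set()
    for r, s in islice(triangular_order(), n):
        for p in ((r, s - 1), (r - 1, s)):
            # Couples with a zero index do not exist; they are the limiting cases.
            if min(p) >= 1 and p not in seen:
                return False
        seen.add((r, s))
    return True


print(list(islice(triangular_order(), 4)))  # [(1, 1), (1, 2), (2, 1), (1, 3)]
print(predecessors_come_first(100))         # True
```

Any other order with the same predecessor property would serve the induction equally well; the anti-diagonal order is merely the simplest to write down.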

The proof just given provides a method of deforming $\pi_1 + \pi_3$ into $\pi_2 + \pi_4$ by a series of steps in each of which a path $\tau_{rs} + \sigma_{r+1,s}$ is replaced by a path $\sigma_{rs} + \tau_{r,s+1}$. By Lemma 5.1 a set $E_x$ at the common initial vertex of $\sigma_{rs}$ and $\tau_{rs}$, when continued along either of these paths, gives the same result, and therefore the continuation of $E_a$ along the whole path is unaffected by a single step.

Theorem 6. Any two CD's of a (finite) set $E_x$ have the same final vertex.

If $[\eta]_j$ and $[\zeta]_k$, ending in $b$ and $c$, are the developments then, with the notations of Lemma 5.2, since $\eta_s \in E_x \mid [\eta]_{s-1}$,

$$E^1_{rs} \subset E_x \mid [\eta]_{s-1} + \tau_{1s} + \cdots + \tau_{r-1,s}$$
$$= E_x \mid [\zeta]_{r-1} + \sigma_{r1} + \cdots + \sigma_{r,s-1},$$

and therefore $\sigma_{r1} + \sigma_{r2} + \cdots$ is a development of $E_x \mid [\zeta]_{r-1}$; and in particular $\sigma_{k+1,1} + \sigma_{k+1,2} + \cdots$ is a development of $E_x \mid [\zeta]_k$, $= 0$, since $[\zeta]_k$ is a CD. Thus $c = d$, and similarly $b = d$.

Corollary 6.1. Continuation of a set $E_a$ along any two CD's of a set $E^1_a$ gives the same result. This now follows from Theorem 5, $\pi_3$ and $\pi_4$ being null.

Corollary 6.1 cannot be extended to give the general monodromy property,
“continuation of E a along any two descending paths from a to b gives the same 
result.” Consider the 1-complex in Fig. 5, in which the vertices marked x are 
identical. Derivates are defined by parallel displacement downward, except 
that the derivates of xy and xz at z and y are zw and yw respectively. All sets 
are J-sets. The conditions A and J are satisfied in this complex, but continua¬ 
tion of ab to x via b gives the null set, via c the cell xz . 

Theorem 7. In a complex satisfying J and A, all developments of a finite set $E_x$ are finite.

Every set is a sum of J-sets, namely its individual members. We proceed by induction on the smallest number, $k$, of J-sets, $E^i_x$, whose sum is the given set $E_x$. (The case $k = 1$ is Lemma 2.)




There is at least one CD of $E_x$, namely $[\sigma]_k$, where $\sigma_r$ is a CD of the J-set $E^r_x \mid [\sigma]_{r-1}$. Suppose that $\zeta_1 + \zeta_2 + \cdots$ is an infinite development of $E_x$, and let $\sigma_{rs}$, $\tau_{rs}$, $E^1_{rs}$ and $E^2_{rs}$ be as in Lemma 5.2, save that $\sigma_r$ replaces $\eta_r$. Then just as in Theorem 6, $\tau_{1s} + \tau_{2s} + \cdots$ is a development of $E_x \mid [\sigma]_{s-1}$. Since, for $i < k$, $E^i_x$ is annihilated by continuation along $\sigma_i$, $E_x \mid [\sigma]_{k-1} = E^k_x \mid [\sigma]_{k-1}$.

Thus $\tau_{1k} + \tau_{2k} + \cdots$ is a development of a J-set, and so $\tau_{rk} = 0$ if $r$ exceeds a certain $q$. Therefore if $r > q$,

$$0 = E^2_{rk} = \zeta_r \mid \sigma_{r1} + \cdots + \sigma_{r,k-1}.$$

Now $\sigma_{r1} + \cdots + \sigma_{r,k-1}$ is a development of $H_r$, $= (E^1_x \cup E^2_x \cup \cdots \cup E^{k-1}_x) \mid [\zeta]_{r-1}$, and hence by Lemma 1, $\zeta_r \in H_r$. Therefore the infinite path $\zeta_{q+1} + \zeta_{q+2} + \cdots$ is a development of $H_{q+1}$, a sum of $(k - 1)$ J-sets, contrary to the inductive hypothesis.

Corollary 7.1. There are only a finite number of different developments of $E_x$. If there are an infinity, some cell $\xi_1$ of the finite set $E_x$ must come first in an infinity of developments; and some cell $\xi_2$ of the finite set $E_x \mid \xi_1$ must be second in an infinity of these developments; and so on. The path $\xi_1 + \xi_2 + \cdots$ is an infinite development of $E_x$, contrary to Theorem 7.

10 

Theorem 7 is connected with the problem of “random reduction”. To a 
“normal form” or “end-form” in a system with moves there corresponds an 
end of $\Sigma$, and to a normal form of X an end connected by a path to a given
vertex x. It follows from Theorem 5 that there is a descending path from x 
to the end, and that a vertex cannot be connected to two ends, i.e. that an 
“object” in the corresponding system cannot have two different normal forms. 
There remains, however, the possibility of an infinite descending path from a 
vertex which is also connected to an end. It will now be shown that this possi¬ 
bility is not realised in complexes satisfying the conditions A and J. 
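
The two questions raised here, which ends a vertex is connected to and whether an infinite descending path leaves it, can be illustrated on a small finite directed graph. The Python sketch below is an illustration under stated assumptions, not the construction used in the paper: the complex is represented as a dictionary from each vertex to the list of vertices reached by one descending cell, the function names are hypothetical, and finiteness of the graph is assumed so that an infinite descending path exists exactly when a cycle is reachable.

```python
def reachable_ends(graph, start):
    """Ends (vertices with no descending cell) reachable from `start`."""
    seen, stack, ends = set(), [start], set()
    while stack:
        v = stack.pop()
        if v in seen:
            continue
        seen.add(v)
        if not graph[v]:
            ends.add(v)
        stack.extend(graph[v])
    return ends


def has_infinite_descent(graph, start):
    """True if some descending path from `start` can be continued for ever.

    Assumes a finite graph, where this is the same as reaching a cycle.
    """
    WHITE, GREY, BLACK = 0, 1, 2
    colour = {v: WHITE for v in graph}

    def visit(v):
        colour[v] = GREY
        for w in graph[v]:
            if colour[w] == GREY:
                return True          # back edge: a cycle is reachable
            if colour[w] == WHITE and visit(w):
                return True
        colour[v] = BLACK
        return False

    return visit(start)


# A "diamond": two descending paths from x meet again at the end w.
diamond = {"x": ["y", "z"], "y": ["w"], "z": ["w"], "w": []}
# A vertex connected to an end e but also carrying an infinite descending path.
looping = {"x": ["x", "e"], "e": []}

print(reachable_ends(diamond, "x"), has_infinite_descent(diamond, "x"))  # {'w'} False
print(reachable_ends(looping, "x"), has_infinite_descent(looping, "x"))  # {'e'} True
```

The second graph shows the kind of behaviour that remains conceivable at this point of the argument: a single end is reachable, yet not every descending path terminates.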

Theorem 8. If, in a complex satisfying the conditions A and J, there is a path descending from $x$ to an end $e$ of $\Sigma$, all descending paths from $x$ are finite, and all maximal paths end at $e$.

That all maximal descending paths from x end at e is obvious in view of 
Theorem 5; only the finiteness remains to be proved. 

Let $[\eta]_m$ be a descending path from $x$ to $e$, and (if possible) $\zeta_1 + \zeta_2 + \cdots$ an infinite descending path from $x$. Let the paths $\sigma_{rs}$ and $\tau_{rs}$, and the sets $E^1_{rs}$ and $E^2_{rs}$, be constructed as in Lemma 5.2. Since $e$ is an end all the $\tau_{r,m+1}$ are null. Let $j$ be the largest number such that $\tau_{rj}$ is non-null for an infinity of values of $r$, and $k$ a number such that $\tau_{r,j+1} = 0$ if $r \geq k$. Then $E^2_{r,j+1}$, of which $\tau_{r,j+1}$ is a CD, is also null, giving $E^2_{rj} \mid \sigma_{rj} = E^2_{r,j+1} = 0$. Since $\sigma_{rj}$ is a CD of $E^1_{rj}$ it follows (Lemma 1) that, for $r \geq k$,

$$E^2_{rj} \subset E^1_{rj} = E^1_{kj} \mid \tau_{kj} + \cdots + \tau_{r-1,j}.$$

Thus $\tau_{kj} + \tau_{k+1,j} + \cdots$ is a development of $E^1_{kj}$, and by Theorem 7 cannot be infinite, contrary to the definition of $j$.

It follows that if a 2-complex $\Sigma^2$ is constructed as in §5, all its components containing ends of $\Sigma$ are simply connected.

11 

The theorems that have been proved indicate that complications will arise when the descending paths that join the “$y$” and “$z$” of condition (D) to “$w$” have either more or less than one member each, and that the difficulties are of a different kind in the two cases. In the foregoing group of theorems the second possibility (corresponding to $\xi \mid \eta = 0$ for $\xi \neq \eta$) was excluded. The following theorem allows this possibility, but is in other ways more special than Theorem 5, and the meaning of the conditions imposed is less obvious. The theorem is used in extending the Church-Rosser Theorem to an enlarged calculus.

We suppose that derivates are defined in $\Sigma$, and satisfy $A_2$-$A_4$, but that $A_1$ holds only in the weakened form

($A_1^*$) $\xi \mid \xi = 0$, and if $\xi \mid \eta = 0$ then $\eta\,A\,\xi$,

where $\eta\,A\,\xi$ stands for “either $\eta = \xi$ or $\eta \mid \xi$ has just one member”. The following additional “J-condition” is imposed, $\bar{A}$ denoting “not $A$”:

($J_3$) if $\xi\,\bar{A}\,\eta$ and $\xi\,J\,\zeta$ then $\eta\,J\,\zeta$.

(Condition $J_3$ does not imply the second half of $A_1^*$, since $\xi\,\bar{J}\,\xi$.)

Lemmas 2 and 3 remain true under these conditions, and are proved as before. The notation $E_a \mid E^1_a + \cdots + E^j_a$ may therefore be introduced for J-sets $E^i_a$.

Theorem 9. A complex with the properties $A_1^*$, $A_2$-$A_4$ and $J_1$-$J_3$ has the property (A).

It is sufficient to prove the following special case, since the extension to the 
general case then proceeds exactly as in Theorem 5. 

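Lemma 9.1. If $E^1_a$ and $E^2_a$ are J-sets, and $E_a$ is any set at $a$, the CD's of $E^1_a \mid E^2_a$ and $E^2_a \mid E^1_a$ have the same final vertex, and $E_a \mid E^1_a + E^2_a = E_a \mid E^2_a + E^1_a$.

Let $[\eta]_j$ and $[\zeta]_k$ be CD's of $E^1_a$ and $E^2_a$, $\eta_i$ and $\zeta_i$ having final vertices $b_i$ and $c_i$. “$E^1_a\,J\,E^2_a$” means “$\eta\,J\,\zeta$ if $\eta \in E^1_a$ and $\zeta \in E^2_a$.” From $J_2$ it follows that if $E^1_a\,J\,E^2_a$, $(E^1_a \mid \xi)\,J\,(E^2_a \mid \xi)$.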

Case 1: $E^1_a\,J\,E^2_a$. We show further that in this case $E^1_a \mid E^2_a$ has $j$ cells. First let $j = 1$. If also $k = 1$ the result follows immediately from $J_1$ and $A_3$. For general $k$, we have $(\eta_1 \mid \zeta_1)\,J\,(E^2_a \mid \zeta_1)$; and since $\eta_1 \mid \zeta_1$ is a single cell, and $E^2_a \mid \zeta_1$ has $k - 1$ cells, an inductive hypothesis shows that $\eta_1 \mid E^2_a$ is a single cell and has the same final vertex as a CD, $\pi'$, of $(E^2_a \mid \zeta_1) \mid (\eta_1 \mid \zeta_1)$, which by $A_4$ is $E^2_a \mid \eta_1 + \zeta_1$. Hence $\pi'$ is a CD of $E^2_a \mid \eta_1$. That $E_a \mid E^1_a + E^2_a = E_a \mid E^2_a + E^1_a$ is proved, as in Theorem 5, by repeated applications of $A_4$. Case 1 for general $j$ is now completed by applying the case $j = 1$ successively to $\eta_r$ and $E^2_a \mid [\eta]_{r-1}$, for $r = 1, 2, \cdots, j$, and using the last part of the result for $r - 1$.

Case 2: $E^1_a \mid E^2_a = 0$. We show further that, in this case, $E^2_a \mid E^1_a$ has $k$ members or less. If $j = k = 1$ the result is clear from J and A. Suppose that $j = 1$, $k > 1$. Then $\zeta_1\,A\,\eta_1$, for if $\zeta_1\,\bar{A}\,\eta_1$, by $J_3$ and $J_2$ $(\eta_1 \mid \zeta_1)\,J\,(E^2_a \mid \zeta_1)$, and hence by Case 1 $\eta_1 \mid E^2_a \neq 0$, contrary to the hypothesis. Thus $\zeta_1 \mid \eta_1$ is a single cell. By a $k$-induction applied to $\eta_1 \mid \zeta_1$ and $E^2_a \mid \zeta_1$ (in place of $E^1_a$ and $E^2_a$), all CD's of $E^2_a \mid \zeta_1 + \eta_1$ have $k - 1$ cells or less, and end at $c_k$. Since this set is also $E^2_a \mid \eta_1 + \zeta_1$, the CD of $E^2_a \mid \eta_1$ is the cell $\zeta_1 \mid \eta_1$, followed by the $k - 1$ cells (or less) of $E^2_a \mid \zeta_1 + \eta_1$. The final part follows by repeated applications of $A_4$.

The extension to general $j$ is as in Case 1.



General Case. In view of Lemma 3 it may be assumed that if $E^1_a \mid E^2_a \neq 0$, $[\eta]_j$ is chosen so that $\eta_1 \mid E^2_a \neq 0$. A series of “zig-zag” paths, $\pi_1, \pi_2, \cdots$, from $b_j$ to $c_k$, is constructed, $\pi_1$ being $-[\eta]_j + [\zeta]_k$. Suppose $\pi_r$ already constructed,

$$\pi_r = \tau_0 - \sigma_1 + \tau_1 - \cdots + \tau_{m-1} - \sigma_m,$$

where $\sigma_i$ and $\tau_i$ are CD's of subsets, $E^1_i$ and $E^2_i$, of $E^1_a \mid \gamma_i$ and $E^2_a \mid \gamma_i$ respectively. The $\gamma_i$ are descending paths from $a$ satisfying

(i) $\gamma_m$ is $[\zeta]_k$,

(ii) for any $E_a$ at $a$, $E_a \mid \gamma_i + \sigma_i = E_a \mid \gamma_{i-1} + \tau_{i-1}$.

This whole inductive hypothesis is satisfied by $\pi_1$ if $m = 2$, $\tau_0 = \sigma_2 = \gamma_1 = 0$, $\sigma_1 = \gamma_0 = [\eta]_j$, $\tau_1 = \gamma_2 = [\zeta]_k$.

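Let $\pi_r$ have a peak at the join of $\sigma_{m-1}$ and $\tau_{m-1}$, and let $u$ be the final vertex of $\tau_{m-1}$. First suppose that $E^1_{m-1} \mid \tau_{m-1} = 0$. By Case 2 a CD, $\theta$, of $E^2_{m-1} \mid \sigma_{m-1}$ ends at $u$, and we define $\pi_{r+1}$ to be $\tau_0 - \sigma_1 + \cdots + \tau_{m-2} + \theta - \sigma_m$, the new $m'$, $\tau'_{m-2}$ and $\sigma'_{m-1}$ being $m - 1$, $\tau_{m-2} + \theta$ and $\sigma_m$. The $\gamma'_i$ up to $\gamma'_{m-2}$ are the corresponding $\gamma_i$, and $\gamma'_{m-1} = \gamma_m$. Since $\theta$ is a CD of

$$E^2_{m-1} \mid \sigma_{m-1} \subset E^2_a \mid \gamma_{m-1} + \sigma_{m-1}$$
$$= E^2_a \mid \gamma_{m-2} + \tau_{m-2},$$

$\tau_{m-2} + \theta$ is a CD of a subset of $E^2_a \mid \gamma_{m-2}$. For any $E_a$ at $a$,

$$E_a \mid \gamma'_{m-1} + \sigma'_{m-1} = E_a \mid \gamma_m + \sigma_m$$
$$= E_a \mid \gamma_{m-1} + \tau_{m-1}$$
$$= E_a \mid \gamma_{m-1} + \sigma_{m-1} + \theta \quad \text{(Case 2)}$$
$$= E_a \mid \gamma_{m-2} + \tau'_{m-2}.$$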


Secondly let $E^1_{m-1} \mid \tau_{m-1} \neq 0$. By Lemmas 2 and 3, if a CD of $E^1_{m-1}$, whose first cell, $\xi^{(1)}$, satisfies $\xi^{(1)} \mid \tau_{m-1} \neq 0$, is substituted for $\sigma_{m-1}$, all the conditions imposed on $\pi_r$ remain satisfied, and it may therefore be assumed that $\sigma_{m-1}$ itself is such a CD. Let $\tau_{m-1}$ be $[\omega]_p$ ($p \neq 0$ in view of the peak). We construct successively, as in Theorem 5, for $i = 1, 2, \cdots, p$, pairs of descending paths $\tau'_{m-2+i}$ and $\xi^{(i+1)} + \sigma'_{m-1+i}$, which are CD's of $\omega_i \mid \xi^{(i)}$ and $\xi^{(i)} \mid \omega_i$ respectively, and have a common final vertex. The notation “$\xi^{(i+1)} + \sigma'_{m-1+i}$” implies that $\xi^{(i)} \mid \omega_i \neq 0$, which is justified by $\xi^{(i-1)} \mid \omega_{i-1} \neq 0$, derived ultimately from $\xi^{(1)} \mid \tau_{m-1} \neq 0$. If $\sigma_{m-1}$ is $\xi^{(1)} + \sigma'_{m-1}$, $\pi_{r+1}$ is defined to be

$$\tau_0 - \sigma_1 + \cdots + \tau_{m-2} - \sigma'_{m-1} + \tau'_{m-1} - \cdots + \tau'_{m-2+p} - \sigma'_{m-1+p} - \xi^{(p+1)} - \sigma_m.$$

Its final ascending part has at least one more cell than $\sigma_m$. If $\gamma'_h$ is taken to be $\gamma_h$ for $h$ up to $m - 2$, $\gamma_{m-1} + [\omega]_{h-m+1} + \xi^{(h-m+2)}$ for $h = m - 1$ to $m + p - 2$, and $\gamma'_{m+p-1} = \gamma_m$, all the conditions are fulfilled, the new “$m$” being $m + p - 1$, the new “$\sigma_m$” $\sigma_m + \xi^{(p+1)} + \sigma'_{m+p-1}$. Condition (ii) follows for the $\gamma'_i$ immediately from $A_4$ except for $i = m + p - 1$; and

$$E_a \mid \gamma_m + \sigma_m + \xi^{(p+1)} + \sigma'_{m+p-1} = E_a \mid \gamma_{m-1} + \tau_{m-1} + \xi^{(p+1)} + \sigma'_{m+p-1}$$
$$= E_a \mid \gamma_{m-1} + [\omega]_{p-1} + \xi^{(p)} + \tau'_{m+p-2} \quad (A_4)$$
$$= E_a \mid \gamma'_{m+p-2} + \tau'_{m+p-2},$$

as required.

It has thus been shown how, in all cases where $\pi_r$ has a peak, a path $\pi_{r+1}$ is to be constructed having either one less peak or a longer final ascending portion. Since this portion is a development of the finite J-set $E^1_a \mid [\zeta]_k$, the second alternative can only occur a finite number of times. Thereafter the number of peaks decreases at each step until a path with no peaks is reached.

The extension to paths which are not CD's of J-sets now follows as in Theorem 5.


12 

The theorems that follow Theorem 5, as far as Corollary 6.1, are proved under the new conditions with only minor changes in the argument. Theorem 8 fails to survive, as may be seen by considering Fig. 1: all the conditions $A_1^*$, $A_2$-$A_4$, and $J_1$-$J_3$ are satisfied by taking the derivate of $a_r b$ at $a_{r+1}$ to be $a_{r+1} b$, and that of $a_r a_{r+1}$ at $b$ to be null; and $a_r b\,J\,a_r a_{r+1}$ but not $a_r a_{r+1}\,J\,a_r b$.

Theorem 7 still holds but its proof needs some modification. 

Theorem 10. In a complex satisfying $A_1^*$, $A_2$-$A_4$ and $J_1$-$J_3$ all developments of finite sets are finite.

We proceed as in the proof of Theorem 7. As before it follows that $\tau_{1s} + \tau_{2s} + \cdots$ is a development of $E_x \mid [\sigma]_{s-1}$ and that for some $p$ and $q$ the development $\tau_{p+1,q} + \tau_{p+2,q} + \cdots$ of $E_x \mid [\sigma]_{q-1} + \tau_{1q} + \cdots + \tau_{pq}$, $= E_x \mid d_{pq}$ say, is infinite, while $\tau_{r,q+1} = 0$ if $r > p$. Since the $\sigma$'s and $\tau$'s are CD's of J-sets it follows from Case 2 of Lemma 9.1 that the number of cells in $\sigma_{rq}$ is (for $r > p$) non-increasing with increasing $r$. If $\zeta_r$ belongs to $E^q_x \mid [\zeta]_{r-1}$, $\tau_{rq}$ is contained in $E^q_x \mid [\zeta]_{r-1} + \sigma_{r1} + \cdots + \sigma_{r,q-1} = E^q_x \mid d_{r-1,q}$; by the last part of Lemma 9.1 the path $\sigma_{rq}$ is a CD of this set, and may therefore be chosen in the form $\tau_{rq} + \sigma'_{rq}$, and $\sigma_{r+1,q}$ to be $\sigma'_{rq}$. Thus $\sigma_{r+1,q}$ has less cells than $\sigma_{rq}$ unless $\tau_{rq} = 0$. If, for $r > n$, $\tau_{rq} = 0$ whenever $\zeta_r \in E^q_x \mid [\zeta]_{r-1}$, $\tau_{n+1,q} + \cdots$ is an infinite development of the sum of the $k - 1$ J-sets $E^i_x \mid d_{nq}$ ($i \neq q$), contrary to the inductive hypothesis. Hence the number of cells in $\sigma_{rq}$ eventually diminishes to zero, and from this point on the $\tau_{rq}$ coincide with the $\tau_{r,q+1}$ and are null, contrary to the initial hypothesis.

Corollary 10.1. Under the same conditions, the number of different developments of $E_x$ is finite. (Compare Corollary 7.1.)





13. Application to the conversion calculus 

The formalism first considered is that of Theorems 1 and 2 of Church and Rosser [1], but modified in two ways: first by the adoption of the simpler bracketing of Church [1], secondly by the entire exclusion of “singular” formulae, i.e. those having “accidental” coincidences between the bound variables.$^8$ The WFF's are therefore rows of the symbols $\lambda$, variables, and round brackets, built up according to the following rules: (1) $x$ is a WFF, (2) if M is a WFF containing $x$ as a free variable, $(\lambda x\mathrm{M})$ is a WFF, (3) if A and B are WFF's whose common variables are free in both, (AB) is a WFF. (A variable is bound in any row of symbols in which one of its occurrences immediately succeeds a $\lambda$, otherwise free.)

The allowed transformations that concern us are 

I. To replace each specimen of a bound variable $x$ in X by $y$, a letter not occurring in X.

The result, Y, of any series of applications of I to X will be called an adjusted copy of X, and X conv.-I Y.

II. To replace a part $((\lambda x\mathrm{M})\mathrm{N})$ of X by the result of substituting adjusted copies of N for the specimens of $x$ in M, the new bound variables being all different from each other and those of X.

It is agreed that a WFF denoted by one of the letters U, V, is of the form $((\lambda x\mathrm{M})\mathrm{N})$, and we accordingly speak of “the move U on X,” or “the move (X, U),” if U is a part$^9$ of X, meaning the application of Rule II in which U is the part operated on. If Y is the WFF that thus replaces X, we write (X, U) $\to$ Y and X conv.-II Y. If a series of moves I that turns X into Y turns its part U into V, (Y, V) is an adjusted copy of (X, U).

To define the residuals of a part V of X after the move $((\lambda x\mathrm{M})\mathrm{N})$, suppose that each pair of brackets in M is provided with a numerical suffix, which is left unchanged in applying rule II, and that V is enclosed in the pair $_i(\ )$. If the move $((\lambda x\mathrm{M})\mathrm{N})$ turns X to Y, then

(a) if V = $((\lambda x\mathrm{M})\mathrm{N})$, V has no residual in Y;

(b) if V is a part of N its residuals are the corresponding parts of the adjusted copies of N that replace $x$ in M;

(c) in all other cases the residual of V is the part $_i(\ )$ of Y.
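
The bookkeeping of residuals by bracket suffixes can be tried out on a small formula. The Python sketch below is a hedged illustration, not the paper's formal apparatus: the classes `Var`, `Lam`, `App` and the functions `adjusted_copy`, `substitute`, `move` and `redex_labels` are hypothetical names, the optional `label` on an application stands for the numerical suffix on a pair of brackets, and it is assumed that the formula is non-singular and that no variable in it is already named `v0`, `v1`, and so on (those names are reserved for fresh bound letters).

```python
from dataclasses import dataclass
from itertools import count
from typing import Optional


@dataclass(frozen=True)
class Var:
    name: str


@dataclass(frozen=True)
class Lam:            # (λ var body)
    var: str
    body: object


@dataclass(frozen=True)
class App:            # (fun arg); `label` plays the part of the bracket suffix
    fun: object
    arg: object
    label: Optional[int] = None


_fresh = count()


def adjusted_copy(t, env=None):
    """Rule I throughout: rename every bound variable to a fresh letter."""
    env = env or {}
    if isinstance(t, Var):
        return Var(env.get(t.name, t.name))
    if isinstance(t, Lam):
        new = f"v{next(_fresh)}"
        return Lam(new, adjusted_copy(t.body, {**env, t.var: new}))
    return App(adjusted_copy(t.fun, env), adjusted_copy(t.arg, env), t.label)


def substitute(m, x, n):
    """Put a separate adjusted copy of n at each occurrence of x in m.

    Assumes a non-singular formula, so no bound letter of m occurs in n.
    """
    if isinstance(m, Var):
        return adjusted_copy(n) if m.name == x else m
    if isinstance(m, Lam):
        return Lam(m.var, substitute(m.body, x, n))
    return App(substitute(m.fun, x, n), substitute(m.arg, x, n), m.label)


def move(t, label):
    """Rule II on every part ((λx M) N) whose brackets carry `label`."""
    if isinstance(t, Var):
        return t
    if isinstance(t, Lam):
        return Lam(t.var, move(t.body, label))
    if t.label == label and isinstance(t.fun, Lam):
        return substitute(t.fun.body, t.fun.var, t.arg)
    return App(move(t.fun, label), move(t.arg, label), t.label)


def redex_labels(t):
    """Suffixes of the parts of the form ((λx M) N) in t, with multiplicity."""
    if isinstance(t, Var):
        return []
    if isinstance(t, Lam):
        return redex_labels(t.body)
    here = [t.label] if isinstance(t.fun, Lam) and t.label is not None else []
    return here + redex_labels(t.fun) + redex_labels(t.arg)


# X = 1((λx (x x)) 2((λy y) z)): U is the part 1( ), V is the part 2( ).
V = App(Lam("y", Var("y")), Var("z"), label=2)
X = App(Lam("x", App(Var("x"), Var("x"))), V, label=1)

Y1 = move(X, 1)   # V is a part of N: one residual per copy of N, case (b)
Y2 = move(X, 2)   # U is neither V nor a part of its N: case (c)
print(redex_labels(Y1))            # [2, 2]
print(redex_labels(Y2))            # [1]
print(move(Y1, 2) == move(Y2, 1))  # True: both routes end in (z z)
```

After the move U the suffix 1 no longer occurs, which is case (a). In this small example, performing U and then both residuals of V gives the same formula as performing V and then the residual of U.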

The complex $\Sigma$ to which our general theorems will be applied has as a typical vertex the class [X] of all adjusted copies of a WFF X. A positive 1-cell is the class [(X, U)], or briefly [X, U], consisting of (X, U) and all its adjusted copies; and its initial and final vertices are [X] and [Y], where (X, U) $\to$ Y. If V is also a part of X, the [X, U]-derivate of [X, V] consists of all the cells [Y, V$_i$], where the V$_i$ are the residuals of V in Y. Finally “[X, U] J [X, V]” means that (i) neither

8 Cf. Newman [2] §3. After the general theoretical work the calculus may be extended, for practical convenience, by re-admitting the singular formulae and resuming the original rules I and II; and it can be shewn without difficulty that (1) every singular WFF X conv.-I a non-singular X', and (2) if X conv.-II Y, X conv.-I X', Y conv.-I Y' in the extended calculus, X' and Y' being non-singular, then X' conv.-I-II Y' in the restricted calculus.

9 Defined as in Church [1].





U nor V is a part of the other, and (ii) the free variables of U and V are the 
same. Thus J is a symmetrical relation, and is independent of the WFF chosen 
to represent [X]. 

With these definitions the conditions $J_1$, $J_2$ and $A_1$-$A_4$ are satisfied. ($J_1$): no comment is necessary. ($J_2$): let $\eta_1$ and $\eta_2$ be determined by the parts V$_1$ and V$_2$ of X, and $\xi$ by U, $= ((\lambda x\mathrm{M})\mathrm{N})$. If V$_1$ = V$_2$, distinct members of $\eta_1 \mid \xi$ are determined by different adjusted copies of V$_1$, and evidently satisfy (i) and (ii). If V$_1$ and V$_2$ are not identical they are mutually exterior, and a residual of V$_1$ could only be a part of a residual of V$_2$ if V$_1$ were a part of N, and V$_2$ a part of M containing $x$. This contravenes the condition (ii) for V$_1$ and V$_2$ since $x$ is free in M and cannot occur in N. A part of X and its residuals in Y have the same free variables, except that $x$ is replaced at all occurrences by adjusted copies of N. Hence if V$_1$ and V$_2$ have the same free variables their residuals in Y have also.

In considering the conditions A, let $\xi$, $\eta$, $\zeta$ be determined by the moves U$_i$ on X ($i = 1, 2, 3$), where U$_i$ = $_i((\lambda x_i \mathrm{M}_i)\mathrm{N}_i)$, and (X, U$_i$) $\to$ Y$_i$. Thus U$_i$ is the part $_i(\ )$ of X.

$A_1$: no comment is necessary. $A_2$: suppose that U$_2$ $\neq$ U$_3$. The residuals are clearly distinct if they are determined by the original brackets, or one by old and the other by new brackets. The remaining possibility is that a residual of U$_2$ is a part of an adjusted copy of N$_1$, and a residual of U$_3$ is either a different part of the same copy, or part of a different copy, in any case different from U$_2$. $A_3$: the condition is obviously satisfied unless one of U$_2$, U$_3$ is part of the other, say U$_2$ of U$_3$. If U$_2$ is in M$_3$ the residual of U$_2$ in Y$_3$, and of U$_3$ in Y$_2$, are determined by their original brackets, and since N$_3$ contains no copy of $x_2$ (a bound variable of M$_3$), the order of performance of $_2(\ )$ and $_3(\ )$ is indifferent. If U$_2$ is in N$_3$ the final effect is the same whether $_2(\ )$ is performed first on N$_3$, followed by $_3(\ )$, or the residuals of $_2(\ )$ on the adjusted copies of N$_3$ in Y$_3$. $A_4$: Let W be the final result of either series of moves on X: it has been shown to be unique to within I-adjustment, and therefore determines a unique vertex, $w$, of $\Sigma$. If U$_1$ (or U$_2$) is not part of either of the other U$_i$'s, its performance, before or after U$_2$ (or U$_1$), does not affect the residual of U$_3$. We may therefore suppose that one of the U$_i$'s contains the other two. If U$_3$ is not part of either N$_1$ or N$_2$ the residual of U$_3$ in W by either route is $_3(\ )$. We therefore assume that U$_1$ contains both U$_2$ and U$_3$, and that U$_3$ is part of either N$_1$ or N$_2$. Finally, if both U$_2$ and U$_3$ are in N$_1$, the same residuals of U$_3$ are evidently obtained whether U$_2$ is performed on N$_1$ before U$_1$, or the corresponding moves on the copies of N$_1$ after U$_1$. There remains only the case where U$_2$ is part of M$_1$, and U$_3$ of either ($\alpha$) N$_1$, or ($\beta$) N$_2$. ($\alpha$): the residuals of U$_3$ by either route are the corresponding parts of the adjusted copies of N$_1$ that replace $x_1$ in the move $_1(\ )$ on Y$_2$. ($\beta$): if the residuals of U$_3$ in Y$_2$ are the parts enclosed in the brackets $_{31}(\ )$, $_{32}(\ )$, $\cdots$, the residuals in W by either route are the parts enclosed in the same brackets.

The conditions for all the Theorems 5 to 8 are therefore satisfied, and we 
obtain the following results. 





Corollary 11.1. If X conv.-I-II Y and X conv.-I-II Z, there is a WFF W such that Y conv.-I-II W and Z conv.-I-II W.

Corollary 11.2. There are only a finite number of different developments of a given set of moves II on a WFF X. All of them are finite, and all end in adjusted copies of the same WFF.

Corollary 11.3. A WFF has (apart from I-adjustments) at most one normal form, and if one exists all series of moves II terminate in this normal form or an adjusted copy.


14 

Two generalizations of these theorems were given by Church and Rosser in their paper. The first, to the formalism extended so as to include the $\delta$-symbol, is of no interest in the present connection: it is easily shown that the original conditions A and J are still satisfied, and hence that the Corollaries 11 hold. The second generalization (of which the proof was not given by Church and Rosser) is to the formalism in which $(\lambda x\mathrm{M})$ is counted a WFF even if $x$ does not occur in M. The rules of procedure, and the definitions of derivates, need no modification, and the conditions $A_2$-$A_4$ are proved to hold, just as before. The second part (“only if”) of condition $A_1$ now fails, but the condition $A_1^*$ is satisfied. The conditions $J_1$-$J_3$ are also satisfied if a different, more complicated, interpretation is given to J.

Let “U S V” stand for “a free variable of U is bound in V.” It implies that U is a proper part of V, and if U' and V' are, for any W, W-residuals of U and V, U' S V' implies U S V. Let “U Ex V” stand for “neither U nor V is a part of the other.” Then, with the same notation, U' Ex V' implies U Ex V. We now take [X, U] J [X, V], for any significant U and V, to mean

“(i) U is not a part of V, and (ii) there is no part W of X such that V S W and U Ex W.”

$J_1$ is clearly satisfied.

$J_2$. Let the notations be those of the previous discussion of $J_2$, and let V$_1'$ and V$_2'$ be distinct residuals of V$_1$ and V$_2$. If V$_1'$ S W' and V$_2'$ Ex W', then V$_1$ S W and V$_2$ Ex W, which is incompatible with $\eta_1 = \eta_2$ or $\eta_1\,J\,\eta_2$. If V$_1$ = V$_2$, V$_1'$ cannot be part of V$_2'$. If V$_1$ $\neq$ V$_2$, the only possibility that V$_1'$ be part of V$_2'$ is that V$_2$ be part of M, with $x$ as a free variable, and V$_1$ be in N, which in view of $\eta_1\,J\,\eta_2$ contravenes (ii).

$J_3$. Let $\eta$, $\xi$ and $\zeta$ be determined by U, V$_1$, and V$_2$, where U is $((\lambda x\mathrm{M})\mathrm{N})$. Suppose that $\xi\,\bar{A}\,\eta$ and $\eta\,\bar{J}\,\zeta$. Then V$_1$ is part of N and either U is part of V$_2$ or, for some W, V$_2$ S W and U Ex W. The first alternative gives V$_1$ part of V$_2$; the second V$_2$ S W and V$_1$ Ex W; and both contradict $\xi\,J\,\zeta$. Hence

Theorem 12. Corollaries 11.1, 11.2 and the first part of Corollary 11.3 hold in the extended calculus.

It is easily seen that the second part of Corollary 11.3 fails to survive. 
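
One way to see the failure, offered here as an illustration rather than as the author's own example, is the formula $((\lambda x\, z)((\lambda a\,(a\,a))(\lambda b\,(b\,b))))$ of the extended calculus. A single move on the outer part gives the normal form $z$, while a move on the inner part reproduces that part up to Rule I, so that series of moves II need not terminate. The Python sketch below assumes the same hypothetical term representation as a simple syntax tree (`Var`, `Lam`, `App`), reserves the names `v0`, `v1`, ... for fresh bound letters, and caps the number of moves at an arbitrary `limit`.

```python
from dataclasses import dataclass
from itertools import count


@dataclass(frozen=True)
class Var:
    name: str


@dataclass(frozen=True)
class Lam:
    var: str
    body: object


@dataclass(frozen=True)
class App:
    fun: object
    arg: object


_fresh = count()


def adjusted_copy(t, env=None):
    """Rule I throughout: rename every bound variable to a fresh letter."""
    env = env or {}
    if isinstance(t, Var):
        return Var(env.get(t.name, t.name))
    if isinstance(t, Lam):
        new = f"v{next(_fresh)}"
        return Lam(new, adjusted_copy(t.body, {**env, t.var: new}))
    return App(adjusted_copy(t.fun, env), adjusted_copy(t.arg, env))


def substitute(m, x, n):
    """Put a separate adjusted copy of n at each occurrence of x in m."""
    if isinstance(m, Var):
        return adjusted_copy(n) if m.name == x else m
    if isinstance(m, Lam):
        return Lam(m.var, substitute(m.body, x, n))
    return App(substitute(m.fun, x, n), substitute(m.arg, x, n))


def step(t, innermost):
    """One move II, or None if t contains no part ((λx M) N)."""
    if isinstance(t, Var):
        return None
    if isinstance(t, Lam):
        body = step(t.body, innermost)
        return None if body is None else Lam(t.var, body)
    is_redex = isinstance(t.fun, Lam)
    if is_redex and not innermost:
        return substitute(t.fun.body, t.fun.var, t.arg)
    fun = step(t.fun, innermost)
    if fun is not None:
        return App(fun, t.arg)
    arg = step(t.arg, innermost)
    if arg is not None:
        return App(t.fun, arg)
    return substitute(t.fun.body, t.fun.var, t.arg) if is_redex else None


def run(t, innermost, limit=50):
    """Apply moves until none is possible; None if no normal form is confirmed."""
    for _ in range(limit):
        nxt = step(t, innermost)
        if nxt is None:
            return t
        t = nxt
    return None


omega = App(Lam("a", App(Var("a"), Var("a"))), Lam("b", App(Var("b"), Var("b"))))
X = App(Lam("x", Var("z")), omega)   # x does not occur in M = z

print(run(X, innermost=False))  # Var(name='z')
print(run(X, innermost=True))   # None
```

The step limit only shows that fifty inner moves do not reach a normal form; that the inner series never terminates follows from the observation that each move reproduces the inner part up to renaming of bound variables.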


Cambridge, England 





References 

J. W. Alexander, [1] Combinatorial theory of complexes, Annals of Math., 31 (1930), pp. 292-320.

G. Birkhoff, [1] Lattice theory, New York, 1940.

A. Church, [1] A formulation of the simple theory of types, J. of Symbolic Logic, 5 (1940), pp. 66-68.

A. Church and B. Rosser, [1] Some properties of conversion, Trans. Amer. Math. Soc., 39 (1936), pp. 472-482.

M. H. A. Newman, [1] A theorem in combinatory topology, J. London Math. Soc., 6 (1931), pp. 186-192.

[2] Stratified systems of logic. (Forthcoming.)



Annals of Mathematics
Vol. 43, No. 2, April, 1942 


ISOMORPHISMS OF NORMED LINEAR SPACES 1 

By George W. Mackey 
(Received August 28, 1941) 

Introduction 

Following Banach [1, p. 180]$^2$ we say that two normed linear spaces $X_1$ and $X_2$ are isomorphic if there exists a one-to-one correspondence between their elements which is both a homeomorphism and an algebraic isomorphism; that is, if the spaces are abstractly identical as topological linear spaces. Let $R_i$ ($i = 1, 2$) be the ring of all continuous linear$^3$ transformations of $X_i$ into itself. Eidelheit [2] has shown that if $X_1$ and $X_2$ are complete, then $X_1$ and $X_2$ are isomorphic if and only if $R_1$ and $R_2$ are isomorphic as rings. In this paper we prove two analogous theorems: one involving the lattices of closed linear subspaces of $X_1$ and $X_2$ and the other their groups of automorphisms (self isomorphisms). The latter theorem differs a little from the others in that from the isomorphism of the groups of automorphisms it is not concluded that $X_1$ and $X_2$ are isomorphic but only that either this is the case or $X_1$ and $X_2$ are what we shall call pseudo-reflexive and mutually pseudo-conjugate. In neither theorem do we need to assume anything about completeness, and we use our methods to prove Eidelheit's theorem without this restriction.

In all three theorems the proof of the necessity is trivial and that of the sufficiency involves three main steps. First we use the given isomorphism between the associated algebraic systems to set up a one-to-one linear independence preserving correspondence between the one dimensional subspaces of $X_1$ and $X_2$. Next we show that this correspondence may be defined by a one-to-one linear transformation of all of $X_1$ into all of $X_2$. Finally we prove that the transformation is a homeomorphism. The second and third steps are accomplished in the same way in all three cases. We devote the first section to proving the two fundamental lemmas involved. The first step is accomplished by associating one dimensional subspaces with elements of the algebraic systems in a natural way and showing that the correspondence between one dimensional subspaces set up via the given isomorphism between the algebraic systems has the properties desired. This is carried out by giving algebraic characterizations of certain kinds of elements and sets of elements in the algebraic systems. The difficulty of doing this increases rapidly as we pass from lattices through rings to groups. Accordingly we prove the three theorems in that order in sections II, III, and IV.


1 Presented to the American Mathematical Society under another title, November 22, 
1941. 

2 The numbers in brackets refer to the bibliography. 

3 In this paper linear means additive and homogeneous.



I. Two Fundamental Lemmas 

Lemma A. If $X_1$ and $X_2$ are linear spaces having dimension greater than two and if $A \rightleftarrows A'$, where $A \subset X_1$ and $A' \subset X_2$, represents a one-to-one correspondence between the one dimensional linear subspaces of $X_1$ and $X_2$ respectively which preserves linear independence, then there exists a one-to-one linear transformation $T$ from all of $X_1$ into all of $X_2$ such that if $A_x$ is the linear subspace of scalar multiples of $x$, then $A_{T(x)} = A'_x$ for all $x$ in $X_1$.

Let $\bar{x}$ be any non-zero element in $X_1$ and let $\bar{y}$ be any non-zero element in $A'_{\bar{x}}$. It is clear that if $T$ exists then $T(\bar{x}) = \lambda\bar{y}$ and may be chosen so that $\lambda = 1$. With this choice of $T(\bar{x})$, $T$ is uniquely determined for all $x$ in $X_1$. In fact if $x = \lambda\bar{x}$ we must have $T(x) = \lambda\bar{y}$, and if $x$ and $\bar{x}$ are linearly independent then $A'_x$ and $A'_{x-\bar{x}}$ will be linearly independent whereas $A'_x$, $A'_{\bar{x}}$, and $A'_{x-\bar{x}}$ will be linearly dependent. Hence any element in $A'_{\bar{x}}$, in particular $\bar{y}$, will be a unique sum of elements $x_1$ and $y_1$ from $A'_x$ and $A'_{x-\bar{x}}$ respectively. Since we require that $T(\bar{x}) = T(x) + T(\bar{x} - x)$, we must have $T(x) = x_1$.

It remains to show that the $T$ so defined is linear. It will obviously then have the other required properties. Let $x$ and $y$ be arbitrary elements of $X_1$. Let $M$ be a three dimensional subspace of $X_1$ containing $x$, $y$, and $\bar{x}$. Let $M'$ be the three dimensional subspace of $X_2$ spanned by the one dimensional subspaces of the form $A'$ where $A$ is in $M$. Lemma A for three dimensional spaces is well known. It is simply the theorem of projective geometry$^4$ to the effect that a collineation between two real projective planes can be represented analytically by a linear transformation [3, vol. I, p. 190 and vol. II, p. 252]. Thus there is a linear transformation of the desired sort taking $M$ into $M'$. By the argument of the first paragraph, $T$ on $M$ must be a constant multiple of this transformation. Therefore if $\lambda$ and $\mu$ are arbitrary scalars, $T(\lambda x + \mu y) = \lambda T(x) + \mu T(y)$ and, since $x$ and $y$ were arbitrary, $T$ is linear.

Before stating and proving Lemma B we make a few preliminary remarks concerning the relation between the “maximal” subspaces of a linear space and the linear functionals defined on the space. We define a maximal subspace as a proper subspace contained in no other proper subspace. It is clear that if $z$ is any element in the complement of a maximal subspace $M$ of a linear space $X$, then any element in $X$ has a unique representation in the form $x = m + \lambda z$ where $m$ is in $M$ and $\lambda$ is a scalar. If we make the definition $f(m + \lambda z) = \lambda$, it is easily seen that $f(x)$ is a linear functional which vanishes on $M$ and only on $M$. Conversely, if $f(x)$ is any non-trivial linear functional defined on $X$, it is easily verified that the set of elements $x$ of $X$ such that $f(x) = 0$ is a maximal subspace of $X$. We call it the null-space of $f(x)$. Finally, suppose that $f_1(x)$ and $f_2(x)$ are non-trivial linear functionals having the same null-space. Let $z$ be in the complement of this subspace. Then $f_2(z)f_1(x) - f_1(z)f_2(x)$ is zero for all $x$ in $X$. Therefore $f_2(x) \equiv k f_1(x)$ where $k$ is a constant. Thus there is a


4 Thanks are due the referee for suggesting the use of this theorem to eliminate a large 
part of the author's original proof. 





natural one-to-one correspondence between the maximal subspaces of $X$ and the
one dimensional subspaces of the space of all linear functionals on $X$. If $f(x)$
is continuous as well as linear then its null-space is obviously closed. (We
suppose now that $X$ is normed.) Conversely, given any closed maximal subspace
$M$ of $X$, it follows from the lemma on page 57 of [1] that there exists a
non-trivial continuous linear functional vanishing on $M$ and hence having $M$ for its
null-space. Thus every linear functional having $M$ as its null-space is
continuous. In other words our one-to-one correspondence associates closed
subspaces with continuous functionals and vice versa.

Lemma B. If $X_1$ and $X_2$ are normed linear spaces and $T$ is a one-to-one linear
transformation of all of $X_1$ into all of $X_2$ such that $T$ and $T^{-1}$ carry maximal closed
subspaces into maximal closed subspaces, then $T$ is a homeomorphism and hence
$X_1$ and $X_2$ are isomorphic.

Let $f_2(x)$ be an arbitrary non-trivial continuous linear functional defined on $X_2$.
Let $M_2$ be the null-space of $f_2$ and let $M_1 = T^{-1}(M_2)$. $M_1$ is a closed maximal
subspace of $X_1$. Therefore there exists a continuous linear functional $f_1(x)$
defined on $X_1$, having $M_1$ for its null-space. Let $z$ be a member of the
complement of $M_1$. By adjusting the arbitrary scalar multiplier, $f_1$ may be chosen so
that $f_1(z) = 1$. If $x$ is an arbitrary member of $X_1$, we may write $x = m + f_1(x) z$
where $m$ is in $M_1$. Therefore $f_2(T(x)) = f_2(T(m)) + f_2(f_1(x) T(z)) = f_2(T(z)) f_1(x)$
since $T(m)$ is in $M_2$. Now let $\{x_n\}$ be any bounded sequence of elements of $X_1$.
Then $\{f_1(x_n)\}$ is a bounded sequence of real numbers. Hence since $f_2(T(x_n)) =
f_2(T(z)) f_1(x_n)$, $\{f_2(T(x_n))\}$ is a bounded sequence of real numbers. But $f_2$ may
be any continuous linear functional on $X_2$. Therefore, by [1, p. 80, Théorème 6],
$\{T(x_n)\}$ is a bounded sequence of elements of $X_2$. Thus $T$ takes bounded sets
into bounded sets. Hence using the theorem on page 54 of [1] we conclude
that $T$ is continuous. By an exactly analogous argument, $T^{-1}$ is continuous.
This completes the proof.

Lemma B essentially says that the topology of a normed linear space is
determined as soon as it is given which maximal linear subspaces are closed. This
is closely related to a theorem of Fichtenholz [4] to the effect that the topology
is determined by the set of continuous linear functionals.

II. The Lattice Theorem

Theorem. Let $X_1$ and $X_2$ be normed linear spaces. Let $L_1$ be the lattice of
closed linear subspaces of $X_1$ and $L_2$ that of $X_2$. Then $X_1$ and $X_2$ are isomorphic
as normed linear spaces if and only if $L_1$ and $L_2$ are isomorphic as lattices.

We begin by proving some lemmas. 

Definition. If $S_1$ and $S_2$ are subsets of a linear space we denote by $S_1 \dotplus S_2$
the smallest linear subspace containing both $S_1$ and $S_2$, and by $S_1^{+}$ the smallest
linear subspace containing $S_1$.

Lemma 2.1. If $M$ is a closed linear subspace of a normed linear space $X$ and $x$
is any element of $X$, then $M \dotplus x$ is closed.





Neglecting trivial cases we suppose that $x$ is not in $M$ and that $M \dotplus x \neq X$.
By the lemma on page 57 of [1], there exists a continuous linear functional $f_1$
on $X$ which vanishes on all members of $M$ and is such that $f_1(x) = 1$. Let $y$
be any element of $X$ such that any continuous linear functional which vanishes
on all members of $M \dotplus x$ also vanishes on $y$. If $y - f_1(y)x$ is not in $M$, then,
again by the lemma on page 57 of [1], there exists a continuous linear functional
$f_2$, vanishing on all members of $M$ and such that $f_2(y - f_1(y)x) = 1$. But
$f_2 - f_2(x) f_1$ vanishes on all members of $M \dotplus x$ and hence on $y$. Therefore
$f_2(y) - f_2(x) f_1(y) = f_2(y - f_1(y)x) = 0$, a contradiction. Therefore $y - f_1(y)x$ is in $M$ and $y$
is in $M \dotplus x$. Hence if $y$ is any element not in $M \dotplus x$, there exists a continuous
linear functional vanishing on all members of $M \dotplus x$ and not vanishing on $y$.
In other words $M \dotplus x$ is an intersection of null-spaces of continuous linear
functionals and hence is closed.

As a corollary we have 

Lemma 2.2. Any finite dimensional subspace of a normed linear space is closed.

Lemma 2.3. If $L$ is the lattice of closed linear subspaces of a normed linear
space and $n$ is a positive integer, then a member $M_n$ of $L$ has dimension $n$ if and
only if there exist members of $L$, $M_1, M_2, \ldots, M_{n-1}$, such that $M_1$ covers the zero
of $L$ and $M_{i+1}$ covers $M_i$ for $i = 1, 2, \ldots, n - 1$.

If $M_n$ has dimension $n$, let $x_1, x_2, \ldots, x_n$ be a basis for $M_n$. By Lemma 2.2,
$x_1^{+}$, $x_1 \dotplus x_2$, $\ldots$, $x_1 \dotplus x_2 \dotplus \cdots \dotplus x_n$ are all members of $L$, and obviously
each covers its predecessor, except of course $x_1^{+}$, which covers 0. Conversely,
since if $N$ is finite dimensional and $N'$ covers $N$, it is obvious that the dimension
of $N'$ is one greater than that of $N$, we conclude from the existence of such a
chain that $M_n$ has dimension $n$.

We turn now to the proof of the theorem. If $X_1$ and $X_2$ are isomorphic it is
obvious that $L_1$ and $L_2$ are isomorphic. Suppose, conversely, that $L_1$ and $L_2$
are isomorphic. If $A$ is any one dimensional subspace of $X_1$, it follows from
Lemma 2.2 that $A$ is in $L_1$. Let $A'$ be the correspondent of $A$ in $L_2$ under the
lattice isomorphism. Since $A$ covers 0, $A'$ covers 0 and hence is one dimensional.
In this way we set up a one-to-one correspondence between the one dimensional
subspaces of $X_1$ and $X_2$ respectively. Let $A_1, A_2, \ldots, A_r$ be a linearly
independent set of one dimensional subspaces of $X_1$, and let $A'_1, A'_2, \ldots, A'_r$ be
their respective correspondents in $X_2$. If $A'_1, A'_2, \ldots, A'_r$ are linearly
dependent, then $A'_1 \dotplus A'_2 \dotplus \cdots \dotplus A'_r = M'$ has dimension $k < r$. By Lemma
2.2, $M'$ is in $L_2$. Let $M$ be the correspondent of $M'$ in $L_1$. It is an easy
consequence of Lemma 2.3 that $M$ has the same dimension as $M'$. But since $A'_i \subset M'$,
$A_i \subset M$ $(i = 1, 2, \ldots, r)$. Therefore $A_1 \dotplus A_2 \dotplus \cdots \dotplus A_r$ has dimension
less than $r$. This contradicts our hypothesis that $A_1, A_2, \ldots, A_r$ are linearly
independent. Since the same argument applies to linearly independent sets of
subspaces of $X_2$, we see that our one-to-one correspondence preserves linear
independence. Hence if neither $X_1$ nor $X_2$ has dimension less than three we
may apply Lemma A and conclude the existence of a one-to-one linear
transformation $T$ of all of $X_1$ into all of $X_2$ such that $A_{T(x)} = A'_x$ for all $x$ in $X_1$. Let





$M$ be any closed maximal subspace of $X_1$. $M$ is covered by $X_1$. Hence its
correspondent in $L_2$ under the lattice isomorphism, $M'$, is covered by $X_2$ and
consequently is maximal. Now $x$ is in $M$ if and only if $A_x \subset M$, which, by
virtue of the lattice isomorphism, is the case if and only if $A'_x \subset M'$. But
$A'_x = A_{T(x)}$. Therefore $x$ is in $M$ if and only if $T(x)$ is in $M'$. It follows that
$T(M) = M'$ and hence that $T$ takes closed maximal subspaces into closed
maximal subspaces. Similarly, $T^{-1}$ does likewise and applying Lemma B, we
conclude that $X_1$ and $X_2$ are isomorphic. Suppose finally that either $X_1$ or $X_2$
has dimension less than three. Then since $X_1$ and $X_2$ correspond under the
lattice isomorphism, it follows from Lemma 2.3 that the other has this same
dimension. But any two finite dimensional normed linear spaces having the
same dimension are isomorphic by Lemma B, since, by Lemma 2.2, all of the
maximal subspaces of both are closed. This completes the proof.

III. The Ring Theorem 

Definition. A continuous linear transformation $E$ of a normed linear space $X$
into itself such that $E^2 = E$ will be called a projection. The set of elements of $X$
of the form $E(x)$, where $x$ is in $X$, will be called the range of the projection $E$. The
dimension of the range will be called the dimension of $E$. If $E_1$ and $E_2$ are
projections such that the range of $E_1$ is contained in the range of $E_2$ we shall say that $E_1$
is contained in $E_2$, and if furthermore $E_2$ is not contained in $E_1$, we shall say that
$E_1$ is properly contained in $E_2$. Following the terminology used in lattice theory,
we shall say that a projection $E_2$ covers a projection $E_1$ if $E_1$ is contained properly
in $E_2$ and there exists no projection $E_3$ contained properly in $E_2$ and properly
containing $E_1$.

Lemma 3.1. Given any finite dimensional subspace $M$ of a normed linear space
$X$, there exists a projection $E$ whose range is $M$.

Let $x_1, x_2, \ldots, x_n$ be a basis for $M$. For each $i = 1, 2, \ldots, n$ set
$f_i(c_1 x_1 + \cdots + c_n x_n) = c_i$. Then $f_i(x)$ is a linear functional defined on $M$, and since
its null-space is finite dimensional and hence closed, it is continuous. By the
Hahn-Banach extension theorem [1, p. 55, Théorème 2], $f_i$ can be extended so
as to be a continuous linear functional $F_i$ defined throughout $X$. Let $E_i(x) =
F_i(x) x_i$. Since $F_i$ is continuous and linear, $E_i$ is a continuous linear
transformation of $X$ into itself; hence so is $E = E_1 + E_2 + \cdots + E_n$. Finally,
$E_i(E_j(x)) = F_i(x_j) F_j(x) x_i$. This is identically zero for $i \neq j$ and identically
$E_i(x)$ for $i = j$. Therefore $E^2 = (E_1 + E_2 + \cdots + E_n)^2 = E_1 + E_2 + \cdots +
E_n = E$. Therefore $E$ is a projection whose range is obviously $M$.
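
The construction in this proof can be checked by hand in a small case. The sketch below is only an illustration under stated assumptions: the space is taken to be three dimensional real space, the basis vectors `x1`, `x2` and the extended functionals in `functionals` are hypothetical choices made for the example (they satisfy $F_i(x_j) = 1$ for $i = j$ and $0$ otherwise), and none of the names come from the paper.

```python
from fractions import Fraction

# Hypothetical example data: M is the plane in R^3 spanned by x1 and x2.
x1 = (Fraction(1), Fraction(0), Fraction(1))
x2 = (Fraction(0), Fraction(1), Fraction(1))
basis = [x1, x2]

# Assumed extensions F_1, F_2 of the coordinate functionals f_1, f_2 on M:
# F_1(v) = v[0] and F_2(v) = v[1], so that F_i(x_j) is 1 if i == j, else 0.
functionals = [lambda v: v[0], lambda v: v[1]]


def E(v):
    """Apply E = E_1 + E_2, where E_i(v) = F_i(v) * x_i."""
    out = [Fraction(0)] * 3
    for F, x in zip(functionals, basis):
        coeff = F(v)
        out = [o + coeff * xi for o, xi in zip(out, x)]
    return tuple(out)


v = (Fraction(2), Fraction(-3), Fraction(5))
Ev = E(v)  # lies in M; applying E again leaves it unchanged
```

With these choices, `E(E(v))` equals `E(v)` and `E` fixes `x1` and `x2`, which is the idempotence and range statement of the lemma in this special case.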

Lemma 3.2. Given any closed maximal subspace $M$ of a normed linear space $X$,
there exists a projection $E$ whose range is $M$.

Let $z$ be a member of the complement of $M$. Let $f$ be a linear functional
whose null-space is $M$ and choose the arbitrary scalar multiple so that $f(z) = 1$.
Set $E(x) = x - f(x)z$. Since $M$ is closed, $f$ is continuous. Therefore $E(x)$ is
a continuous linear transformation of $X$ into itself. $f(E(x)) = f(x) - f(x) = 0$.
Therefore $E^2(x) = E(E(x)) = E(x) - f(E(x))z = E(x)$. Thus $E(x)$ is a
projection. Finally, since $f(E(x)) = 0$, the range of $E(x)$ is contained in $M$, and
furthermore, since if $x$ is in $M$, then $f(x) = 0$ so that $E(x) = x$, we see that the
range of $E$ is $M$.
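
As a concrete instance of the formula $E(x) = x - f(x)z$, the following sketch works in three dimensional real space. The functional `f` and the vector `z` are hypothetical choices for illustration, with `f(z) == 1` as the proof requires; the maximal subspace $M$ is then the plane on which `f` vanishes.

```python
# Hypothetical example: f(v) = v0 + v1 + v2 on R^3, so M = {v : f(v) = 0}.
def f(v):
    return v[0] + v[1] + v[2]


z = (1, 0, 0)  # a vector outside M, normalized so that f(z) == 1


def E(v):
    """E(v) = v - f(v) * z, the projection onto M along z."""
    c = f(v)
    return tuple(vi - c * zi for vi, zi in zip(v, z))


v = (4, -1, 2)
Ev = E(v)       # f(Ev) is 0, so Ev lies in M
m = (1, 1, -2)  # already in M, hence fixed by E
```

Here `E` sends every vector into $M$, fixes each element of $M$, and satisfies `E(E(v)) == E(v)`.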

Lemma 3.3. If $E_1$ and $E_2$ are projections defined on a normed linear space,
then $E_1$ is contained in $E_2$ if and only if $E_2 E_1 = E_1$.

If $E_2 E_1 = E_1$ and $x$ is in the range of $E_1$, then $x = E_1(y)$. But $E_2 E_1(y) = E_1(y)$;
that is $x = E_2(x)$. Therefore $x$ is in the range of $E_2$. Conversely, if $E_1$ is
contained in $E_2$, then for each $x$ in $X$, $E_1(x) = E_2(y)$ for some $y$ in $X$. Therefore
$E_2(E_1(x)) = E_2^2(y) = E_2(y) = E_1(x)$. That is, $E_2 E_1 = E_1$.

Lemma 3.4. If $n$ is a positive integer and $E_n$ is a projection defined on a normed
linear space, then $E_n$ has dimension $n$ if and only if there exist projections $E_1$,
$E_2, \ldots, E_{n-1}$ such that $E_{i+1}$ covers $E_i$ for $i = 1, 2, \ldots, n - 1$, and $E_1$ covers 0.

The truth of this lemma follows easily from Lemma 3.1 using an argument 
similar to that used in proving Lemma 2.3. 

Lemma 3.5. If $E$ is a projection on a normed linear space $X$, then the range of
$E$ is a closed maximal subspace of $X$ if and only if the projection 1 covers $E$.

If the range of $E$ is maximal it is obvious that 1 covers $E$. Conversely,
suppose that 1 covers $E$. If $x = E(y)$, then $E(x) = E(E(y)) = E(y) = x$.
Therefore $x - E(x) = 0$. If $x - E(x) = 0$, then $x = E(x)$. In other words,
the range of $E$ is the null-space of the continuous transformation $1 - E$ and
hence is closed. It follows from the lemma on page 57 of [1] that the range
of $E$ is contained in a closed maximal subspace $M$ of $X$ and hence by Lemma 3.2,
there exists a projection $E'$ whose range is $M$ and which contains $E$. Since 1
covers $E$, $M$ must also be the range of $E$.

Theorem. Let $X_1$ and $X_2$ be normed linear spaces. Let $R_1$ be the ring of all
continuous linear transformations of $X_1$ into itself and $R_2$ that of $X_2$. Then $X_1$
and $X_2$ are isomorphic as normed linear spaces if and only if $R_1$ and $R_2$ are
isomorphic as rings.

The necessity is obvious. The proof of the sufficiency is so much like that
of the lattice theorem that we shall not give it in detail. Obviously, the ring
isomorphism takes projections into projections. From Lemma 3.3 it follows
that inclusion of projections is preserved. Combining Lemmas 3.3 and 3.4 we
conclude that finite dimensional projections correspond to projections of the
same dimension, and using Lemma 3.5 instead of Lemma 3.4, that projections
with closed maximal ranges go into projections with closed maximal ranges.
We set up a one-to-one correspondence between the one dimensional subspaces
of $X_1$ and $X_2$ by passing from such a subspace $A$ in $X_1$, through a projection $E$
having $A$ for its range, to the range $A'$ of its correspondent $E'$ in $R_2$. That $A'$
is uniquely determined by $A$ is an easy consequence of Lemma 3.3. We
establish the preservation of linear independence much as we did in the lattice
theorem, using Lemma 3.1 to give us a $k$ dimensional projection containing $r$
one dimensional projections whose ranges are linearly dependent. We show
that the $T$ we get by using Lemma A is a homeomorphism using Lemma B and
Lemma 3.5. Finally, if $X_1$ or $X_2$ has dimension less than three, we show that





$X_1$ and $X_2$ have the same finite dimension and hence are isomorphic by
observing that the units of $R_1$ and $R_2$ must correspond under the ring isomorphism and
hence, being finite dimensional projections, must have the same dimension.

Eidelheit’s proof of this theorem is considerably shorter than ours. This is 
principally due to the fact that by using a device apparently only applicable in 
the ring situation he is able to avoid having to prove Lemma A. We give the 
longer proof here in order to emphasize the close relationship existing between 
this theorem and the other two. 

IV. The Group Theorem 

We begin by discussing the notion of pseudo-reflexivity. Let $X$ be a normed
linear space and let $\bar{X}$ be its conjugate space. For each $x$ in $X$, as is well known,
if we define $F_x(f) = f(x)$ for all $f$ in $\bar{X}$, $F_x$ is a member of $\bar{\bar{X}}$ and $\|F_x\| = \|x\|$.
In general, there will be members of $\bar{\bar{X}}$ which have no such representation. In
the contrary case $X$ is said to be reflexive. Even if $X$ is not reflexive it may be
such that a new norm may be introduced into $\bar{X}$ under which a linear functional
is continuous if and only if it is an $F_x$. We shall call such a space
pseudo-reflexive. The new norm in $\bar{X}$ is not uniquely determined but by virtue of the
theorem of Fichtenholz referred to at the end of Lemma B this is the case for
the corresponding norm topology. The topological linear space which $\bar{X}$
becomes under this topology we call the pseudo-conjugate of $X$. Obviously, the
pseudo-reflexivity and the pseudo-conjugate of $X$ depend only upon the norm
topology in $X$ and not upon the particular norm. Therefore we may speak of
the pseudo-conjugate of the pseudo-conjugate of a pseudo-reflexive space $X$, and
it is obvious that it always exists and is isomorphic to $X$. If $X_1$ is
pseudo-reflexive and $X_2$ is isomorphic to the pseudo-conjugate of $X_1$ so that $X_2$ is also
pseudo-reflexive and $X_1$ is isomorphic to the pseudo-conjugate of $X_2$, we say
briefly that $X_1$ and $X_2$ are pseudo-reflexive and mutually pseudo-conjugate.

A few words on the relationship between reflexivity and pseudo-reflexivity.
Clearly reflexive spaces are pseudo-reflexive. On the other hand, we can show
without difficulty that if $X$ is complete and pseudo-reflexive, then $X$ is reflexive.
In fact if $X$ is pseudo-reflexive and complete, let $F$ be a member of the second
conjugate of $X$. Let $\{f_n\}$ be a sequence of members of $\bar{X}$, bounded as a sequence
of members of the pseudo-conjugate of $X$. Then $\{f_n(x)\}$ is a bounded sequence
of real numbers for each $x$ in $X$. Therefore, by [1, p. 80, Théorème 5], $\{f_n\}$ is
bounded as a sequence of members of the ordinary conjugate of $X$ and hence
$\{F(f_n)\}$ is a bounded sequence of real numbers. Therefore $F$ is a continuous
linear functional on the pseudo-conjugate of $X$ [1, p. 54, Théorème 1] and hence
is an $F_x$. Thus $X$ is reflexive. Finally, since, as is shown in [5], the conjugate
of any normed linear space is complete, no incomplete space is ever reflexive.
In other words, a pseudo-reflexive space is reflexive if and only if it is complete.
We have examples which we expect to publish later of non-complete
pseudo-reflexive normed linear spaces.





Theorem. Let $X_1$ and $X_2$ be normed linear spaces. Let $G_1$ be the group of
all linear transformations of $X_1$ into all of itself which are continuous and have
continuous inverses and let $G_2$ be that of $X_2$. Then $G_1$ and $G_2$ are isomorphic as
groups if and only if either (a) $X_1$ and $X_2$ are isomorphic as normed linear spaces
or (b) $X_1$ and $X_2$ are pseudo-reflexive and mutually pseudo-conjugate.

We preface the proof of the theorem proper with a series of lemmas concerning
what we shall call involutions.$^6$ Given a normed linear space $X$, a continuous
linear transformation $T$ of $X$ into itself such that $T^2 = 1$ will be called an
involution. Clearly any involution on $X$ is a member of the group $G$ for $X$. If $T$
is an involution on $X$, let $M_+$ be the subspace of $X$ containing all elements $x$ in
$X$ such that $T(x) = x$, and let $M_-$ be the subspace containing all those such
that $T(x) = -x$. Let $x$ be any element in $X$. We may write $x = \tfrac{1}{2}(x +
T(x)) + \tfrac{1}{2}(x - T(x))$. But $T(\tfrac{1}{2}(x + T(x))) = \tfrac{1}{2}(T(x) + x) = \tfrac{1}{2}(x + T(x))$ and
$T(\tfrac{1}{2}(x - T(x))) = \tfrac{1}{2}(T(x) - x) = -\tfrac{1}{2}(x - T(x))$. Hence $x$ can be represented
as the sum of an element in $M_+$ and an element in $M_-$. Since $M_+$ and $M_-$
have nothing in common but 0, this representation is unique. Thus each
involution $T$ "decomposes" $X$ into two disjoint closed subspaces in one of which
$T$ is 1 and in the other of which $T$ is $-1$. These subspaces will be called the
subspaces of $T$. If at least one of them is finite dimensional, the dimension of
the one of smaller dimension will be called the dimension of $T$. If neither is
finite dimensional $T$ will be said to be infinity dimensional.
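
The decomposition $x = \tfrac{1}{2}(x + T(x)) + \tfrac{1}{2}(x - T(x))$ can be seen in the simplest possible case. The sketch below assumes, purely for illustration, that $X$ is the real plane and that $T$ is the coordinate swap; the function names are hypothetical.

```python
from fractions import Fraction


def T(v):
    """A hypothetical involution on R^2: swap the two coordinates."""
    return (v[1], v[0])


def split(v):
    """Return (x_plus, x_minus) with x_plus in M_+ and x_minus in M_-."""
    Tv = T(v)
    x_plus = tuple(Fraction(a + b, 2) for a, b in zip(v, Tv))
    x_minus = tuple(Fraction(a - b, 2) for a, b in zip(v, Tv))
    return x_plus, x_minus


v = (3, 7)
x_plus, x_minus = split(v)
# T fixes x_plus, reverses the sign of x_minus, and the two parts sum to v.
```

For this choice of $T$, $M_+$ is the diagonal and $M_-$ the anti-diagonal, both one dimensional, so $T$ is a one dimensional involution in the sense just defined.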

Lemma 4.1. If $X$ is a normed linear space, $M$ is a finite dimensional subspace
and $M'$ is any closed subspace of $X$ such that $M \dotplus M' = X$ and $M \cap M' = 0$,
then there exists an involution $T$ having $M$ and $M'$ for its subspaces.

Let $x_1, x_2, \ldots, x_n$ be a basis for $M$. For each $i = 1, 2, \ldots, n$, consider
$M_i = M' \dotplus x_1 \dotplus x_2 \dotplus \cdots \dotplus x_{i-1} \dotplus x_{i+1} \dotplus \cdots \dotplus x_n$. By Lemma 2.1, $M_i$
is closed and since $M \cap M' = 0$ and the $x_j$ are linearly independent, $x_i$ is in the
complement of $M_i$. Hence there exists a continuous linear functional $f_i$ which
has $M_i$ for its null-space and is such that $f_i(x_i) = 1$. Let $T(x) = 2f_1(x)x_1 +
2f_2(x)x_2 + \cdots + 2f_n(x)x_n - x$. Then $T(x)$ is a continuous linear
transformation of $X$ into itself such that $T(x_i) = 2x_i - x_i = x_i$ $(i = 1, 2, \ldots, n)$ and
$T(x) = -x$ for all $x$ in $M'$. Therefore $T^2(x) = x$ for all $x$ in $X$ and $T$ is an
involution. Since $T$ is 1 in $M$ and $-1$ in $M'$ it follows readily that $M$ and $M'$
are the subspaces of $T$.
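
A small numerical instance of the formula $T(x) = 2f_1(x)x_1 + \cdots + 2f_n(x)x_n - x$ may help. In this sketch the space is three dimensional real space, $M$ is spanned by the hypothetical vectors `x1`, `x2`, and $M'$ is spanned by `w`; the functionals `f1`, `f2` were worked out by hand for these particular vectors so that each vanishes on $M'$ and on the other basis vector.

```python
# Hypothetical data in R^3: M = span{x1, x2}, M' = span{w}.
x1 = (1, 0, 0)
x2 = (1, 1, 0)
w = (1, 1, 1)


def f1(v):
    """Vanishes on x2 and w, and f1(x1) == 1."""
    return v[0] - v[1]


def f2(v):
    """Vanishes on x1 and w, and f2(x2) == 1."""
    return v[1] - v[2]


def T(v):
    """T(v) = 2*f1(v)*x1 + 2*f2(v)*x2 - v."""
    a, b = 2 * f1(v), 2 * f2(v)
    return tuple(a * p + b * q - c for p, q, c in zip(x1, x2, v))


v = (5, -2, 3)
Tv = T(v)  # applying T twice returns v
```

In this example `T` is the identity on `x1` and `x2`, equals $-1$ on `w`, and squares to the identity, as the proof asserts in general.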

Lemma 4.2. If M is a finite dimensional subspace of a normed linear space X, 
then there exists an involution T having M as one of its subspaces. 

By Lemma 3.1, there exists a projection $E$ whose range is $M$. Let $M'$ be
the null-space of $E$. Then, as is readily verified, $M$ and $M'$ satisfy the hypotheses
of Lemma 4.1.

Lemma 4.3. Let $M_+$ and $M_-$ be the subspaces of an involution $T$ on a normed
linear space $X$. Let $U_+$ and $U_-$ respectively be arbitrary continuous linear
transformations of $M_+$ and $M_-$ into themselves. Let $U$ be the unique linear
transformation coinciding on $M_+$ and $M_-$ with $U_+$ and $U_-$. Then $U$ is continuous.


6 Cf. Sobczyk [0] page 80.





Let $\{x_n\}$ be a bounded sequence of elements of $X$. Then $\{x_n + T(x_n)\}$ is a
bounded sequence of elements of $M_+$ and $\{x_n - T(x_n)\}$ is a bounded sequence
of elements of $M_-$. Accordingly, $\{U_+(x_n + T(x_n))\}$ is a bounded sequence of
elements of $M_+$, and $\{U_-(x_n - T(x_n))\}$ is a bounded sequence of elements
of $M_-$. But $U_+(x_n + T(x_n)) + U_-(x_n - T(x_n)) = U(x_n + T(x_n) + x_n -
T(x_n)) = 2U(x_n)$ $(n = 1, 2, \ldots)$. Therefore $\{U(x_n)\}$ is a bounded sequence
of elements of $X$. Hence by [1, p. 54, Théorème 1], $U$ is continuous.

Lemma 4.4. If $T$ is an involution on a normed linear space $X$ and $U$ is an
arbitrary linear transformation of $X$ into itself then $UT = TU$ if and only if
$U(M_+) \subset M_+$ and $U(M_-) \subset M_-$, where $M_+$ and $M_-$ are the subspaces of $T$.

If $UT = TU$ and $x$ is in $M_-$, then $T(U(x)) = U(T(x)) = U(-x) = -U(x)$.
Hence $U(x)$ is in $M_-$. Similarly, if $x$ is in $M_+$ then $T(U(x)) = U(T(x)) =
U(x)$ and $U(x)$ is in $M_+$. Conversely, suppose $U(M_+) \subset M_+$ and $U(M_-) \subset
M_-$. If $x$ is in $X$, then $x = x_+ + x_-$ where $x_+$ is in $M_+$ and $x_-$ is in $M_-$.
$UT(x) = U(x_+ - x_-) = U(x_+) - U(x_-)$. $TU(x) = T(U(x_+) +
U(x_-)) = U(x_+) - U(x_-)$. Therefore $UT(x) = TU(x)$ for all $x$ in $X$.

Lemma 4.5. If $U$ is a linear transformation of a normed linear space $X$ into
itself and $UT = TU$ for every involution $T$ on $X$, then $U$ is a constant; that is,
there exists a scalar $\lambda$ such that $U(x) = \lambda x$ for all $x$ in $X$.

Given any $x$ in $X$, by Lemma 4.2, there exists an involution $T$ one of whose
subspaces is $x^{+}$. Since $UT = TU$, it follows from Lemma 4.4 that $U(x) =
\lambda_x x$. Let $x$ and $y$ be any two elements of $X$. $U(x + y) = \lambda_{x+y}(x + y) =
\lambda_x x + \lambda_y y$. Hence if $x$ and $y$ are linearly independent, $\lambda_x = \lambda_{x+y} = \lambda_y$. If
$y = \mu x$, then $U(y) = \mu U(x) = \mu \lambda_x x = \lambda_x y$. In any case $\lambda_x = \lambda_y$. Therefore
there exists $\lambda$, independent of $x$, such that $U(x) = \lambda x$ for all $x$ in $X$.

Definition. If $A$ is an arbitrary set of continuous linear transformations of
a normed linear space into itself, we shall let $A^*$ denote the set of all involutions $T$
such that $UT = TU$ for every $U$ in $A$.

Definition. Let $T_0$ be an involution. For each $n = 1, 2, \ldots$ choose
involutions $T_1, T_2, \ldots, T_n$, not necessarily distinct, such that $T_i T_j = T_j T_i$ $(i, j =
0, 1, \ldots, n)$. For each choice of $T_1, T_2, \ldots, T_n$ there will be a certain number
of elements in $(T_0, T_1, \ldots, T_n)^{**}$. As we shall see, these numbers are finite and
for fixed $T_0$ and $n$ form a bounded set. We denote by $f(T_0, n)$ the largest number in
the set for each $T_0$ and $n$.

Lemma 4.6. If $T_0$ is an involution on a normed linear space $X$ and $X$ is not
finite dimensional, then $T_0$ is finite dimensional if and only if $\sup_n (f(T_0, n)/2^{(2^n)})$
is finite and if $T_0$ is finite dimensional, its dimension is equal to $\log_2(\sup_n
(f(T_0, n)/2^{(2^n)}))$.

Let $T_1, T_2, \ldots, T_n$ be such that $T_i T_j = T_j T_i$ $(i, j = 0, 1, \ldots, n)$. Denote
the subspace on which $T_j$ is 1 by $M_j^{1}$ and that on which it is $-1$ by $M_j^{-1}$ $(j = 0,
1, \ldots, n)$. Since $T_0 T_1 = T_1 T_0$ it follows from Lemma 4.4 that $T_1(M_0^{1}) \subset M_0^{1}$
and $T_1(M_0^{-1}) \subset M_0^{-1}$. Hence $M_0^{1} = (M_0^{1} \cap M_1^{1}) \dotplus (M_0^{1} \cap M_1^{-1})$ and similarly
for $M_0^{-1}$. Let $M_0^{(-1)^{i_0}} \cap M_1^{(-1)^{i_1}} = X_{i_0 i_1}$ $(i_0 = 0, 1;\ i_1 = 0, 1)$. Then $X = X_{00} \dotplus
X_{01} \dotplus X_{10} \dotplus X_{11}$ where each $X_{i_0 i_1}$ has nothing in common with the $\dotplus$ union of





all the rest except 0 and we see that $T_0$ and $T_1$ are constant in each $X_{i_0 i_1}$ and by
applying Lemma 4.3 twice that any linear transformation of $X$ into itself which
takes each $X_{i_0 i_1}$ into itself (that is, $T(X_{i_0 i_1}) \subset X_{i_0 i_1}$) and is continuous on each
$X_{i_0 i_1}$ is continuous. Since $T_2 T_0 = T_0 T_2$ and $T_2 T_1 = T_1 T_2$, $T_2$ takes each $X_{i_0 i_1}$
into itself and accordingly decomposes each $X_{i_0 i_1}$ into $X_{i_0 i_1 1}$ and $X_{i_0 i_1 0}$.
Continuing this process we finally obtain $X = X_{11 \cdots 1} \dotplus X_{11 \cdots 0} \dotplus \cdots$, the summation
extending over $X$'s having as subscripts all $(n + 1)$-uples of 0's and 1's, where
$X_{i_0 i_1 \cdots i_n} = M_0^{(-1)^{i_0}} \cap M_1^{(-1)^{i_1}} \cap \cdots \cap M_n^{(-1)^{i_n}}$. Furthermore, as above, we
conclude that each $T_j$ $(j = 0, 1, \ldots, n)$ is constant in each $X_{i_0 i_1 \cdots i_n}$ and that any
linear transformation of $X$ into itself which takes each $X_{i_0 i_1 \cdots i_n}$ into itself and
is continuous on each $X_{i_0 i_1 \cdots i_n}$ is continuous. Let $U$ be any member of $(T_0,
T_1, \ldots, T_n)^*$. Using Lemma 4.4, it is easy to see that $U$ takes each $X_{i_0 i_1 \cdots i_n}$
into itself. Conversely, since each $T_j$ is constant in each $X_{i_0 i_1 \cdots i_n}$, any such
$U$ is in $(T_0, T_1, \ldots, T_n)^*$ provided that it is an involution. In other words
we get the general member of $(T_0, T_1, \ldots, T_n)^*$ by considering an arbitrary
involution on each of the subspaces $X_{i_0 i_1 \cdots i_n}$ of $X$ and taking the unique linear
transformation coinciding with these where they are defined. Since, in
particular, we may select the involution 1 in all but one of the $X_{i_0 i_1 \cdots i_n}$ and $-1$ in the
one remaining, we see by Lemma 4.4 that any member of $(T_0, T_1, \ldots, T_n)^{**}$
must take each $X_{i_0 i_1 \cdots i_n}$ into itself. Finally, since given any involution defined
in an $X_{i_0 i_1 \cdots i_n}$, there exists an involution in $(T_0, T_1, \ldots, T_n)^*$ coinciding with
the given one where it is defined, it follows from Lemma 4.5 that any member
of $(T_0, T_1, \ldots, T_n)^{**}$ must be constant on each $X_{i_0 i_1 \cdots i_n}$. Conversely, if we
assign 1's and $-1$'s in an arbitrary manner to the $X_{i_0 i_1 \cdots i_n}$, it is clear that there
exists a unique linear transformation of $X$ into itself which in each $X_{i_0 i_1 \cdots i_n}$ is
1 or $-1$ according to the above assignment and that this transformation is a
member of $(T_0, T_1, \ldots, T_n)^{**}$. Thus the number of members of $(T_0, T_1, \ldots,
T_n)^{**}$ is $2^k$ where $k$ is the number of non-zero $X_{i_0 i_1 \cdots i_n}$. This justifies the
statement made in the definition of $f(T_0, n)$ and tells us that $f(T_0, n) \le 2^{(2^n)} \cdot 2^{(2^n)}$.
On the other hand if $T_0$ is $m$ dimensional where $m$ is a positive integer, then
since half of the subspaces $X_{i_0 i_1 \cdots i_n}$ are subspaces of an $m$ dimensional
space we have $f(T_0, n) \le 2^m \, 2^{(2^n)}$. In other words if we make the convention that
$2^m = \aleph_0$ whenever $T_0$ is infinity dimensional then we have in any case $f(T_0, n) \le
2^{(2^n)} \min(2^m, 2^{(2^n)})$. We shall now show that the equality holds. Suppose first
that $T_0$ is infinity dimensional or that it is $m$ dimensional where $m$ is an integer
and $m \ge 2^n$. Then in $M_0^{1}$ we may select $2^n$ linearly independent elements
$x_1, x_2, \ldots, x_{2^n}$. By Lemma 4.2, there exists an involution defined on $M_0^{1}$
having $x_2 \dotplus x_3 \dotplus \cdots \dotplus x_{2^n}$ for one of its subspaces. Let $M_1$ be the other
subspace of this involution. Similarly define $N_1, y_2, y_3, \ldots, y_{2^n}$ in $M_0^{-1}$. Let
$M_i = x_i^{+}$ and let $N_i = y_i^{+}$ $(i = 2, 3, \ldots, 2^n)$. Then $X = M_1 \dotplus M_2 \dotplus \cdots \dotplus
M_{2^n} \dotplus N_1 \dotplus N_2 \dotplus \cdots \dotplus N_{2^n}$ where all of the subspaces concerned are closed
and at least one dimensional and each has nothing but 0 in common with the
$\dotplus$ union of the rest. Put the subspaces $M_1, M_2, \ldots, M_{2^n}$ into one-to-one
correspondence with the $2^n$ $(n + 1)$-uples of 0's and 1's $(0, i_1, i_2, \ldots, i_n)$ and the





subspaces $N_1, N_2, \ldots, N_{2^n}$ with those of the form $(1, i_1, i_2, \ldots, i_n)$. Now
given any $j = 1, 2, \ldots, n$, consider the unique linear transformation $T_j$ which
is 1 in each subspace associated with an $(i_0, i_1, \ldots, i_n)$ for which $i_j = 0$ and is
$-1$ on each subspace for which $i_j = 1$. Since $M_1 \dotplus M_2 \dotplus \cdots \dotplus M_{2^n} = M_0^{1}$
and $N_1 \dotplus N_2 \dotplus \cdots \dotplus N_{2^n} = M_0^{-1}$ and all of these subspaces except possibly
$M_1$ and $N_1$ are one dimensional, it follows from Lemmas 2.1, 4.1, and 4.3 that
$T_j$ is continuous. Furthermore it is clear that $T_j^2 = 1$ and that $T_i T_j = T_j T_i$
$(i, j = 0, 1, 2, \ldots, n)$. Finally we see that each $X_{i_0 i_1 \cdots i_n}$ for this choice of
$T_1, T_2, \ldots, T_n$ is the $M_i$ or $N_i$ associated with $(i_0, i_1, \ldots, i_n)$ and hence is at
least one dimensional. In other words $(T_0, T_1, \ldots, T_n)^{**}$ contains $2^{(2^{n+1})} =
2^{(2^n)} \cdot 2^{(2^n)} = 2^{(2^n)} \cdot \min(2^{(2^n)}, 2^m)$ members. Suppose now that $T_0$ is $m$
dimensional where $m$ is an integer less than $2^n$. We may suppose without loss of
generality that the $m$ dimensional subspace of $T_0$ is the one on which $T_0$ is 1.
Let $x_1, x_2, \ldots, x_m$ be the elements of a basis for $M_0^{1}$. Let $M_i = x_i^{+}$ $(i = 1, 2,
\ldots, m)$. Let $M_i$ $(i = m + 1, m + 2, \ldots, 2^n)$ denote the 0 dimensional
subspace. Now we proceed exactly as before and define involutions $T_1, T_2, \ldots, T_n$
such that $T_i T_j = T_j T_i$ $(i, j = 1, 2, \ldots, n)$ but such that exactly $2^n + m$ of
the $X_{i_0 i_1 \cdots i_n}$ are at least one dimensional. We are assured of being able to get
our full quota of non-trivial $N$'s by our hypothesis that $X$ is not finite
dimensional. Thus in this case also the $T_j$ may be chosen so that $(T_0, T_1, \ldots, T_n)^{**}$
contains $2^{(2^n)} \cdot \min(2^{(2^n)}, 2^m)$ members. Hence $f(T_0, n) = 2^{(2^n)} \cdot \min(2^{(2^n)}, 2^m)$.
Now consider the behavior of $f(T_0, n)/2^{(2^n)}$ as $n$ increases. If $T_0$ is infinity
dimensional, then $f(T_0, n)/2^{(2^n)} = \min(2^{(2^n)}, 2^m) = 2^{(2^n)}$, which increases
without limit. If $T_0$ is $m$ dimensional where $m$ is an integer, then $f(T_0, n)/2^{(2^n)}$
increases with $n$ until $n \ge \log_2 m$ and thereafter we have $f(T_0, n)/2^{(2^n)} = 2^m$.
Therefore $f(T_0, n)/2^{(2^n)}$ is bounded and has $2^m$ for its greatest value. Thus
$m = \log_2(\sup_n (f(T_0, n)/2^{(2^n)}))$ and the lemma is proved.
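
The counting formula $f(T_0, n) = 2^{(2^n)} \cdot \min(2^{(2^n)}, 2^m)$ obtained in this proof is easy to tabulate. The sketch below simply evaluates that closed form; it does not construct any involutions. The function names and the use of `None` to stand for an infinity dimensional $T_0$ are conventions adopted here for illustration only.

```python
def f(m, n):
    """Closed form for f(T_0, n); m is the dimension of T_0, or None if infinite."""
    cells = 2 ** n  # number of subspaces X_{i_0...i_n} inside each subspace of T_0
    small = cells if m is None else min(cells, m)
    return 2 ** (cells + small)


def ratio(m, n):
    """f(T_0, n) / 2^(2^n), which is an exact power of two."""
    return f(m, n) // 2 ** (2 ** n)


# For a 3 dimensional T_0 the ratio stabilizes at 2^3 once 2^n >= 3.
finite_ratios = [ratio(3, n) for n in range(1, 7)]
recovered_dimension = max(finite_ratios).bit_length() - 1

# For an infinity dimensional T_0 the ratio 2^(2^n) grows without bound.
infinite_ratios = [ratio(None, n) for n in range(1, 5)]
```

Taking the base 2 logarithm of the largest ratio recovers the dimension in the finite case, while in the infinite case the sequence of ratios is strictly increasing, in line with the two alternatives of the lemma.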

Lemma 4.7. If $X$ is a normed linear space, $X$ fails to be finite dimensional if
and only if for each positive integer $n$ there exist $n$ distinct involutions $T_1, T_2,
\ldots, T_n$ such that $T_i T_j = T_j T_i$ $(i, j = 1, 2, \ldots, n)$. If $X$ is finite dimensional
and $k$ is the largest positive integer for which there exist $k$ distinct mutually
permutable involutions, then $m = \log_2 k$.

The proof of this lemma is so like certain parts of the proof of Lemma 4.6 
that we omit it. 

Lemma 4.8. Let $X$ be a non finite dimensional normed linear space. Let $T_1$
and $T_2$ be one dimensional involutions which do not commute $(T_1 T_2 \neq T_2 T_1)$.
Let $M_1$ and $M_2$ be their respective infinite dimensional subspaces and let $\phi_1$ and $\phi_2$
be basis elements for their one dimensional subspaces. Then if $T$ is an involution
on $X$, $T$ is contained in $(T_1, T_2)^{**}$ if and only if one subspace of $T$ contains $M =
M_1 \cap M_2$ and the other is contained in $N = \phi_1 \dotplus \phi_2$.

We begin with a proof of the sufficiency of the condition. Let $U$ be an
arbitrary member of $(T_1, T_2)^*$. Let $M_+$ and $M_-$ be the subspaces of $U$. By Lemma
4.4, $T_1$ takes $M_+$ into $M_+$ and $M_-$ into $M_-$ and so does $T_2$. Hence each of
these is constant in one of $M_+$ and $M_-$ and is one dimensional in the other.





Suppose that $T_1$ is constant in one of $M_+$ and $M_-$ and that $T_2$ is constant in
the other. Then $T_1T_2 = T_2T_1$ in each of $M_+$ and $M_-$ and hence $T_1T_2 = T_2T_1$
contrary to hypothesis. Hence $\phi_1$ and $\phi_2$ are both contained in the same
subspace which we may suppose to be $M_+$. We may suppose without loss of
generality that $\phi_1^+$ and $\phi_2^+$ are the subspaces of $T_1$ and $T_2$ respectively in
which these transformations are 1. Hence $N \subset M_+$ and $M_- \subset M$. Let $T$
be an arbitrary involution whose 1 space is in $N$ and whose $-1$ space contains
$M$. Then since $T(x) + x$ is always in the 1 space of $T$ and $N$ is in $M_+$, we have
$U(T(x) + x) = T(x) + x$ for all $x$ in $X$ and this may be written in the form
$U(T(x)) = T(x) - U(x) + x$. On the other hand, since $U(x) - x$ is always in
$M_-$ and $M_-$ is in $M$, we have $T(U(x) - x) = x - U(x)$ for all $x$ in $X$ and this
takes the form $T(U(x)) = T(x) - U(x) + x$. Comparing these expressions
we conclude that $UT = TU$ and hence that $T$ is a member of $(T_1, T_2)^{**}$. To
prove the necessity, let $T$ be an arbitrary member of $(T_1, T_2)^{**}$. Given any
member $y$ of the complement of $N$, let $N' = N + y$. Since $X$ is infinity
dimensional, so is $M$ and hence there exists $m$ in $M$ and not in $N'$. Let $f$ be the
unique linear functional defined on $N' + m$ such that $f(\phi_1) = f(\phi_2) = 0$ and
$f(y) = f(m) = 1$. Since any maximal subspace of a finite dimensional normed linear
space is closed, any linear functional defined on one is continuous. Hence by
the Hahn-Banach extension theorem [1, p. 55, Théorème 2], there exists a
continuous linear functional $F$ defined on $X$ such that $F(\phi_1) = F(\phi_2) = 0$ and
$F(y) = F(m) = 1$. Consider the continuous linear transformation $U(x) = x - 2F(x)m$.
$U(U(x)) = x - 2F(x)m - 2F(x)(m - 2m) = x$. Therefore $U$ is an involution.
Furthermore for $i = 1, 2$, $U(T_i(x)) = T_i(x) - 2F(T_i(x))m$ and $T_i(U(x)) =
T_i(x) - 2F(x)T_i(m) = T_i(x) + 2F(x)m$. But since $T_i(x) + x$ is in the 1 space
of $T_i$ and so in $N$, $F(T_i(x) + x) = 0$ and $-2F(T_i(x)) = 2F(x)$ for all $x$ in $X$.
It follows that $UT_i = T_iU$ so that $U$ is in $(T_1, T_2)^*$. Hence $UT = TU$. In
other words, $T(x) - 2F(T(x))m = T(x) - 2F(x)T(m)$ or $F(T(x))m = F(x)T(m)$
for all $x$ in $X$. Thus $T(m) = \lambda m$. Since $m$ may be any element of $M$ not in
the (at most one dimensional) intersection of $N$ and $M$, it follows by an
argument similar to that used in Lemma 4.5 that $T$ is constant in $M$, and hence
that $M$ is contained in one of the subspaces of $T$. There is no loss in generality
in supposing that it is the $-1$ space. Thus $\lambda = -1$ and our next to the last
equation becomes $F(T(x) + x) = 0$ for all $x$ in $X$. In other words, the 1 space
of $T$ is contained in a subspace containing $N$ and not $y$. But $y$ was an arbitrary
element of $X - N$. Therefore the 1 space of $T$ is contained in $N$ and the lemma
is proved.
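The key construction in the proof above, the transformation $U(x) = x - 2F(x)m$ with $F(m) = 1$, can be checked numerically in a small finite dimensional stand-in. The following is only a sketch under stated assumptions: the space $\mathbb{Q}^3$, the functional `F` (projection onto the last coordinate) and the vector `m` are hypothetical choices made for illustration and do not come from the paper.

```python
from fractions import Fraction

def F(x):
    # Hypothetical linear functional on Q^3 (an assumption): the last coordinate.
    return x[2]

# Hypothetical vector with F(m) = 1, as the construction requires.
m = (Fraction(1), Fraction(2), Fraction(1))

def U(x):
    # U(x) = x - 2 F(x) m
    c = 2 * F(x)
    return tuple(xi - c * mi for xi, mi in zip(x, m))

def is_involution_on(samples):
    # U is an involution on the samples if applying it twice returns each vector.
    return all(U(U(x)) == x for x in samples)

samples = [
    tuple(Fraction(v) for v in vec)
    for vec in [(1, 0, 0), (0, 1, 0), (0, 0, 1), (3, -5, 7)]
]
```

In this sketch `U` fixes every vector annihilated by `F` and sends `m` to `-m`, which is the finite dimensional picture of an involution whose $-1$ space is spanned by `m`.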

Definition. If $T_1$ and $T_2$ are one dimensional involutions, we say that $(T_1, T_2)$
is a minimal pair if it is impossible to select distinct one dimensional involutions
$T_3$ and $T_4$ in $(T_1, T_2)^{**}$ so that $(T_3, T_4)^{**}$ is a proper subset of $(T_1, T_2)^{**}$ and
$(T_3, T_4)^{**}$ contains an infinite number of members if $(T_1, T_2)^{**}$ does.

Lemma 4.9. If $T_1$ and $T_2$ are one dimensional involutions defined on a non
finite dimensional normed linear space $X$, then $T_1$ and $T_2$ have a common subspace
if and only if $(T_1, T_2)$ is a minimal pair.





Let $M_1$, $M_2$, $M$, $\phi_1$, $\phi_2$, and $N$ be defined as in Lemma 4.8. If $M_1 \neq M_2$
and $\phi_1^+ \neq \phi_2^+$, let $N_3$ and $N_4$ be two distinct one dimensional subspaces of the
two dimensional subspace $N$ neither of which is $M_1 \cap N$. Then $M_1 + N_3 =
M_1 + N_4 = X$. By Lemma 4.1, there exist involutions $T_3$ and $T_4$ such that $M_1$
is a subspace of both and $N_3$ and $N_4$ are respectively their other subspaces.
Since $N_3 \neq N_4$ and $M_3 = M_4$, $T_3$ and $T_4$ do not commute. Therefore it follows
from Lemma 4.8, provided that $T_1$ and $T_2$ do not commute, that $T_3$ and $T_4$
are in $(T_1, T_2)^{**}$. Again by Lemma 4.8, since $M_1$ and $M_2$ are both maximal
and $M_1 \neq M_2$, $T_2$ is not in $(T_3, T_4)^{**}$. Finally since there are an infinite
number of ways of choosing a one dimensional subspace of $N$ different from
$M_1 \cap N$, $(T_3, T_4)^{**}$ contains an infinite number of members. Therefore if $T_1$
and $T_2$ do not commute, $(T_1, T_2)$ is not a minimal pair. If $T_1T_2 = T_2T_1$,
then it follows from the argument used in Lemma 4.6 that $(T_1, T_2)^{**}$ contains
exactly eight members. If we let $T_3 = T_1$ and $T_4 = -T_1$, the same sort of
argument tells us that $(T_3, T_4)^{**}$ contains only four members. Therefore in
any case $(T_1, T_2)$ is not a minimal pair. Suppose now that $M_1 = M_2 = M$
and $\phi_1^+ \neq \phi_2^+$. By Lemma 4.8, if $T_3$ and $T_4$ are one dimensional members
of $(T_1, T_2)^{**}$ then $M_3 = M_4 = M$ and $\phi_3$ and $\phi_4$ are in $N$. Hence if $\phi_3$ and $\phi_4$
are linearly independent so that $T_1T_2 \neq T_2T_1$ and $\phi_3 + \phi_4 = N$, then again by
Lemma 4.8, $(T_3, T_4)^{**} = (T_1, T_2)^{**}$. If $\phi_3^+ = \phi_4^+$ then $T_4 = -T_3$ and it
follows from Lemma 4.6 that $(T_3, T_4)^{**}$ contains only a finite number of
members. But by means of the argument used in the first part of the proof it can
be shown that $(T_1, T_2)^{**}$ contains an infinity of members. Hence $(T_1, T_2)$ is a
minimal pair. If $M_1 \neq M_2$ and $\phi_1^+ = \phi_2^+$ the argument is similar. It depends
upon the fact that if $M_3 \neq M_4$ and $M_3 \cap M_4 \supset M$ then $M_3 \cap M_4 = M$ and
the fact that there are two linearly independent continuous linear functionals
vanishing on $M$. Finally, if $M_1 = M_2$ and $\phi_1^+ = \phi_2^+$, then $T_1 = \pm T_2$ and
hence $(T_1, T_2)^{**}$ contains only $1$, $-1$, $T_1$, and $-T_1$. Therefore $T_1$ and $-T_1$
are the only one dimensional involutions present. Thus $(T_1, T_2)$ is a minimal
pair and the proof of the lemma is complete.

Lemma 4.10. Let $X$ be a non finite dimensional normed linear space. Let $m$
be a positive integer. Then if $T$ is a one dimensional involution on $X$ and $T_1$ is an
$m$ dimensional one, the one dimensional subspace of $T$ is contained in the $m$
(infinity) dimensional subspace of $T_1$ if and only if there exists an involution $T'$
having the same one dimensional subspace as $T$ and such that $T'T_1 = T_1T'$ and
is an $m - 1$ ($m + 1$) dimensional involution.

Let $M_m$, $M_1$, $M_\infty$, and $M_\infty'$ denote the subspaces of $T$ and $T_1$. We may
suppose without loss of generality that $M_\infty$ and $M_\infty'$ are the $-1$ subspaces. If
$M_1 \subset M_m$, let $x_1, x_2, \ldots, x_m$ be a basis for $M_m$ such that $x_1$ is in $M_1$. By
Lemma 4.1, there exists an involution $T'$ whose 1 subspace is $M_1$ and whose $-1$
subspace is $M_\infty + x_2 + \cdots + x_m$. Both $T_1$ and $T'$ are $-1$ in $M_\infty$ and both
are 1 in $M_1$. In $x_2 + \cdots + x_m$ one is $-1$ and the other is 1. Thus $T_1T' =
T'T_1$ and is 1 on $M_\infty + M_1$ and $-1$ on $x_2 + \cdots + x_m$; that is, is an $m - 1$
dimensional involution. If $M_1 \subset M_\infty$ the argument is similar. However,
instead of a basis, we use the existence of an involution on $M_\infty$ having $M_1$ for a
subspace. Conversely, suppose that $T'$ has $M_1$ for its one dimensional
subspace and that $T'T_1 = T_1T'$. Then, by Lemma 4.4, $T'$ takes $M_\infty$ into $M_\infty$
and $M_m$ into $M_m$. Hence it is constant on one of $M_m$ and $M_\infty$ and a one
dimensional involution on the other. In other words, $M_1$ is contained in either $M_m$
or $M_\infty$ and $T'T_1$ is obviously accordingly either an $m - 1$ or $m + 1$ dimensional
involution.
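The dimension count in Lemma 4.10 can be illustrated with diagonal involutions on a finite coordinate space. This is a sketch under explicit assumptions: the space is $\mathbb{Q}^{10}$ rather than an infinite dimensional normed space, the "infinity dimensional" subspace is modelled by the large block of coordinates, and the helper names are hypothetical.

```python
def diag_involution(n, plus_one_coords):
    # Diagonal involution on an n-dimensional coordinate space:
    # +1 on the listed coordinates, -1 on all the others.
    return [1 if i in plus_one_coords else -1 for i in range(n)]

def product(s, t):
    # Product of two diagonal involutions (they always commute).
    return [a * b for a, b in zip(s, t)]

def minus_dim(s):
    # Dimension of the -1 subspace of a diagonal involution.
    return s.count(-1)

n, m = 10, 3
T1 = diag_involution(n, {0, 1, 2})   # +1 space of dimension m, -1 on the rest
T_inside = diag_involution(n, {0})   # one dimensional +1 space inside the m-block
T_outside = diag_involution(n, {7})  # one dimensional +1 space inside the large block
```

With these choices the product of `T1` and `T_inside` is $-1$ on exactly $m - 1$ coordinates, and the product of `T1` and `T_outside` is $-1$ on exactly $m + 1$ coordinates, matching the two alternatives in the lemma.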

Lemma 4.11. Let $X$ be a non finite dimensional normed linear space. Let $m$
be a positive integer. Then if $T$ is a one dimensional involution on $X$ and $T_1$
is an $m$ dimensional one, the infinity dimensional subspace of $T$ contains the $m$
(infinity) dimensional subspace of $T_1$ if and only if there exists an involution $T'$
having the same infinity dimensional subspace as $T$ and such that $T'T_1 = T_1T'$
and is an $m + 1$ ($m - 1$) dimensional involution.

The proof is analogous to that of Lemma 4.10. 

We turn now to the proof of the theorem. If $X_1$ and $X_2$ are isomorphic, the
isomorphism of $G_1$ and $G_2$ is obvious. Suppose that $X_1$ and $X_2$ are
pseudo-reflexive and mutually pseudo-conjugate. Then as $X_2$ is isomorphic to $\overline{X}_1$
under a norm for which the elements of $X_1$ define the continuous linear
functionals on $\overline{X}_1$, it will be sufficient to prove that $G_1$ is isomorphic to $G_3$ where $G_3$
is the group of automorphisms of $\overline{X}_1$ under this norm. In the rest of this
discussion wherever the topology of $\overline{X}_1$ occurs it will be understood to be the one
under which it is isomorphic to $X_2$. Given any $T$ in $G_1$, let $T' = (T^{-1})^*$ where
$T^*$ is Banach's conjugate [1, p. 100]. Then since, as is well known, $(T_1T_2)^* =
T_2^*T_1^*$ and $(T_1T_2)^{-1} = T_2^{-1}T_1^{-1}$ it follows that $(T_1T_2)' = T_1'T_2'$. If $T' = 1$ for
some $T$, then $f(T^{-1}(x)) = f(x)$ or $f(T^{-1}(x) - x) = 0$ for all $f$ in $\overline{X}_1$ and all $x$ in $X_1$.
Hence [1, p. 55, Théorème 3] $T^{-1}(x) - x = 0$ whence $T = 1$. Next any $T'$
is continuous and hence, since $(T')^{-1} = (T^{-1})'$, has a continuous inverse and is
in $G_3$. In fact if $\{f_n\}$ is a bounded sequence of elements of $\overline{X}_1$, then
$\{f_n(T^{-1}(x))\}$ is a bounded sequence of real numbers for all $x$ in $X_1$. In other
words if $g_n = T'(f_n)$, $n = 1, 2, \ldots$ then $\{g_n(x)\}$ is a bounded sequence for all $x$
in $X_1$ and hence by [1, p. 80, Théorème 6], $\{T'(f_n)\}$ is a bounded sequence of
elements of $\overline{X}_1$. Hence $T'$ is continuous [1, p. 54, Théorème 1]. Now let $U$
be any member of $G_3$. Then if we identify each member of $X_1$ with the
corresponding functional on $\overline{X}_1$ and repeat the argument we have just given we find
that $(U^{-1})^*$ is a member of $G_1$ and as is easily verified $(((U^{-1})^*)^{-1})^* = U$. Thus
the set of elements of the form $T'$ where $T$ is in $G_1$ is precisely $G_3$ and $T \to T'$
is an isomorphism.
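The algebra behind the map $T \to T' = (T^{-1})^*$ can be sketched for $2 \times 2$ rational matrices, with the transpose standing in for Banach's conjugate. This finite dimensional model and the particular matrices `T1`, `T2` are assumptions made purely for illustration.

```python
from fractions import Fraction

def matmul(a, b):
    # Product of two 2x2 matrices given as nested lists.
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def inverse(a):
    # Inverse of an invertible 2x2 matrix via the adjugate formula.
    det = a[0][0] * a[1][1] - a[0][1] * a[1][0]
    return [[a[1][1] / det, -a[0][1] / det],
            [-a[1][0] / det, a[0][0] / det]]

def transpose(a):
    # Transpose, used here as the finite dimensional conjugate T*.
    return [[a[j][i] for j in range(2)] for i in range(2)]

def prime(t):
    # T' = (T^{-1})*
    return transpose(inverse(t))

# Two hypothetical non-commuting automorphisms of Q^2.
T1 = [[Fraction(1), Fraction(2)], [Fraction(0), Fraction(1)]]
T2 = [[Fraction(2), Fraction(0)], [Fraction(1), Fraction(1)]]
```

The conjugate alone reverses the order of products, and so does the inverse; composing the two restores the order, which is why `prime` respects multiplication while `transpose` by itself does not for this pair.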

Suppose, conversely, that $G_1$ and $G_2$ are isomorphic as abstract groups. If
either $X_1$ or $X_2$ is finite dimensional, the isomorphism of $X_1$ and $X_2$ is an easy
consequence of Lemma 4.7 and the argument used in the corresponding part
of the lattice theorem. We suppose then that neither $X_1$ nor $X_2$ is finite
dimensional. Let $T_1$ and $T_2$ be two one dimensional involutions in $G_1$ having the same
one dimensional subspace $N$ and distinct infinity dimensional ones $M_1$ and $M_2$.
Let $T_1'$ and $T_2'$ respectively be their correspondents in $G_2$. It follows from
Lemma 4.6 that $T_1'$ and $T_2'$ are also one dimensional and from Lemma 4.9 that
they have a common subspace.

Case A. $T_1'$ and $T_2'$ have the same one dimensional subspace $N'$.

Since $T_1$ and $T_2$ have different infinity dimensional subspaces and hence do
not commute, it is clear that $T_1'$ and $T_2'$ have different infinity dimensional
subspaces. Let $T_3$ be any other involution having $N$ for one subspace. Since $T_1'$
and $T_2'$ have different infinity dimensional subspaces, $T_3'$ and one of $T_1'$ and $T_2'$
have different infinity dimensional subspaces and hence have the same one
dimensional subspace. In other words, $T_3'$ has $N'$ as a subspace. By the same
argument if $T'$ is any involution in $G_2$ having $N'$ as a subspace, then $T$ has $N$
as a subspace. Now let $N_1$ be any other one dimensional subspace of $X_1$. The
unique linear functional $f$ defined on $N_1 + N$ such that $f(\phi) = f(\phi_1) = 1$ where
$\phi_1$ and $\phi$ are non-zero members of $N_1$ and $N$ respectively is continuous and
hence [1, p. 55, Théorème 2] has a continuous linear extension. The null space
$M_3$ of the extension is closed and maximal and contains neither $N$ nor $N_1$.
By Lemma 4.1, there exists an involution $T_3$ having $M_3$ and $N$ for its subspaces
and an involution $T_4$ whose subspaces are $M_3$ and $N_1$. $T_3'$ has $N'$ for one
subspace and has a subspace in common with $T_4'$. Since $T_4$ does not have $N$ for a
subspace, $T_4'$ cannot have $N'$. Hence $T_3'$ and $T_4'$ have the same infinity
dimensional subspace. Let $T_5$ be any involution having $N_1$ for a subspace. $T_4'$ and $T_5'$
have a subspace in common. If they have the same infinity dimensional
subspace, then $T_3'$ and $T_5'$ and hence $T_3$ and $T_5$ have a subspace in common. Since
$N_1 \neq N$, it is their infinity dimensional subspace. Hence $T_5 = \pm T_4$. Hence
in any case $T_4'$ and $T_5'$ have a common one dimensional subspace. In other
words it has been shown that if $T$ and $U$ are one dimensional involutions in $G_1$,
then $T$ and $U$ have the same one dimensional subspace if and only if $T'$ and $U'$
do. From this point on the argument is so much like that used in the ring
theorem that we omit it except to say that Lemmas 3.1, 3.2, 3.3, 3.4 and 3.5
are replaced by Lemmas 4.2, 4.1, 4.10, 4.6 and 4.1 respectively and that we
conclude that $X_1$ and $X_2$ are isomorphic.

Case B. $T_1'$ and $T_2'$ have the same infinity dimensional subspace.

It follows at once from the argument used in Case A that no pair of
non-commuting one dimensional involutions in $G_1$ with a common one dimensional
subspace can have correspondents in $G_2$ with the same one dimensional subspace.
Hence whenever $T$ and $U$ have a common one dimensional subspace, $T'$ and $U'$
have a common infinity dimensional subspace, and if we associate with each
one dimensional subspace $N$ of $X_1$ the common infinity dimensional subspace
of the correspondents in $G_2$ of all members of $G_1$ having $N$ as a subspace, we
readily see that we get a one-to-one correspondence between the one
dimensional subspaces of $X_1$ and the closed maximal subspaces of $X_2$. Hence if we
associate with each closed maximal subspace of $X_2$, the one dimensional
subspace of continuous linear functionals having it for their null-space we will have
a one-to-one correspondence between the one dimensional subspaces of $X_1$ and
$\overline{X}_2$ respectively. In order to show that this correspondence preserves linear
dependence, we note first that if $M$ is the intersection of a finite number of
closed maximal subspaces of a normed linear space $X$ then there exists a finite
dimensional subspace $\overline{M}$ such that $M + \overline{M} = X$ and $M \cap \overline{M} = 0$, that while $\overline{M}$
is not uniquely determined its dimension is and is equal to the dimension of the
space of linear functionals vanishing on $M$, and that furthermore this
dimension does not exceed the number of maximal subspaces involved. It follows at
once that the continuous linear functionals $f_1, f_2, \ldots, f_n$ are linearly
dependent if and only if the intersection of their null-spaces contains the infinity
dimensional subspace of an $n - 1$ dimensional involution. Let $N_1, N_2, \ldots, N_n$
be a set of one dimensional subspaces of $X_1$. Let $M_1, M_2, \ldots, M_n$
respectively be their corresponding closed maximal subspaces of $X_2$. The $N_i$
($i = 1, 2, \ldots, n$) are linearly dependent if and only if there exists an $n - 1$
dimensional involution in $G_1$ whose finite dimensional subspace contains all of
the $N_i$. Using Lemmas 4.6, 4.10 and 4.11 and involutions having the $N_i$ as
subspaces we conclude from this that the $N_i$ are linearly dependent if and only
if there exists an $n - 1$ dimensional involution in $G_2$ whose infinity dimensional
subspace is contained in the intersection of the $M_i$; that is, if and only if the
continuous linear functionals defining the $M_i$ are linearly dependent. Let $V(f)$
be the one-to-one linear transformation from all of $\overline{X}_2$ into all of $X_1$ whose
existence we can now conclude from Lemma A. Let $\|x\|$ denote the norm of
an element $x$ of $X_1$ and set $\|f\|_1 = \|V(f)\|$ for each element $f$ in $\overline{X}_2$. Obviously
this defines a norm in $\overline{X}_2$ under which $\overline{X}_2$ is isomorphic to $X_1$. Let $F$ be a
linear functional on $\overline{X}_2$ which is continuous with respect to the norm $\|\ \|_1$.
Let $L$ be the null-space of $F$ and let $M = V(L)$. Then $M$ is a closed maximal
subspace of $X_1$. Let $T$ be an involution in $G_1$ having $M$ as a subspace. Let
$N'$ be the one dimensional subspace of the correspondent of $T$ in $G_2$. Using
Lemmas 4.6, 4.10 and 4.11 we see that a one dimensional subspace of $X_1$ is
contained in $M$ if and only if the corresponding closed maximal subspace of $X_2$
contains $N'$ and hence if and only if the corresponding one dimensional
subspace of $\overline{X}_2$ is contained in the common null-space of the elements of $N'$,
regarded as linear functionals on $\overline{X}_2$. Thus $L$ is the null-space of an element
of $N'$. It follows that $F(f) = f(x)$ for some $x$ in $N'$ (and hence in $X_2$) and all $f$
in $\overline{X}_2$. Conversely, given any $x$ in $X_2$, let $T'$ be a member of $G_2$ having $x^+$
as a subspace, let $T$ be the corresponding member of $G_1$ and let $M$ be the infinity
dimensional subspace of $T$. Finally let $L = V^{-1}(M)$. Just as before we can
show that $L$ is the null-space of $x$ and hence since $L$ is closed that $F(f) = f(x)$
is continuous. Thus $X_2$ is pseudo-reflexive and $X_1$ is isomorphic to its
pseudo-conjugate. In other words $X_1$ and $X_2$ are pseudo-reflexive and mutually
pseudo-conjugate and the theorem is proved.

Concluding Remarks 

The three theorems proved in this paper may be generalized to prove that
the system consisting of a linear space $X$ and a total subspace $L$ of the space of
all linear functionals defined on $X$ is characterized to within isomorphism by its
lattice of $L$-closed subspaces, by the ring of $L$-continuous linear transformations
of $X$ into itself and, if the system $L$, $X$ be identified with the system $X$, $L$,
by its group of automorphisms. $X_1$, $L_1$ and $X_2$, $L_2$ are said to be isomorphic
if there exist one-to-one linear transformations of all of $X_1$ into all of $X_2$ such
that whenever $x$ in $X_1$ corresponds to $x'$ in $X_2$ and $f$ in $L_1$ corresponds to $f'$ in
$L_2$ then $f(x) = f'(x')$. An $L$ continuous linear transformation is a linear
transformation $T$ such that $F(x) = f(T(x))$ for all $x$ in $X$ is in $L$ whenever $f$ is. An $L$
closed subspace is an intersection of null spaces of linear functionals in $L$. If
we let $L$ be the set of continuous functionals for a norm, we get as special cases
the theorems of this paper. In the author's thesis, now in progress, the systems
$X$, $L$ are investigated systematically.

Due principally to the fact that distinct convex topologies may have the same
set of continuous linear functionals, the isomorphism theorems cannot be
extended, as they stand, to arbitrary convex linear topological spaces. The
author expects to discuss the situation in detail in a later paper.

Eidelheit [2] shows that, at least in the case of complete spaces, any
isomorphism between the rings $R_1$ and $R_2$ can be represented in the form
$T' = UTU^{-1}$ where $T$ is in $R_1$, $T'$ is in $R_2$ and $U$ is a one-to-one bicontinuous
linear transformation from all of $X_1$ into all of $X_2$. The author expects to
discuss the extent to which this is true in the group situation at a later date.

Other questions suggesting themselves for investigation include the following:
(1) What properties must an abstract group (ring, lattice) have in order to be
the group (ring, lattice) associated with some normed linear space or more
generally some system $X$, $L$ in accordance with the above theorems? (2) What
special properties do the groups (rings, lattices) of complete, reflexive and other
special kinds of normed linear space have? (3) Can we give simple
characterizations of various well known normed linear spaces in terms of properties of the
algebraic systems associated with them? (4) Do any of the three theorems
imply any of the others without the intervention of Lemmas A and B? The
derivation of the ring theorem from the group theorem looks as if it might be
fairly easy.

Harvard University 


Bibliography 

1. Banach, S., Théorie des Opérations Linéaires, 1932.

2. Eidelheit, M., On isomorphisms of rings of linear operators, Studia Mathematica, vol. 9
(1940), pp. 97-105.

3. Veblen, O. and Young, J., Projective Geometry.

4. Fichtenholz, G., Sur les fonctionnelles linéaires continues au sens généralisé,
Matematiche Sbornik, N. S., vol. 4 (1938), pp. 193-213.

5. Hausdorff, F., Zur Theorie der linearen metrischen Räume, Journ. f. reine u. angew.
Math., vol. 167 (1932), pp. 294-311.

6. Sobczyk, A., Projections in Minkowski and Banach spaces, Duke Mathematical Journal,
vol. 8 (1941), pp. 78-106.



Annals of Mathematics
Vol. 43, No. 2, April, 1942 


NEUAUFBAU DER ENDENTHEORIE
(A NEW CONSTRUCTION OF THE THEORY OF ENDS)

By Hans Freudenthal
(Received September 2, 1941)

Compactifications of topological spaces are already familiar from other
areas of topology; for example, the need for a compact substrate for
geometric investigations is probably one of the reasons for completing the
Euclidean plane to the projective plane or to the function-theoretic plane. This
example shows that the compactification can turn out very differently.
Topologically one will regard the function-theoretic compactification as
the more natural one; topologically there is not the slightest reason
why certain divergent sequences should converge to one point at infinity and
certain others to another. For the infinite
line, on the other hand, one will topologically prefer the compactification by
two points at infinity ("end points") to that by a single one,
since there is no apparent reason to lump together the point sequences
diverging in different directions. Likewise one will compactify the
infinite cylinder by two "end points", the three-times
punctured sphere (the bathing trunks) by three end points, and so on.

Of a "natural" compactification one will therefore require:

1. The part at infinity, the set of end points, should be as thin as possible
(more precisely: zero-dimensional).

2. The part at infinity should be split up as far as possible (without
violating certain general space axioms).

In my dissertation¹ I first treated and solved this compactification problem
(for the purpose of topological group-theoretic investigations), by introducing
the "end points",² for those topological spaces $R$ that satisfy the
following conditions:

a) second axiom of countability,

b) local compactness,

c) local connectedness,

d) connectedness.

In footnote 15 of my dissertation I had already announced that
condition c can be dropped.

Mr. L. Zippin,³ on the other hand, weakened precisely condition b and
replaced it by

b') semicompactness,⁴

that is, the existence of arbitrarily small neighborhoods (of every point) with compact
boundary. b' is indeed the "true" condition if one wants
the part at infinity to be zero-dimensional, so that in particular every finite point is
contained in arbitrarily small neighborhoods that have no points "at infinity"
on their boundary.

¹ Berlin 1931: Über die Enden topologischer Räume und Gruppen, Math. Zeitschrift 33
(1931), 692-713.

² "End points" have been introduced incidentally, in very special cases, by B. v. Kerékjártó
(Vorlesungen über Topologie, Berlin 1923, p. 164) and by L. Zippin (Transactions Amer.
Math. Soc. 31 (1929), 744-770, especially 763).

³ On semicompact spaces (Amer. Journ. of Math. 57 (1935), 327-341). L. Zippin's
investigations were in part independent of those cited in footnote 1.

⁴ In fact Zippin additionally requires metrizability, which is however superfluous (see
4.2 and 5.4).

In the present paper I want to fulfill the promise made in my dissertation
and build up the theory of end points without condition c. In doing so,
quite peculiar difficulties will arise, which I shall overcome in 6.4-5;
that is therefore where the center of gravity of the paper lies.

It now turns out that d, too, can be dropped almost entirely; one replaces
d by

d') compactness of the component space of $R$,
that is: every decreasing sequence of nonempty open-closed subsets
of $R$ shall have a nonempty intersection.

Condition d' is satisfied by all connected spaces and by all spaces
with only finitely many components. On the other hand, d' excludes that $R$ consists
of infinitely many isolated components; d' does not, however,
necessarily exclude that $R$ contains infinitely many components that accumulate
nowhere. For example, the following space (a subset of the Cartesian
plane) is admissible, which is composed of the components

$C_n$: $x = 1/n$ ($n$ a natural number),

$D_n$: $x = 0$, $n < y < n + 1$ ($n$ an integer);

for here the $D_n$ accumulate nowhere, and yet d' holds, since every open-closed
set that contains one $D_n$ contains almost all $C_n$, and hence also all $D_n$.

Condition d' is in fact unavoidable if one wants to compactify a space
by end points. If one takes for $R$ a space consisting of infinitely
many isolated points, one notices at once that the
splitting of the part at infinity must here be continued transfinitely, and that
as a closure one obtains at best a pathological structure (one that does not
satisfy the first axiom of countability).

I also adopt Zippin's weakening b'. We have already seen
that here, too, no further weakening is possible if the set of end
points is to be zero-dimensional.

My hypotheses therefore read:
a) second axiom of countability,
b') semicompactness,

d') compactness of the component space.

One can also drop the second axiom of countability if one replaces
semicompactness by semibicompactness and aims not for a compact
but for a bicompact result. This is not very important; for adjoining
the end points makes the space bicompact even when d'
does not hold at all. The main difficulty in the proof (see 6.4-5) consists
precisely in showing that from a space with the second axiom of countability,
when d' holds, there again arises a space with the second axiom of countability
(which then, being a bicompact space, must be compact). Nevertheless I have
adopted, as far as possible, the somewhat more general bicompact
standpoint.

By having been able to restrict myself to the unavoidable requirements a, b', d',
I have, it seems to me, brought the theory of ends to a certain
conclusion. My method at first does not differ appreciably
from that of my dissertation (the real difference comes only in 6.4-5).
The "end points" are of course again defined by descending sequences of open
sets with compact boundary.⁵ These sequences must of course be subjected
to certain fineness restrictions. In my dissertation and
in Zippin's work this is done on the basis of requirement c. For local
connectedness has the effect that one can restrict oneself to domains for the generating open
sets, and that on further descent these domains split into
essentially only finitely many subdomains. Thus local
connectedness yields on the one hand the atomistic character of the sequences used,
and on the other hand the compactness of the result.

If c is not available, one can force the atomistic character
by maximality requirements. It seemed to me more expedient
and more intuitive, however, to require of the generating sequences $G_1 \supset G_2 \supset \cdots$
that for any two open sets $O$, $P$ with compact boundary and with $\overline{O} \subset P$
the following holds:

if $O \cap G_n \neq \emptyset$ holds, then $G_n \subset P$ for almost all $n$.

In my dissertation I also treated a characterization problem.
If $R^*$ is a compactum and $R$ is everywhere dense in $R^*$, one may
ask whether $R^*$ is perhaps precisely the compactification of $R$ by the end
points. The necessary and sufficient conditions of my dissertation
read:

$\alpha$) $R^* \setminus R$ is closed and zero-dimensional,

$\beta$) a domain neighborhood of a point of $R^* \setminus R$ is not separated
by $R^* \setminus R$.

Zippin, loc. cit., weakened $\alpha$ to

$\alpha'$) $R^* \setminus R$ is a totally disconnected $F_\sigma$.

In the present paper the characterization conditions read:

$\alpha''$) $R^* \setminus R$ is zero-dimensional,

$\beta'$) the neighborhoods of a point $p$ of $R^* \setminus R$ are not decomposed by $R^* \setminus R$
into two sets open in $R$ that both have $p$ as a point of accumulation.

⁵ Of course one must restrict oneself to open sets with compact boundary
if one does not want to arrive at pathological results. This has been
overlooked on various occasions.

The present paper was prompted by beautiful investigations of
Mr. J. de Groot, who has occupied himself with the following problem: Let $A$
and $A'$ be homeomorphic, $A \cap B = A' \cap B' = \emptyset$. When can every
homeomorphic correspondence between $A$ and $A'$ be extended to a homeomorphic
correspondence between $A \cup B$ and $A' \cup B'$?

If one requires, as Mr. de Groot in fact does, that $A \cup B$ and $A' \cup B'$
be compacta, and subjects the sets to further conditions that
characterize $B$ and $B'$ precisely as the end point sets of $A$ and $A'$ respectively, then
one does indeed force the required extendability of every homeomorphism.

Mr. de Groot arrived directly from his own formulation of the question, without knowledge
of my investigations and of Zippin's, at results that overlap with
Zippin's. In his note⁶ he requires (besides everywhere
density):

$\alpha''$) $B$ and $B'$ are zero-dimensional,

$\beta''$) $B$ and $B'$ do not locally separate $A \cup B$ and $A' \cup B'$ respectively. In further,
as yet unpublished investigations he weakens $\beta''$ further. His new
conditions are precisely identical with the conditions $\alpha''$, $\beta'$ mentioned earlier.

I note, however, that full priority for this theorem (and hence also for
the characterization theorem mentioned earlier) belongs
to Mr. de Groot, and that it was only these very general results of Mr. de Groot
that prompted me to take up again the investigations of my dissertation.
At first it did seem to me as if the greater generality of
Mr. de Groot's results was due to the different nature of the question;
but then I succeeded in incorporating de Groot's extension theorem as a
characterization theorem into a generalized theory of ends.

In conclusion (in 8) we apply the concepts to the group-theoretic
question that formed the starting point of my dissertation. In my
dissertation I showed that a group space satisfying a-d can have at most
two end points. Zippin extended this theorem to group
spaces satisfying a, b', c, d; in doing so he had to modify my arguments
considerably. By a renewed, very far-reaching modification and deepening of the
method I can now prove that a group space which

1) satisfies the second axiom of countability,

2) is semicompact and

3) is connected

has at most two end points. Since $R^*$ is compact and $R^* \setminus R$ consists of
at most two points, $R$ is necessarily locally compact. From this
follows the remarkable theorem:

A group space satisfying 1-3 is automatically locally compact.

⁶ Proc. Akad. Amsterdam 44 (1941), ... = Indagationes Math. 3 (1941), ...

Notation

$\emptyset$ = empty set.

$A \cup B$ = union of $A$ and $B$.

$A \cap B$ = intersection of $A$ and $B$.

$\bigcup$ = sign for forming the union over a set of sets.
$\bigcap$ = sign for forming the intersection over a set of sets.
$a \in A$: $a$ is an element of $A$.
$a \notin A$: $a$ is not an element of $A$.

$A \setminus B$: totality of the $a \in A$, $a \notin B$.

$\overline{A}$ = closure of $A$.

$R(M)$ = boundary of $M$.

We frequently use Gothic letters for sets whose elements are
sets.

1. General topological concepts

1.1. In what follows let $R$ be a topological space, i.e. in $R$ let there be given a system of
open sets to which belong $\emptyset$, $R$, with any two sets their intersection, and with
arbitrarily many sets their union. The complements of the open sets
are called closed. A neighborhood of a set is any open set containing
it. The intersection of all closed sets containing $M$ is called
$\overline{M}$, the closure of $M$. A point belongs to the boundary of $M$, $R(M)$,
if and only if each of its neighborhoods contains points of $M$ and points
of $R \setminus M$.

1.2. We repeatedly use the following simple consequences: If two
open sets are disjoint, then each is disjoint from the closure of the
other. If $O$ is open, then $R(O) = \overline{O} \setminus O$.
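The two consequences in 1.2 can be checked by brute force on a small finite topological space. This is only an illustrative sketch: the four-point space and its topology are hypothetical choices, and the space is deliberately not Hausdorff (a finite Hausdorff space is discrete, so all boundaries would be empty), which is harmless here because 1.2 does not use the separation property.

```python
def closure(topology, space, subset):
    # Intersection of all closed sets (complements of open sets) containing subset.
    result = set(space)
    for open_set in topology:
        closed = space - open_set
        if subset <= closed:
            result &= closed
    return frozenset(result)

def boundary(topology, space, subset):
    # Points each of whose open neighborhoods meets both subset and its complement.
    points = set()
    for p in space:
        neighborhoods = [o for o in topology if p in o]
        if all(o & subset and o & (space - subset) for o in neighborhoods):
            points.add(p)
    return frozenset(points)

# Hypothetical example: a topology on {1, 2, 3, 4}, closed under unions
# and intersections.
space = frozenset({1, 2, 3, 4})
topology = [frozenset(s) for s in
            ([], [1], [1, 2], [3], [1, 3], [1, 2, 3], [1, 2, 3, 4])]
```

For instance, for the open set `{1}` the closure is `{1, 2, 4}` and the boundary is `{2, 4}`, in agreement with $R(O) = \overline{O} \setminus O$.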

1.3. In what follows we always assume the Hausdorff separation property: any
two points can be separated by disjoint neighborhoods. In particular,
every closed set is then the intersection of its neighborhoods, and
every one-point set is closed.

1.4. A basis of $R$ is a system from which all open sets can be generated by
forming unions. The second axiom of countability states:
$R$ possesses a countable basis. Such a space is also called separable.
A neighborhood basis of $M$ is a system $\mathfrak{B}$ of neighborhoods of $M$ such that for
every neighborhood $U$ of $M$ there exists a $V \in \mathfrak{B}$ with $V \subset U$.

1.5. $R$ is called regular if for every point $a$ and every neighborhood $U$ of
$a$ there exists a neighborhood $V$ of $a$ with $\overline{V} \subset U$. $R$ is called normal if the
corresponding statement holds for every closed set. Regular separable spaces
are, as is well known, normal.

1.6. A system $\mathfrak{a}$ of sets from $R$ is called finitely bound if the
intersection of any finitely many sets from $\mathfrak{a}$ is nonempty. $\mathfrak{a}$ is called bound if
all of $\mathfrak{a}$ has a nonempty intersection.

$R$ is called bicompact if every finitely bound system of closed
sets from $R$ is also bound. Bicompactness is, as is well known,
equivalent to the covering property: every covering of $R$ by open
sets contains a finite covering.

A separable bicompact space is called a compactum.





1.7. R heifit semibikompakt bzw. Semikompaktum, wenn eine Basis K von R 
existiert, derart daft jede Menge aus K eine bikompakte Menge bzw. ein Kom- 
paktum als Rand besitzt. 

1.8. A set that is open and closed at the same time is called a clopen set
(Brocken). The complement of a clopen set is again a clopen set. A space $R$
is called connected if it contains no clopen sets other than $\emptyset$ and
$R$. A component of $R$ is any maximal connected subset. A space is called
zero-dimensional if its clopen sets form a basis.
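
The definitions of 1.8 can be tried out on a small finite space. The following is only an illustrative sketch: the function names and the example topology are assumptions introduced here, not part of the paper, and the example space is chosen for brevity and does not satisfy the separation property of 1.3.

```python
def clopen_sets(points, opens):
    """Return all sets that are both open and closed in a finite space."""
    points = frozenset(points)
    opens = {frozenset(o) for o in opens}
    # A set is closed exactly when its complement is open.
    return {o for o in opens if points - o in opens}

def is_connected(points, opens):
    """Connected: the only clopen sets are the empty set and the space."""
    return clopen_sets(points, opens) == {frozenset(), frozenset(points)}

def is_zero_dimensional(points, opens):
    """Zero-dimensional: every open set is a union of clopen sets."""
    clopen = clopen_sets(points, opens)
    for o in (frozenset(x) for x in opens):
        union = frozenset().union(*[c for c in clopen if c <= o])
        if union != o:
            return False
    return True

# Hypothetical example: points 1 and 2 are glued together, point 3 is isolated.
X = {1, 2, 3}
T = [set(), {1, 2}, {3}, {1, 2, 3}]

print(sorted(sorted(c) for c in clopen_sets(X, T)))
print(is_connected(X, T), is_zero_dimensional(X, T))
```

In this example every open set is clopen, so the space is zero-dimensional but not connected; the coarser topology `[set(), {1}, {1, 2, 3}]` on the same points is connected instead.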

1.9. A compactum can have only countably many clopen sets. For the second
axiom of countability supplies a countable basis for the clopen sets; but by
the covering theorem every clopen set can already be generated from finitely
many of these basic clopen sets.

1.10. $M$ is called everywhere dense in $R$ if every open set of $R$ contains
points of $M$.

2. The generating systems

2.1. We consider topological spaces $R$ in which a (for the time being
undefined) relation between open sets is given, which we denote by $\Subset$.
The relation $\Subset$ satisfies the conditions:

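1. If $O \Subset P$, then $\overline{O} \subset P$.

2. If $O_1 \Subset P$, $O_2 \Subset P$, then $O_1 \cup O_2 \Subset P$.

3. If $O \Subset P_1$, $O \Subset P_2$, then $O \Subset P_1 \cap P_2$.

4. If $O \Subset P$, then $R \setminus \overline{P} \Subset R \setminus \overline{O}$.

5. If $O_1 \subset O_2$, $O_2 \Subset P_2$, $P_2 \subset P_1$, then $O_1 \Subset P_1$.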

A system of relations $\Subset$ that satisfies these 5 conditions is called a
$\mathfrak{D}$-system (also $\mathfrak{D}(R)$). If 2.1.5 does not necessarily
hold, it is called a $\mathfrak{D}'$-system (also $\mathfrak{D}'(R)$).

Aus 2.15 folgt, dafi jedes nichtleere 3)-System die Relationen o e 0 C R 
enthalt. 

A space equipped with a $\mathfrak{D}$-system we call a $\mathfrak{D}$-space.
Any space $R$ can be made into a $\mathfrak{D}$-space by prescribing
$O \Subset P$ if and only if $\overline{O} \subset P$ (one sees easily that
all requirements are satisfied). But one is not obliged to do so; one may also
take a proper subset of this as the $\mathfrak{D}$-system of $R$.
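
The remark that the closure relation satisfies all requirements can be checked mechanically on a finite space. The sketch below is a hedged illustration: the helper names and the example topology are assumptions made here, and the five checks follow conditions 2.1.1-5 read as $O \Subset P \Rightarrow \overline{O} \subset P$, closure under finite unions on the left and finite intersections on the right, passage to complements of closures, and monotonicity.

```python
from itertools import product

def closure(s, points, opens):
    """Closure of s: complement of the union of all open sets missing s."""
    s = frozenset(s)
    outside = frozenset().union(*[o for o in opens if not (o & s)])
    return frozenset(points) - outside

def check_axioms(points, opens):
    """Check 2.1.1-5 for the relation 'closure(O) is contained in P'."""
    points = frozenset(points)
    opens = [frozenset(o) for o in opens]
    cl = {o: closure(o, points, opens) for o in opens}
    rel = {(o, p) for o, p in product(opens, repeat=2) if cl[o] <= p}

    def ext(o):
        # R minus the closure of O; this is again an open set.
        return points - cl[o]

    ok1 = all(cl[o] <= p for o, p in rel)
    ok2 = all((o1 | o2, p) in rel
              for (o1, p) in rel for (o2, q) in rel if p == q)
    ok3 = all((o, p1 & p2) in rel
              for (o, p1) in rel for (o2, p2) in rel if o == o2)
    ok4 = all((ext(p), ext(o)) in rel for o, p in rel)
    ok5 = all((o1, p1) in rel
              for (o2, p2) in rel
              for o1 in opens if o1 <= o2
              for p1 in opens if p2 <= p1)
    return ok1, ok2, ok3, ok4, ok5

# Hypothetical example topology on three points (a chain of open sets).
X = {1, 2, 3}
T = [set(), {1}, {1, 2}, {1, 2, 3}]
print(check_axioms(X, T))
```

The input list must be a genuine topology (closed under unions and intersections); otherwise the membership tests in `rel` are not meaningful.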

A $\mathfrak{D}$- or $\mathfrak{D}'$-system of $R$ generates in every subspace
$S$ again a $\mathfrak{D}$- or $\mathfrak{D}'$-system. Namely, one sets
$O \cap S \Subset P \cap S$ in $S$ if and only if $O \Subset P$ holds in $R$.
When passing to subspaces we always take the induced system as the basis.

One can also interpret $\Subset$ intuitively as a qualitative notion of
distance; namely, one reads $O \Subset P$ as: $O$ and $R \setminus P$ have a
positive distance.

2.2. Let a $\mathfrak{D}'$-system be given in $R$. Construct the following
system $\mathfrak{D}$: the relation $O \Subset P$ is in $\mathfrak{D}$ if and
only if there is in $\mathfrak{D}'$ a relation $O_1 \Subset P_1$ with
$O \subset O_1 \Subset P_1 \subset P$. As one easily sees, the new system is
indeed a $\mathfrak{D}$-system. $\mathfrak{D}'$ is then called a basis of
$\mathfrak{D}$, and $\mathfrak{D}$ is said to be generated by $\mathfrak{D}'$.





2.3. A set $\mathfrak{g}$ of open sets of the $\mathfrak{D}$-space $R$ is
called a generator if:

1. $\mathfrak{g}$ is finitely bound.

2. For every $G \in \mathfrak{g}$ there exists a $G' \in \mathfrak{g}$ with
$G' \Subset G$.

3. For any two open sets $O$, $P$ with $O \Subset P$ and with
$O \cap G \neq \emptyset$ for all $G \in \mathfrak{g}$ there exists a
$G_0 \in \mathfrak{g}$ with $G_0 \subset P$.

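2.4. If $\mathfrak{g}$ is a generator and $G_1 \in \mathfrak{g}$,
$G_2 \in \mathfrak{g}$, then there exists $G_3 \in \mathfrak{g}$ with
$G_3 \subset G_1 \cap G_2$.
Proof: By 2.3.2 there exist $G_1', G_2' \in \mathfrak{g}$ with
$G_\nu' \Subset G_\nu$. We set $O = G_1' \cap G_2'$, $P = G_1 \cap G_2$. By
2.1.3 and 2.1.5, $O \Subset P$, and by 2.3.1, $O \cap G \neq \emptyset$ for
all $G \in \mathfrak{g}$. By 2.3.3 there thus exists a $G_3 \in \mathfrak{g}$
with $G_3 \subset P = G_1 \cap G_2$, q.e.d.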

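2.5. We define:

$\mathfrak{g} \prec O$: $G \subset O$ for some $G \in \mathfrak{g}$.

$\mathfrak{h} \prec \mathfrak{g}$: $\mathfrak{h} \prec G$ for all $G \in \mathfrak{g}$.

$\mathfrak{g}\mathfrak{h} \neq \emptyset$: $G \cap H \neq \emptyset$ for all
$G \in \mathfrak{g}$ and $H \in \mathfrak{h}$.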

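2.6. From $\mathfrak{g} \prec \mathfrak{h}$ follows
$\mathfrak{g}\mathfrak{h} \neq \emptyset$. From
$\mathfrak{g}\mathfrak{h} \neq \emptyset$ follows
$\mathfrak{g} \prec \mathfrak{h}$ (hence also $\mathfrak{h} \prec \mathfrak{g}$).

Proof: The first half is clear. Now let
$\mathfrak{g}\mathfrak{h} \neq \emptyset$. Let $H \in \mathfrak{h}$. By 2.3.2
we determine an $H' \in \mathfrak{h}$ with $H' \Subset H$. By hypothesis
$G \cap H' \neq \emptyset$ for all $G \in \mathfrak{g}$. Hence by 2.3.3 there
exists a $G \in \mathfrak{g}$ with $G \subset H$. Hence
$\mathfrak{g} \prec H$. This holds for all $H \in \mathfrak{h}$. Hence
$\mathfrak{g} \prec \mathfrak{h}$.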

2.7. If one writes $\mathfrak{g} = \mathfrak{h}$ instead of
$\mathfrak{g}\mathfrak{h} \neq \emptyset$ (or $\mathfrak{g} \prec \mathfrak{h}$),
one sees from 2.6 that the usual rules for the equality sign hold.

3. The space R*

3.1. We intend to interpret the generators of $R$ (taking into account the
equality just defined) as points of a new space $R^*$. First we define:

$O^*$ = set of all $\mathfrak{g} \prec O$. In particular $R^*$ = set of all
generators of $R$.

$O^* \Subset P^*$ if and only if $O \Subset P$.

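3.2.1. From $O \subset P$ follows $O^* \subset P^*$.

2. $(O \cap P)^* = O^* \cap P^*$.

3. $(O \cup P)^* \supset O^* \cup P^*$.

Proof: 3.2.1 is clear. 3.2.2: Let $\mathfrak{g} \in (O \cap P)^*$. Then
$\mathfrak{g} \prec O \cap P$, hence $\mathfrak{g} \prec O$,
$\mathfrak{g} \prec P$. Hence $\mathfrak{g} \in O^*$, $\mathfrak{g} \in P^*$,
hence $\mathfrak{g} \in O^* \cap P^*$. Conversely, let
$\mathfrak{g} \in O^* \cap P^*$. Then $\mathfrak{g} \in O^*$,
$\mathfrak{g} \in P^*$. Hence $\mathfrak{g} \prec O$, $\mathfrak{g} \prec P$.
Hence there exist $G_1 \in \mathfrak{g}$, $G_1 \subset O$;
$G_2 \in \mathfrak{g}$, $G_2 \subset P$; hence by 2.4 also a
$G_3 \in \mathfrak{g}$ with $G_3 \subset G_1 \cap G_2$. But then
$G_3 \subset O \cap P$, hence $\mathfrak{g} \prec O \cap P$, hence
$\mathfrak{g} \in (O \cap P)^*$, q.e.d. Now 3.2.3: Let
$\mathfrak{g} \in O^* \cup P^*$. Then $\mathfrak{g} \prec O$ or
$\mathfrak{g} \prec P$, hence $\mathfrak{g} \prec O \cup P$, hence
$\mathfrak{g} \in (O \cup P)^*$.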

3.3. $R^*$ becomes a topological space by the stipulation: the $O^*$ form a
basis of $R^*$. (Follows from 3.2.2.) Boundary formation in $R^*$ is denoted
by $\mathfrak{R}^*$.

3.4.1. The $G^*$ with $G \in \mathfrak{g}$ form a neighborhood basis of
$\mathfrak{g}$ (see 1.4). (Clear.)
2. One can generate every $\mathfrak{g}$ using exclusively such $G$ as occur
in the relations of the basis $\mathfrak{D}'$ of $\mathfrak{D}$. (One need only
replace any two sets $G, H \in \mathfrak{g}$ with $G \Subset H$ by $G_1$ and
$H_1$, where $G_1 \Subset H_1$ belongs to $\mathfrak{D}'$ and
$G \subset G_1 \Subset H_1 \subset H$ holds.) 3. The $O^*$ that occur in the
relations of $\mathfrak{D}'$ form a basis of $R^*$. (Follows from 3.4.1-2.)

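3.5. $\mathfrak{g} \in \overline{O^*}$ if and only if
$O \cap G \neq \emptyset$ for all $G \in \mathfrak{g}$.

Proof: By 1.2 and 3.4.1, $\mathfrak{g} \in \overline{O^*}$ is equivalent to:
$G^* \cap O^* \neq \emptyset$ for all $G \in \mathfrak{g}$. But by 3.4.2 this
is equivalent to: $G \cap O \neq \emptyset$ for all $G \in \mathfrak{g}$,
q.e.d.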

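3.6. From $O \Subset P$ follows $\overline{O^*} \subset P^*$.

Proof: Let $\mathfrak{g} \in \overline{O^*}$. Then by 3.5:
$G \cap O \neq \emptyset$ for all $G \in \mathfrak{g}$. Hence by 2.3.3 there
exists a $G_0 \in \mathfrak{g}$, $G_0 \subset P$. Hence
$\mathfrak{g} \prec P$, $\mathfrak{g} \in P^*$, q.e.d.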

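3.7. $(R \setminus \overline{O})^* = R^* \setminus \overline{O^*}$.

Proof: $\mathfrak{g} \in R^* \setminus \overline{O^*}$ is equivalent to
$\mathfrak{g} \notin \overline{O^*}$, or by 3.5 to the existence of a
$G \in \mathfrak{g}$ with $G \cap O = \emptyset$, and this is equivalent to
the existence of a $G \in \mathfrak{g}$ with
$G \subset R \setminus \overline{O}$, and this is equivalent to
$\mathfrak{g} \in (R \setminus \overline{O})^*$, q.e.d.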

3.8. $R^*$ is regular (hence also satisfies $\mathfrak{T}$).

Proof: Let $O^*$ be a neighborhood of $\mathfrak{g}$. By 3.4 and 3.2.1,
$G \subset O$ for some $G \in \mathfrak{g}$. By 2.3.2 there exists
$G' \in \mathfrak{g}$ with $G' \Subset G$. By 2.1.1,
$\overline{G'} \subset G$, and by 3.6: $\overline{G'^*} \subset G^*$. Hence
also $\overline{G'^*} \subset O^*$, and $G'^*$ is the neighborhood of
$\mathfrak{g}$ that guarantees regularity.


4. The bicompactness of R*

4.1. $R$ is called $\mathfrak{D}$-regular if for every point $a$ and every
neighborhood $V$ of $a$ there exists a neighborhood $U$ of $a$ with
$U \Subset V$.

$R$ is called $\mathfrak{D}$-normal if for every relation $O \Subset P$ there
exists a relation $O \Subset Q \Subset P$.

$R$ is called $\mathfrak{D}$-separable if $\mathfrak{D}(R)$ possesses a
countable basis $\mathfrak{D}'(R)$.

4.2. A $\mathfrak{D}$-regular $R$ is also regular. A $\mathfrak{D}$-regular
and $\mathfrak{D}$-separable $R$ is also separable and normal.

Proof: The first part follows from 2.1.1. Second part: Let
$\mathfrak{D}'(R)$ be the countable basis of $\mathfrak{D}(R)$. Let
$\mathfrak{P}$ be the (countable) set of all $P$ that occur in relations
$O \Subset P$ of $\mathfrak{D}'$. Let $Q$ be any open set. By
$\mathfrak{D}$-regularity there exists for each $a \in Q$ a neighborhood $U$
of $a$ with $U \Subset Q$. By the notion of basis there exists in
$\mathfrak{D}'$ a relation $O \Subset P$ with
$U \subset O \Subset P \subset Q$. Hence there certainly exists a
$P \in \mathfrak{P}$ with $a \in P \subset Q$. The union of all such $P$
(over all $a \in Q$) yields $Q$. Hence every open set $Q$ can be represented
as a union of sets of $\mathfrak{P}$. Since $\mathfrak{P}$ was countable, $R$
is separable. Normality follows from separability and regularity.

4.3. Let $\mathfrak{g}(a)$ be the set of neighborhoods of $a$.

4.4. Let $R$ be $\mathfrak{D}$-regular. Then $\mathfrak{g}$ is a topological
mapping of $R$ into $R^*$. Here $O$ is the preimage of $O^*$.

Proof: 1. $\mathfrak{g}(a)$ is a generator: 2.3.1-2 are evident because of
the $\mathfrak{D}$-normality. Let $O$ and $P$ satisfy the hypotheses of 2.3.3;
thus $O \Subset P$ and $O \cap G \neq \emptyset$ for all
$G \in \mathfrak{g}(a)$. If $a \notin P$, then by 2.1.1 also
$a \notin \overline{O}$, hence $a \in R \setminus \overline{O}$, hence
$R \setminus \overline{O} \in \mathfrak{g}(a)$, but
$(R \setminus \overline{O}) \cap O = \emptyset$, contradicting the hypothesis
on $O$. Hence necessarily $a \in P$, hence $P \in \mathfrak{g}(a)$, and $P$
can serve as the $G_0$ required in 2.3.3.

2. Let $a \neq b$. Then by $\mathfrak{T}$ there are $U \in \mathfrak{g}(a)$,
$V \in \mathfrak{g}(b)$, $U \cap V = \emptyset$. Hence
$\mathfrak{g}(a) \neq \mathfrak{g}(b)$. Hence $\mathfrak{g}$ is one-to-one.

3. We determine the preimage of $O^*$, i.e., the set of all $a$ with
$\mathfrak{g}(a) \in O^*$, or $\mathfrak{g}(a) \prec O$. But by definition
these are exactly the $a$ of $O$. Hence $O$ is the preimage of $O^*$. This
also yields the continuity of $\mathfrak{g}$. The inverse of $\mathfrak{g}$ is
continuous as well, for $\mathfrak{g}(O) = O^* \cap \mathfrak{g}(R)$ is open
in $\mathfrak{g}(R)$.

4.5. We can and shall also regard $R$ as a subset of $R^*$. We then also have
$O^* \cap R = O$.

4.6. In a $\mathfrak{D}$-regular $R$ there holds (supplement to 3.2.1): from
$O^* \subset P^*$ follows $O \subset P$. (Follows from 4.5.)

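4.7. Let $R$ be $\mathfrak{D}$-regular. $R^*$ becomes a $\mathfrak{D}$-space
by the stipulation: the relations $O^* \Subset P^*$ (with $O \Subset P$) form
a basis $\mathfrak{D}'(R^*)$ of $\mathfrak{D}(R^*)$.

Proof: It suffices to show that conditions 2.1.1-4 are satisfied for
$\mathfrak{D}'(R^*)$:

(2.1.1:) Let $O^* \Subset P^*$. By definition then $O \Subset P$, hence by
2.1.1 (for $\mathfrak{D}(R)$) $\overline{O} \subset P$, hence by 3.6:
$\overline{O^*} \subset P^*$, q.e.d.

(2.1.2:) Let $O_1^* \Subset P^*$, $O_2^* \Subset P^*$. By definition
$O_1 \Subset P$, $O_2 \Subset P$, hence by 2.1.2 (for $\mathfrak{D}(R)$)
$O_1 \cup O_2 \Subset P$. By 3.2.3,
$O_1^* \cup O_2^* \subset (O_1 \cup O_2)^*$, hence also
$O_1^* \cup O_2^* \Subset P^*$, q.e.d.

(2.1.3:) Analogously, using 3.2.2.

(2.1.4:) Let $O^* \Subset P^*$, hence $O \Subset P$. By 2.1.4 (for
$\mathfrak{D}(R)$), $R \setminus \overline{P} \Subset R \setminus \overline{O}$,
hence $(R \setminus \overline{P})^* \Subset (R \setminus \overline{O})^*$, and
by 3.7: $R^* \setminus \overline{P^*} \Subset R^* \setminus \overline{O^*}$,
q.e.d.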

4.8. Let $R$ be $\mathfrak{D}$-regular. Then $R$ can be regarded as a subspace
of $R^*$ also with respect to the $\mathfrak{D}$-system. (Clear.)

4.9. Theorem I: Let $R$ be $\mathfrak{D}$-regular. Then $R^*$ is also
$\mathfrak{D}$-regular. If $R$ is moreover $\mathfrak{D}$-separable, then
$R^*$ is also separable and $\mathfrak{D}$-separable. $R^*$ is an extension of
$R$, and $R$ is everywhere dense in $R^*$.

(Follows from 3.4.3, 3.8, 4.2 and 4.8.)

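4.10. Let $R$ be $\mathfrak{D}$-regular and $\mathfrak{D}$-normal. Let $A$ be
closed in $R^*$, $\mathfrak{g} \notin A$. Then there is a sequence $U_n^*$ of
neighborhoods of $A$ with $U_1 \Supset U_2 \Supset \cdots$, such that
$\mathfrak{g} \notin \overline{U_1^*}$.

Proof: We choose $V \in \mathfrak{g}$ with $V^* \cap A = \emptyset$ and
$W_1 \in \mathfrak{g}$ with $W_1 \Subset V$. By the $\mathfrak{D}$-normality
we determine inductively a $W_{n+1}$ with $W_n \Subset W_{n+1} \Subset V$. We
set $U_n = R \setminus \overline{W_n}$. Then 2.1.4 yields:
$U_1 \Supset U_2 \Supset \cdots \Supset R \setminus \overline{V}$. Further,
3.2.1 yields: $U_n^* \supset (R \setminus \overline{V})^* \supset A$. Hence
the $U_n^*$ are indeed neighborhoods of $A$. Because $\mathfrak{g} \prec V$,
we have $\mathfrak{g} \notin \overline{U_1^*}$. Hence the $U_n$ satisfy all
requirements.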

4.11. Theorem II: Let $R$ be $\mathfrak{D}$-regular and
$\mathfrak{D}$-normal. Then $R^*$ is bicompact. If $R$ is moreover
$\mathfrak{D}$-separable, then $R^*$ is a compactum.

Proof: Let $\mathfrak{a}$ be a finitely bound set of closed sets of $R^*$. We
prove that $\mathfrak{a}$ is bound.

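Let $A \in \mathfrak{a}$. For each $\mathfrak{g} \notin A$ we determine a
sequence $U_n^{\mathfrak{g}A}$ according to 4.10. Let $\mathfrak{u}$ be the
set of all these open sets, where $A$ runs through all of $\mathfrak{a}$ and
$\mathfrak{g}$ through all of $R^* \setminus A$. We show that $\mathfrak{u}$
possesses the properties 2.3.1-2 of a generator:

(2.3.1:) Let $U_{n_\kappa}^{\mathfrak{g}_\kappa A_\kappa}$
($\kappa = 1, \ldots, k$) be finitely many sets of $\mathfrak{u}$. There is an
$\mathfrak{h} \in \bigcap_\kappa A_\kappa$. Then also
$\mathfrak{h} \in (U_{n_\kappa}^{\mathfrak{g}_\kappa A_\kappa})^*$. Hence there
exists an $H \in \mathfrak{h}$ with
$H \subset U_{n_\kappa}^{\mathfrak{g}_\kappa A_\kappa}$
($\kappa = 1, \ldots, k$). Since $H$ is nonempty,
$\bigcap_\kappa U_{n_\kappa}^{\mathfrak{g}_\kappa A_\kappa}$ is nonempty as
well. Hence $\mathfrak{u}$ is finitely bound.

(2.3.2:) Let $U_n^{\mathfrak{g}A} \in \mathfrak{u}$. Then
$U_{n+1}^{\mathfrak{g}A} \Subset U_n^{\mathfrak{g}A}$. Hence 2.3.2 holds.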

From $\mathfrak{u}$ we form all intersections of finitely many sets at a time;
thus arises $\mathfrak{v}$. $\mathfrak{v}$ also satisfies 2.3.1 (trivially)
and 2.3.2 (for by 2.3 and 2.5, $O_\kappa \Subset P_\kappa$ implies
$\bigcap_\kappa O_\kappa \Subset \bigcap_\kappa P_\kappa$).

By contrast, 2.3.3 need not hold. Now let $O$, $P$ be a pair that violates
2.3.3 with respect to $\mathfrak{v}$. Thus in any case
$O \cap G \neq \emptyset$ for all $G \in \mathfrak{v}$. By the
$\mathfrak{D}$-normality we determine a sequence $P_n$: $P_1 = P$,
$O \Subset P_{n+1} \Subset P_n$, and adjoin the $P_n$ to $\mathfrak{v}$. Thus
arises a system $\mathfrak{u}'$. $\mathfrak{u}'$ satisfies 2.3.1 (since
$\mathfrak{v}$ satisfied it) and 2.3.2 (by the choice of the $P_n$). The
essential difference between $\mathfrak{u}'$ and $\mathfrak{u}$, however, is
that the pair $O$, $P$ no longer violates 2.3.3 with respect to
$\mathfrak{u}'$.

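Using a well-ordering, one can thus extend $\mathfrak{u}$ to a set
$\mathfrak{g}$ that satisfies 2.3.1-3. $\mathfrak{g} \in R^*$.
$\mathfrak{g} \prec$ every $U$ of $\mathfrak{u}$. Hence $\mathfrak{g} \in$
every $U^*$ with $U \in \mathfrak{u}$. Hence
$\mathfrak{g} \in \bigcap_{U \in \mathfrak{u}} U^*$. If
$\mathfrak{g} \notin \bigcap_{A \in \mathfrak{a}} A$, there would be an
$A_0 \in \mathfrak{a}$ with $\mathfrak{g} \notin A_0$ and a
$U \in \mathfrak{u}$ with $\mathfrak{g} \notin U^*$, and that would yield a
contradiction. Hence $\mathfrak{g} \in \bigcap_{A \in \mathfrak{a}} A$, and
$\mathfrak{a}$ is bound, q.e.d.

The rest of the theorem now follows from Theorem I.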

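4.12. Theorem III: In $R^*$, under the hypotheses of Theorem II: from
$\overline{O} \subset P$ follows $O \Subset P$.

Proof: For each $\mathfrak{g} \in \overline{O}$ there is an
$H \in \mathfrak{g}$ with $H^* \subset P$; further a $G \in \mathfrak{g}$ with
$G \Subset H$. Finitely many of the corresponding $G^*$ cover the bicompact
$\overline{O}$; say $G_1^*, \ldots, G_k^*$. Because
$G_\kappa \Subset H_\kappa$ and 2.1.1-2,
$G = \bigcup_\kappa G_\kappa \Subset \bigcup_\kappa H_\kappa = H$. Hence
$G \Subset H$ and $O \subset G^* \Subset H^* \subset P$, hence by 2.5 and
4.7: $O \Subset P$, q.e.d.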

5. The end points

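5.1. We now consider special systems $\mathfrak{D}$.

Let $\mathfrak{K}$ be the system of all open sets of $R$ with bicompact
boundary and $\mathfrak{Z}$ that of all open sets with empty boundary, i.e.,
the system of all clopen sets. From $O \in \mathfrak{K}$ follows
$R \setminus \overline{O} \in \mathfrak{K}$.

$\mathfrak{D}'_{\mathfrak{K}}$ is defined thus: one sets $O \Subset P$ if
$\overline{O} \subset P$ and $O \in \mathfrak{K}$, $P \in \mathfrak{K}$.

$\mathfrak{D}'_{\mathfrak{Z}}$ is defined thus: one sets $O \Subset O$ for all
$O \in \mathfrak{Z}$.

That 2.1.1-4 hold in both cases is easy to see.

$\mathfrak{D}_{\mathfrak{K}}$ and $\mathfrak{D}_{\mathfrak{Z}}$ are the systems
generated by $\mathfrak{D}'_{\mathfrak{K}}$ and $\mathfrak{D}'_{\mathfrak{Z}}$
respectively.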

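5.2. Let $R$ be semibicompact. Let $O \in \mathfrak{K}$, $P$ open,
$\overline{O} \subset P$. Then there is a $P_1 \in \mathfrak{K}$ with
$\overline{O} \subset P_1 \subset \overline{P_1} \subset P$.

Proof: For each point of $\mathfrak{R}(O)$ there is a
$\mathfrak{K}$-neighborhood $U$ with $\overline{U} \subset P$. Finitely many,
$U_1, \ldots, U_k$, cover $\mathfrak{R}(O)$.
$P_1 = O \cup U_1 \cup \cdots \cup U_k$ satisfies the requirement.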

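5.3. Let $R$ be semibicompact. $O \Subset P$ according to
$\mathfrak{D}_{\mathfrak{K}}$ if and only if there exists an
$O_1 \in \mathfrak{K}$ with
$O \subset O_1 \subset \overline{O_1} \subset P$.

Proof: Only if: Since $\mathfrak{D}'_{\mathfrak{K}}$ is a basis of
$\mathfrak{D}_{\mathfrak{K}}$, there are $O_1, P_1 \in \mathfrak{K}$ with
$O \subset O_1 \Subset P_1 \subset P$, hence also
$O \subset O_1 \subset \overline{O_1} \subset P$. If: By 5.2 there exists
$P_1 \in \mathfrak{K}$ with
$\overline{O_1} \subset P_1 \subset \overline{P_1} \subset P$. Hence
$O \subset O_1 \Subset P_1 \subset P$, hence $O \Subset P$ by 2.1.5.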

5.4. A semibicompact $R$ is $\mathfrak{D}_{\mathfrak{K}}$-normal.

Proof: It suffices, for $O, P \in \mathfrak{K}$, to derive from $O \Subset P$
the existence of a relation $O \Subset Q \Subset P$. But this results from 5.2.

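5.5. A semibicompact $R$ is also $\mathfrak{D}_{\mathfrak{K}}$-regular.

Proof: Let $U$ be a neighborhood of $a$. There is a neighborhood $V$ of $a$,
$V \in K$, $V \subset U$. We have
$\overline{R \setminus \overline{V}} \subset R \setminus \{a\}$. Hence by 5.2
there is a $P_1 \in \mathfrak{K}$ with
$\overline{R \setminus \overline{V}} \subset P_1 \subset \overline{P_1} \subset R \setminus \{a\}$.
Then $W = R \setminus \overline{P_1}$ is a neighborhood of $a$,
$W \in \mathfrak{K}$ and $W \Subset U$, q.e.d.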

5.6. We now reserve the notation $R^*$ for the space generated from $R$ by
means of $\mathfrak{D}_{\mathfrak{K}}$ (in the case that $R$ is
semibicompact). We also call $R^*$ the closure of $R$ by its end points.
$R^* \setminus R$ is called the set of end points.

It is clear that in generating $R^*$ one may impose on the $\mathfrak{g}$ the
additional requirement $\mathfrak{g} \subset \mathfrak{K}$. We shall mostly do
so tacitly.

For $\mathfrak{D}_{\mathfrak{Z}}$ the hypotheses of Theorems I-II are no longer
satisfied (no $\mathfrak{D}$-regularity!). Nevertheless much is preserved. We
call the space generated from $R$ by means of $\mathfrak{D}_{\mathfrak{Z}}$
$Z(R)$, the component space of $R$. Indeed, in simple cases the components of
$R$ are elements of $Z(R)$; however, the example of the introduction shows that
this does not always hold: the components $D_n$ form a single point of $Z(R)$.

For the $\mathfrak{D}_{\mathfrak{Z}}$-generators the requirements 2.3.1-3 are
particularly easy to fulfill; they reduce to the conditions that
$\mathfrak{g}$ consists of clopen sets and that $\mathfrak{g}$ is finitely
bound and maximal.

5.7. Let the mapping $f(O)$ assign to each set the smallest clopen set
containing it. For $\mathfrak{g} \in R^*$ we set $f(\mathfrak{g})$ = set of
the $f(O)$ with $O \in \mathfrak{g}$. One shows easily that $R^*$ is mapped
continuously onto $Z(R)$ by $f$.

From this follows: under the hypotheses of Theorem II, $Z(R)$ is bicompact.

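5.8. Let $P \in \mathfrak{K}$. Then $\mathfrak{R}^*(P^*) = \mathfrak{R}(P)$.

Proof: By 3.5, $\mathfrak{R}(P) \subset \mathfrak{R}^*(P^*)$ is evident. We
therefore only need, for a $\mathfrak{g} \in \mathfrak{R}^*(P^*)$, to exhibit
an $a \in \mathfrak{R}(P)$ with $\mathfrak{g}(a) = \mathfrak{g}$. We have
$\mathfrak{g} \nprec P \cup (R \setminus \overline{P})$, hence for all
$G \in \mathfrak{g}$:

\[ G \not\subset P \cup (R \setminus \overline{P}) \]

or

\[ G \cap \mathfrak{R}(P) \neq \emptyset. \tag{1} \]

We show that the set of all

\[ \overline{G} \cap \mathfrak{R}(P) \quad \text{with } G \in \mathfrak{g} \tag{2} \]

is finitely bound: by 2.4 one has for $G_\kappa \in \mathfrak{g}$
($\kappa = 1, \ldots, k$) a $G \in \mathfrak{g}$ with
$G \subset \bigcap_{\kappa=1}^{k} G_\kappa$; hence

\[ \bigcap_{\kappa=1}^{k} \bigl(\overline{G_\kappa} \cap \mathfrak{R}(P)\bigr)
   = \Bigl(\bigcap_{\kappa=1}^{k} \overline{G_\kappa}\Bigr) \cap \mathfrak{R}(P)
   \supset G \cap \mathfrak{R}(P) \neq \emptyset \]

because of (1). The set (2) is thus finitely bound, hence, by the
bicompactness of $\mathfrak{R}(P)$, bound. Let

\[ a \in \bigcap_{G \in \mathfrak{g}} \bigl(\overline{G} \cap \mathfrak{R}(P)\bigr). \]

Then first $a \in \mathfrak{R}(P)$ and second $a \in \overline{G}$ for all
$G \in \mathfrak{g}$, hence also $a \in G$ for all $G \in \mathfrak{g}$, hence
$\mathfrak{g} = \mathfrak{g}(a)$, q.e.d.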

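5.9. Let $R$ be semibicompact. Then $R^* \setminus R$ is zero-dimensional.

Proof: Let $\mathfrak{g} \in R^* \setminus R$ and let $O^*$ be a neighborhood
of $\mathfrak{g}$. There is a $P \in \mathfrak{K}$, $P \in \mathfrak{g}$,
$P^* \subset O^*$. Application of 5.8 shows: in $R^* \setminus R$ the
neighborhood $P^* \cap (R^* \setminus R)$ of $\mathfrak{g}$ has no boundary
point, and is thus a clopen set rel. $R^* \setminus R$ contained in $O^*$.
Hence $R^* \setminus R$ is zero-dimensional.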

5.10. Theorem IV: Let $R$ be semibicompact. Then $R^*$, the closure of $R$ by
its end points, is bicompact, and so is $Z(R)$, the component space of $R$.
$R$ is everywhere dense in $R^*$. The set of end points $R^* \setminus R$ is
zero-dimensional. $Z(R)$ is zero-dimensional.

Proof: The bicompactness of $R^*$ follows from Theorem II (by 5.4-5 the
hypotheses are satisfied). That of $Z(R)$ follows from 5.7. The dimension of
$R^* \setminus R$ follows from 5.9; that of $Z(R)$ results trivially.

6. The compactness of R*

From now on we always assume that $R$ satisfies the second axiom of
countability and is semicompact.

6.1. Theorem V: $Z(R)$ is a compactum if and only if in $R$ every descending
sequence of nonempty clopen sets has a nonempty intersection.

Proof: If: By separability there exists a countable basis
$\mathfrak{U} \subset \mathfrak{Z}$ of $\mathfrak{Z}$ (the set of clopen
sets). From the sets of $\mathfrak{U}$ form the unions of finitely many at a
time; thus arises the system $\mathfrak{Q}$. Let $O$ be a clopen set.
$O = \bigcup_{\nu=1}^{\infty} U_\nu$ with certain $U_\nu \in \mathfrak{U}$.
The sets $O \setminus \bigcup_{\nu=1}^{n} U_\nu$ form a descending sequence of
clopen sets with empty intersection. By hypothesis one of them is empty, hence
$O$ is the union of finitely many $U_\nu$, hence $O \in \mathfrak{Q}$. Hence
$\mathfrak{D}_{\mathfrak{Z}}$ possesses a countable basis, which consists of
all relations $Q \Subset Q$ with $Q \in \mathfrak{Q}$. Hence $R$ is
$\mathfrak{D}_{\mathfrak{Z}}$-separable, hence $Z(R)$ is separable by 3.4.3
and thus, by Theorem IV, a compactum.

Only if: If there were a descending sequence
$M_1 \supset M_2 \supset \cdots$ of nonempty clopen sets with empty
intersection in $R$, then the $N_\nu = M_\nu \setminus M_{\nu+1}$ would be
mutually disjoint clopen sets, and their union would be the clopen set $M_1$.
Let $\varphi$ be a set of natural numbers and $\psi$ its complement. Then the
sets

\[ \bigcup_{\nu \in \varphi} N_\nu \quad \text{and} \quad \bigcup_{\nu \in \psi} N_\nu \]

are complements of each other in $M_1$ and, as unions of open sets, open;
hence both are clopen. Hence

\[ \Bigl(\bigcup_{\nu \in \varphi} N_\nu\Bigr)^* \]

is a clopen set in $Z(R)$ for every $\varphi$; there are thus continuum many
clopen sets in $Z(R)$, and by 1.9 that contradicts the separability of $Z(R)$.
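
The counting step at the end of this argument can be spelled out as follows. This is only a sketch in the notation of 6.1: the label $B_\varphi$ is introduced here for illustration, and it is assumed (after passing to a subsequence if necessary) that the sequence $M_\nu$ is strictly decreasing, so that every $N_\nu$ is nonempty.

```latex
% B_\varphi is a label introduced for this sketch; it does not occur in the paper.
\[
  B_\varphi \;=\; \bigcup_{\nu \in \varphi} N_\nu ,
  \qquad
  M_1 \setminus B_\varphi \;=\; \bigcup_{\nu \in \psi} N_\nu ,
  \qquad
  \psi = \mathbb{N} \setminus \varphi .
\]
% Distinct index sets give distinct unions, because the N_\nu are
% nonempty and pairwise disjoint.
\[
  \varphi \neq \varphi' \;\Longrightarrow\; B_\varphi \neq B_{\varphi'} ,
\]
% Hence there are as many such clopen sets as subsets of the natural numbers.
\[
  \#\{\, B_\varphi : \varphi \subset \mathbb{N} \,\} \;=\; 2^{\aleph_0} \;>\; \aleph_0 .
\]
```

Since 1.9 allows a compactum only countably many clopen sets, this is the contradiction used in the proof.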

6.2. Let $A$ be closed in $R$ and $\mathfrak{R}(A)$ compact. Let $Z(R)$ be a
compactum. Then $Z(A)$ is also a compactum.

Proof: We apply 6.1: Let $M_1 \supset M_2 \supset \cdots$ be a sequence of
nonempty clopen sets rel. $A$. We show that their intersection is nonempty:
If all $M_\nu \cap \mathfrak{R}(A) \neq \emptyset$, then so is their
intersection, by the compactness of $\mathfrak{R}(A)$, and then we are done.
So let $M_k \cap \mathfrak{R}(A) = \emptyset$. Now let $n \geq k$. $M_n$ is
open in $A$, hence now also open in the interior
$A \setminus \mathfrak{R}(A)$ of $A$, hence also open in $R$. On the other
hand, $M_n$ was closed in $A$ and $A$ closed in $R$, hence $M_n$ is also
closed in $R$. The $M_n$ ($n \geq k$) are thus also clopen sets in $R$, and
by hypothesis and 6.1 their intersection is nonempty, q.e.d.

6.3. Theorem VI: Let $R$ be separable and semicompact. The closure $R^*$ of
$R$ by its end points is a compactum if and only if the component space
$Z(R)$ is a compactum.

Only if follows from 5.7, since $Z(R)$, as a continuous image of $R^*$, must
necessarily also be a compactum.

To prove if, we shall construct a countable basis
$\mathfrak{D}''_{\mathfrak{K}}$ for $\mathfrak{D}_{\mathfrak{K}}$. By Theorem
I, $R^*$ then indeed becomes separable. Our task is therefore the following:
we construct a countable system $\mathfrak{K}' \subset \mathfrak{K}$ such that
for any two sets $O, P \in \mathfrak{K}$ with $O \Subset P$ there exists a set
$Q \in \mathfrak{K}'$ with $O \Subset Q \Subset P$. If this succeeds, we are
indeed done; for $\mathfrak{D}'_{\mathfrak{K}}$ was a basis of
$\mathfrak{D}_{\mathfrak{K}}$, and if one now has a relation $O \Subset P$ of
$\mathfrak{D}'_{\mathfrak{K}}$, then by definition $O, P \in \mathfrak{K}$;
one can therefore (assuming the solution of the "task") construct
$O_1, P_1 \in \mathfrak{K}'$ with $O \subset O_1 \Subset P_1 \subset P$.
Accordingly the relations $O_1 \Subset P_1$ with
$O_1, P_1 \in \mathfrak{K}'$ form a countable basis
$\mathfrak{D}''_{\mathfrak{K}}$ of $\mathfrak{D}_{\mathfrak{K}}$.

The task is solved in 6.4-5. It is noteworthy that the compactness of the
component space $Z(R)$ is an essential condition for its solvability.

6.4.1. Let $O_1, O_2, \ldots$ form a basis of $R$, $O_n \in \mathfrak{K}$.

6.4.2. Let $\mathfrak{A}_n$ be the set of all $\overline{O_\nu}$ and
$R \setminus O_\nu$ with $\nu \leq n$.

6.4.3. Let $\mathfrak{B}_n$ be the set of all intersections formed from sets
of $\mathfrak{A}_n$.

6.4.4. Let $\mathfrak{Z}_n$ be the set of all clopen sets of all
$B \in \mathfrak{B}_n$. By 6.2 and the hypothesis on $Z(R)$, $Z(B)$ is also
compact, hence the set of clopen sets of each $B$ is countable by 1.9, hence
$\mathfrak{Z}_n$ is countable.

6.4.5. Let $\mathfrak{T}_n$ consist of all unions of finitely many sets of
$\mathfrak{Z}_n$.

6.4.6. Let $\mathfrak{K}'_n$ consist of the open kernels (interiors) of all
sets of $\mathfrak{T}_n$.

6.4.7. $\mathfrak{K}'$ is the union of all $\mathfrak{K}'_n$.

One sees at once that $\mathfrak{K}' \subset \mathfrak{K}$.

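6.5. Let $O, P \in \mathfrak{K}$ with $O \Subset P$. In accordance with the
"task" we show the existence of a $Q \in \mathfrak{K}'$ with
$O \Subset Q \Subset P$.

6.5.1. By the compactness of $\mathfrak{R}(O)$ there are among the
$O_1, O_2, \ldots$ of the basis finitely many,
$O_{\nu_1}, \ldots, O_{\nu_t}$, which cover $\mathfrak{R}(O)$ in such a way
that $\overline{O_{\nu_\tau}} \subset P$. We set
$r = \max(\nu_1, \ldots, \nu_t)$.

6.5.2. Let $\mathfrak{B}'_r$ be the totality of the $B \in \mathfrak{B}_r$
with $B \cap \overline{O} \neq \emptyset$. $\mathfrak{B}'_r$ is a covering of
$\overline{O}$.

6.5.3. We decompose $\mathfrak{B}'_r$ into $\mathfrak{B}''_r$ and
$\mathfrak{B}'''_r$, such that $B \in \mathfrak{B}''_r$ if and only if
$B \cap \mathfrak{R}(O) \neq \emptyset$.

6.5.4. If $B \in \mathfrak{B}''_r$, then on the one hand, by (6.4.2-3), $B$
is for each $\nu \leq r$ a subset of an $A \in \mathfrak{A}_r$, hence a subset
of $\overline{O_{\nu_\tau}}$ or of $R \setminus O_{\nu_\tau}$ for each $\tau$.
On the other hand $B \cap \mathfrak{R}(O) \neq \emptyset$, and by 6.5.1 also
$B \cap O_{\nu_\tau} \neq \emptyset$ for some $\tau$. Together this gives:
$B \subset \overline{O_{\nu_\tau}} \subset P$. Hence all
$B \in \mathfrak{B}''_r$ lie in $P$.

6.5.5. Now let $B \in \mathfrak{B}'''_r$. Then
$B \cap \mathfrak{R}(O) = \emptyset$. Hence $B \cap O$ is a clopen set in
$B$, hence $B \cap O \in \mathfrak{Z}_r$. We set

\[ T = \bigcup_{B \in \mathfrak{B}''_r} B \;\cup \bigcup_{B \in \mathfrak{B}'''_r} (B \cap O). \]

Then $T \in \mathfrak{T}_r$ (see 6.4.5); $T \subset P$ (see 6.5.4);
$O \subset T$, and indeed, by 6.5.1, every boundary point of $O$ is moreover
an interior point of $T$.

6.5.6. Let $Q$ be the open kernel of $T$. Then by 6.4.6:
$Q \in \mathfrak{K}'_r \subset \mathfrak{K}'$. Further, by 6.5.5:
$O \Subset Q \Subset P$.

Hence $\mathfrak{K}'$ indeed solves the task.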

7. The characterization theorem

7.1. Let $S$ be a compactum. Let $R$ satisfy the conditions of Theorem VI,
let $R$ be everywhere dense in $S$, and let $S \setminus R$ be
zero-dimensional. Then there exists a continuous mapping $f(R^*) = S$ which is
the identity on $R$.

Proof: The closure of a set taken in $S$ will be indicated by the exponent
$S$.

Let $\mathfrak{g} \in R^*$. Let the set of all $G^S$ with
$G \in \mathfrak{g}$ be called $\mathfrak{g}^S$. Since $\mathfrak{g}^S$ is
finitely bound, its intersection is nonempty; let it contain the point
$a \in S$.

Let $V$ and $W$ be neighborhoods of $a$ in $S$; $V \subset W$;
$\mathfrak{R}(V) \subset R$; $\mathfrak{R}(W) \subset R$ (because of the
zero-dimensionality of $S \setminus R$ one can find arbitrarily small
neighborhoods of this kind). For $G \in \mathfrak{g}$ we have $a \in G^S$,
hence $G^S \cap V \neq \emptyset$, hence, since $V$ is open,
$G \cap V \neq \emptyset$. By 2.3.3 there exists $G_0 \in \mathfrak{g}$ with
$G_0 \subset W$. Then also $G_0^S \subset W^S$. The intersection of all sets
of $\mathfrak{g}^S$ is accordingly contained in every neighborhood $W$ of $a$
and therefore consists only of the point $a$, which we also call
$f(\mathfrak{g})$. By the preceding, $f$ maps every
$\mathfrak{g} \prec G_0$ into $W^S$, hence $f(G_0^*) \subset W^S$ (where
$G_0$ depends on $W$, and $W$ can be taken arbitrarily small). Hence the
mapping $f$ is continuous. $f(p) = p$ for $p \in R$ is evident.

7.2. Property $\beta'$ reads: if $U$ is a neighborhood of
$a \in S \setminus R$ in $S$, then it is impossible to decompose $U \cap R$
into two disjoint sets, open in $R$, which both have $a$ as an accumulation
point (in $S$).

7.3. Let, besides the conditions of 7.1, also $\beta'$ be satisfied. Then the
mapping $f$ of 7.1 is topological.

Proof: We need only show that $f$ is one-to-one. Let $a \in S \setminus R$
and let $A$ be the $f$-preimage of $a$. $A$ is closed and zero-dimensional.
Suppose $A$ consists of more than one point (we lead this assumption to a
contradiction).

If the neighborhood $V$ (in $S$) of $a$ is small enough, $f^{-1}(V)$ splits
into two disjoint open sets $C$ and $D$. If $V'$ is a neighborhood of $a$ with
$\overline{V'} \subset V$, then $f^{-1}(V')$ splits into two disjoint open
sets $C'$ and $D'$, whose closures are still disjoint from each other and
which both contain points of $A$. The decomposition $C' \cap R$, $D' \cap R$
of $V' \cap R$ contradicts $\beta'$, q.e.d.

7.4. $S = R^*$ possesses property $\beta'$.

Proof: Let $Q$ be a neighborhood of $\mathfrak{g}$ in $R^*$ that violates
$\beta'$, and $O$, $P$ a decomposition of $Q \cap R$ violating $\beta'$. Let
$Q'$ be a neighborhood of $\mathfrak{g}$ with $Q' \Subset Q$. Then
$Q' \cap R$ is decomposed into $O' = Q' \cap O$ and $P' = Q' \cap P$ in such
a way that $\overline{O'} \subset O$, $\overline{P'} \subset P$ (closure
formed in $R$). $\mathfrak{g}$ is an accumulation point of $O'$ and $P'$ in
$R^*$. Hence for every $G \in \mathfrak{g}$: $G \cap O' \neq \emptyset$,
$G \cap P' \neq \emptyset$. By 2.3.3 there are $G_1$ and $G_2$ in
$\mathfrak{g}$ with $G_1 \subset O$, $G_2 \subset P$. From
$O \cap P = \emptyset$ follows $G_1 \cap G_2 = \emptyset$, in contradiction
to 2.3.1. There is therefore no violation of $\beta'$.

7.5. Let $S$ be a compactum. Let $R$ be everywhere dense in $S$, let
$S \setminus R$ be zero-dimensional, and let $\beta'$ hold. Then $R$ is a
semicompactum and $Z(R)$ a compactum.

Proof: Let $a \in R$. The union of $a$ and $S \setminus R$ is
zero-dimensional.7 There are therefore arbitrarily small neighborhoods of $a$
whose boundary is disjoint from $S \setminus R$, hence compact in $R$.

To prove the compactness of $Z(R)$ we use the criterion 6.1. Let
$M_1 \supset M_2 \supset \cdots$ be a sequence of distinct clopen sets of
$R$. We choose $a_n \in M_n \setminus M_{n+1}$. From the $a_n$ we extract a
subsequence $a_{\nu_n}$, convergent in $S$, with limit $a$. We have

\[ a_{\nu_n} \in M_{\nu_n} \setminus M_{\nu_n + 1} \subset M_{\nu_n} \setminus M_{\nu_{n+1}}. \]

We set $a_n' = a_{\nu_n}$, $M_{\nu_n} = M_n'$. Instead of
$\bigcap M_n \neq \emptyset$ we can equally well prove
$\bigcap M_n' \neq \emptyset$. If that were false, $a$ could not be in $R$.
By 5.8, $a$ would then be an interior point of $M_1'^S$, and $U = M_1'^S$
would be a neighborhood of $a$. Then

\[ V = \bigcup_{n=1}^{\infty} (M_{2n} \setminus M_{2n+1}), \qquad
   W = \bigcup_{n=1}^{\infty} (M_{2n-1} \setminus M_{2n}) \]

would yield a decomposition of $U \cap R$ contradicting $\beta'$.

7 See e.g. K. Menger, Dimensionstheorie, Leipzig 1928, p. 115.

7.6. Theorem VII: Let $S$ be a compactum. Let $R$ be everywhere dense in $S$.
$S$ is essentially the compactification of $R$ by its end points if and only
if:

$\alpha'$) $S \setminus R$ is zero-dimensional.

$\beta'$) The neighborhoods of a point $p \in S \setminus R$ are not
decomposed by $S \setminus R$ into two sets, open in $R$, which both have the
accumulation point $p$ in $S$.

Proof: If follows from 7.3, where 7.5 justifies our having omitted, among the
hypotheses on $R$, the semicompactness of $R$ and the compactness of $Z(R)$.
Only if follows from Theorem VI and from 7.4.

8. The end points of groups

8.1. In what follows let $R$ satisfy the second axiom of countability and be
semicompact, connected, and a topological group (the last requirement means
that $R$ is a group in which $a \cdot b$ is a continuous function of $a$ and
$b$ and $a^{-1}$ is a continuous function of $a$); let the identity be called
$e$.

8.2. The left (or right) multiplications by a fixed element $a$ are
topological mappings of $R$ onto itself; by Theorem VII they can be extended
topologically up to the end points. $a\mathfrak{g}$ is thus meaningful as
$\lim a c_\nu$ when $\lim c_\nu = \mathfrak{g}$; and indeed $a\mathfrak{g}$
is again an end point.

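8.3. Let $O \in \mathfrak{K}$, $P \in \mathfrak{K}$,
$\overline{O} \subset P$. Since $\mathfrak{R}(O)$ is compact and lies in $P$,
there is a neighborhood $U$ of $e$ with

\[ \mathfrak{R}(aO) = a\,\mathfrak{R}(O) \subset P \quad \text{for all } a \in U. \tag{1} \]

Let

\[ N = aO \cap (R \setminus P); \]

\[ \mathfrak{R}(N) \subset \mathfrak{R}(aO) \cup \mathfrak{R}(R \setminus P), \]

hence by (1)

\[ \mathfrak{R}(N) \subset \mathfrak{R}(P). \tag{2} \]

Let

\[ N_\nu = a_\nu O \cap (R \setminus P), \tag{3} \]

\[ \lim a_\nu = e. \tag{4} \]

If all $N_\nu$ were nonempty, then by the connectedness of $R$ all
$\mathfrak{R}(N_\nu)$ would also be nonempty, and there would be

\[ c_\nu \in \mathfrak{R}(N_\nu) \subset \mathfrak{R}(P) \quad \text{(by (2))}. \tag{5} \]

Then by (3) $c_\nu \in \overline{a_\nu O}$, hence

\[ c_\nu = a_\nu b_\nu, \quad \text{with } b_\nu \in \overline{O}. \tag{6} \]

By (5) the $c_\nu$ have an accumulation point $c$ in $\mathfrak{R}(P)$; hence
by (4) $c$ is also an accumulation point of the $b_\nu$ and by (6) lies in
$\overline{O}$, and that contradicts the hypothesis
$\overline{O} \subset P$.

Hence the assumption is false, and in every sequence $a_\nu$ with
$\lim a_\nu = e$ there are infinitely many $\nu$ with empty $N_\nu$, that is,
with $a_\nu O \subset P$. Hence there is also a neighborhood $V$ of $e$ with

\[ aO \subset P \quad \text{for all } a \in V. \]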

Let the end point $\mathfrak{g}$ be defined by the sequence $G_n$ with
$G_{n+1} \Subset G_n$.8 Then by the preceding there are neighborhoods $V_n$
of $e$ with $aG_{n+1} \subset G_n$ for all $a \in V_n$. Hence
$a\mathfrak{g} \prec G_n$ for $a \in V_n$.

Let $a_\nu$ be an $e$-sequence. For each $n$ there is a $k_n$ with
$a_\nu \in V_n$ for $\nu \geq k_n$. Hence $a_\nu\mathfrak{g} \prec G_n$ for
$\nu \geq k_n$. From this follows

\[ \lim (a_\nu \mathfrak{g}) = (\lim a_\nu)\,\mathfrak{g} \]

(first for $e$-sequences and then in general for convergent sequences).

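8.4. The right multiplication by $\mathfrak{g}$ thus maps $R$ continuously
into $R^* \setminus R$. Since $R$ is connected but $R^* \setminus R$ is
zero-dimensional, the image set must be a single point, and since
$e\mathfrak{g} = \mathfrak{g}$ this point is $\mathfrak{g}$. Hence

\[ a\mathfrak{g} = \mathfrak{g}. \]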

8.5. Therefore, for $\lim c_\nu = \mathfrak{g}$ and $G \in \mathfrak{g}$,
also $ac_\nu \in G$ for almost all $\nu$. But then for every compact
$M \subset R$ also

\[ Mc_\nu \subset G \quad \text{for almost all } \nu. \]
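
A familiar instance may help to picture 8.5. The sketch below uses the additive group of real numbers, with $+\infty$ as one of its two end points; this concrete group, the interval, and all names in the code are assumptions chosen here for illustration and do not come from the paper.

```python
def translate_interval(interval, c):
    """Group translation of a closed interval [lo, hi] by c (here: addition)."""
    lo, hi = interval
    return (lo + c, hi + c)

def inside_ray(interval, t):
    """True if the closed interval lies in the open ray (t, +inf)."""
    return interval[0] > t

M = (-2.0, 5.0)                         # a compact set M in the group
t = 100.0                               # G = (t, +inf), a neighborhood of the end +inf
cs = [10.0 * n for n in range(1, 21)]   # a finite stretch of a sequence c_n -> +inf

# Indices from which on the translate M + c_n lies in G.
hits = [c for c in cs if inside_ray(translate_interval(M, c), t)]
print(hits[0], len(hits))
```

Only finitely many translates at the start of the sequence miss the ray; from some index on every translate of `M` lies in `G`, which is the "almost all" of the statement.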

8.6. We summarize the results in

Theorem VIII: The left (or right) multiplications by elements of $R$ are
topological mappings up to and including the end points. The end points are
fixed points of these mappings. Every compact set of $R$ can be moved by right
(and left) multiplication into any neighborhood of any end point.

We now prove

Theorem IX: A connected, semicompact group satisfying the second axiom of
countability has at most two end points, and is therefore locally compact.

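8.7. Proof: Let $\mathfrak{f}$, $\mathfrak{g}$, $\mathfrak{h}$ be three
distinct end points (we lead this assumption to a contradiction).

We take $F \in \mathfrak{f}$, $G \in \mathfrak{g}$, $G' \in \mathfrak{g}$,
$H \in \mathfrak{h}$, such that $F$, $G$, $H$ are pairwise disjoint and
$G' \Subset G$. By Theorem VIII there exists $c$ with

\[ \mathfrak{R}(Fc) = \mathfrak{R}(F)c \subset G'. \]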

8 In what follows we always assume that the open sets generating an end point
belong to $\mathfrak{K}$.





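If we set

\[ N = Fc \cap (R \setminus G'), \]

then (as in 8.3(2))

\[ \mathfrak{R}(N) \subset \mathfrak{R}(G'), \]

hence $N$ is a clopen set in $R \setminus G'$; $\mathfrak{f} \prec N$,
$\mathfrak{h} \nprec N$ because $\mathfrak{f} \prec Fc$,
$\mathfrak{f} \prec R \setminus \overline{G}$, $\mathfrak{h} \nprec Fc$
(invariance of $\mathfrak{f}$ and $\mathfrak{h}$ under multiplication by $c$,
according to Theorem VIII). Forming $O = N \cap (R \setminus G)$, one sees:
there is a clopen set $O$ in $R \setminus G$ with $\mathfrak{f} \prec O$,
$\mathfrak{h} \nprec O$. Analogously one finds a clopen set $P$ in
$R \setminus G$ with $\mathfrak{h} \prec P$, $\mathfrak{f} \nprec P$. One may
assume $O \cap P = \emptyset$ (if necessary, remove $O \cap P$ from both).

Let the intersection of all clopen sets of $R \setminus G$ that contain
$\mathfrak{f}$, respectively $\mathfrak{h}$, be called $A_G$, respectively
$B_G$. Since $R$ is connected, $Z(R)$ is compact, hence $Z(R \setminus G)$ is
also compact (see 6.2), hence by 6.1

\[ A_G \neq \emptyset, \qquad B_G \neq \emptyset, \]

further

\[ A_G \cap B_G = \emptyset. \tag{1} \]

\[ \text{For } G' \subset G: \quad A_{G'} \supset A_G, \quad B_{G'} \supset B_G. \tag{2} \]

Let $G_1 \supset G_2 \supset \cdots$ be a sequence defining $\mathfrak{g}$;
we set

\[ A = \bigcup_{n=1}^{\infty} A_{G_n}, \qquad B = \bigcup_{n=1}^{\infty} B_{G_n}. \]

By (1) and (2),

\[ A \cap B = \emptyset. \tag{3} \]

From (2) one concludes at once that $A$ and $B$ are independent of the choice
of the sequence $G_n \in \mathfrak{g}$, and from this follows

\[ cA_G = A_{cG}, \qquad cB_G = B_{cG}, \]

hence

\[ cA = A, \qquad cB = B. \]

But for

\[ c = ba^{-1} \]

(with $a \in A$, $b \in B$) this yields a contradiction to (3), q.e.d.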

8.8. Theorem X: If a connected, semicompact group satisfying the second axiom
of countability has two end points, then the end points are inverse to each
other (i.e., if $\lim c_\nu$ is one end point, then $\lim c_\nu^{-1}$ is the
other). Every closed subgroup that is not compact then also has two end
points.





Proof: Let $\mathfrak{f}$ and $\mathfrak{g}$ be the end points. Let
$\mathfrak{g}$ be defined by the sequence $G_1 \supset G_2 \supset \cdots$,
$\mathfrak{f} \nprec G_1$. Let $A_G$ be defined as above (8.7) as the
intersection of the clopen sets containing $\mathfrak{f}$.
$A = \bigcup_{n=1}^{\infty} A_{G_n}$ is again independent of the choice of
the $G_n$, hence $cA = A$ for all $c$, hence $A = R$, hence $e \in A$.

We choose $k$ so that

\[ e \in A_{G_k}. \tag{1} \]

Then

\[ e \in R \setminus G_k. \tag{2} \]

By Theorem VIII there is, for the compact $\mathfrak{R}(G_1)$, a
$c_n \in G_n$ with
$\mathfrak{R}(c_n G_1) = c_n \mathfrak{R}(G_1) \subset G_k$. Then the
boundary of

\[ N_n = c_n G_1 \cap (R \setminus G_k) \tag{3} \]

lies entirely in $\mathfrak{R}(G_k)$, hence $N_n$ is a clopen set in
$R \setminus G_k$; because $\mathfrak{f} \nprec G_1$ and because of the
invariance of the end points, $\mathfrak{f} \nprec c_n G_1$,
$\mathfrak{f} \nprec N_n$, hence $N_n$ must be disjoint from $A_{G_k}$, hence
by (1)

\[ e \notin N_n. \]

From this together with (2) and (3) follows

\[ e \notin c_n G_1 \]

or

\[ c_n^{-1} \notin G_1. \tag{4} \]

The passage to the inverse is a topological self-mapping of $R$, which by
Theorem VIII can be extended topologically up to the end points.
$\lim c_n^{-1}$ therefore exists and is an end point, and by (4) it cannot be
the end point $\mathfrak{g}$. From this the same statement follows for every
sequence $c_n$ with $\lim c_n = \mathfrak{g}$, hence the first half of the
theorem.

If $Q$ is a closed subgroup and the sequences $G_n$ and $G_n^{-1}$ generate
the end points of $R$, then the sequences $Q \cap G_n$ and
$Q \cap G_n^{-1}$ are either both empty (and then $Q$ is compact) or both
nonempty (and then $Q$, just like $R$, has two end points), and with this the
second half of the theorem is proved.


Amsterdam. 



Annals of Mathematics
Vol. 43, No. 3, April, 1942 


ON THE CONTINUATION OF A RIEMANN SURFACE 

By Maurice H. Heins 
(Received November 14, 1941) 

1. Introduction. 

Let F denote a Riemann surface in the sense of Weyl-Radó [15, 17]. If there
exists another Riemann surface G such that F admits a (1, 1) directly conformal
map onto a proper part, $F_1$, of G, then F is said to be continuable and G is said
to be a continuation of F; otherwise, F is said to be non-continuable or maximal.
The closed Riemann surfaces are maximal. Radó [14] has shown by example
that there exist open maximal Riemann surfaces. We remark that every open
topological surface admits a topological continuation. Later Bochner [1] estab¬ 
lished by appeal to the well-ordering hypothesis and transfinite induction that 
every continuable Riemann surface admits a maximal continuation. Shortly 
thereafter, the question of characterizing the continuable Riemann surfaces was 
considered by de Possel [10, 11], first, in two notes which are fragmentary in 
character and do not contain any indication of proofs, and later, in his thesis 
[12], where he gives a characterization of continuable Riemann surfaces in terms 
of “sets of maximal type” [12, p. 4] and the topological structure of the surface. 

We shall consider the family $\Phi$ consisting of Riemann surfaces, G, which are
continuations of a given continuable Riemann surface, F, and of F itself. Let
it be assumed that F does not admit the Riemann sphere or a closed Riemann
surface of genus one as a continuation save when the contrary is mentioned. Under
these hypotheses, it will be shown that an explicitly given subset, $\Phi_0$, of $\Phi$ may
be defined in a natural manner to be an $\mathcal{L}$-space in the sense of Fréchet [3] and
that so defined $\Phi_0$ is compact. With the aid of this result, the theorem of
Bochner may be established without appeal to transfinite induction. Problems
concerning the exhibition of a maximal continuation of a given continuable Riemann
surface, and the existence of a maximal continuation with specified properties
to be stated in the course of the present paper find an appropriate setting in
the study of the structure of $\Phi$ and $\Phi_0$. These questions have not been treated
hitherto. 


2. Riemann surfaces, Fuchsian and Fuchsoid groups 

We recall that a Riemann surface in the sense of Weyl-Radó may be defined
as a 2-dimensional manifold F such that to each neighborhood $U(w_0)$ of a point
$w_0$ on the manifold there are associated biuniform and bicontinuous
transformations $\tau_{U(w_0)}$ of U onto simply-connected regions of the complex plane with the
property that, if $U_1$ and $U_2$ are neighborhoods of points $w_1$ and $w_2$ of F
respectively, and if they have a non-vacuous intersection, then $\tau_{U_2}\tau_{U_1}^{-1}$ defines a (1, 1)
directly conformal transformation of $\tau_{U_1}(U_1 \cdot U_2)$ onto $\tau_{U_2}(U_1 \cdot U_2)$ [8, 15, 17].

By virtue of the restriction which we have placed on F (that it admit neither
the Riemann sphere nor a closed Riemann surface of genus one as a continuation),
F is not simply-connected. $\tilde{F}$, the universal covering surface of F, may be
defined by its covering relation to F as a Riemann surface. When so defined, $\tilde{F}$
is conformally equivalent to the interior of the unit circle in the complex plane.
Thus, being given any point $w_0$ of F, we may assert the existence of a
single-valued map

(2.1) $w = w(z, w_0)$

of $|z| < 1$ onto F with the following properties:

1° $w(0, w_0) = w_0$;

2° to each $\zeta$ with $|\zeta| < 1$ and $U(\omega)$ where $\omega = w(\zeta, w_0)$ there corresponds a
neighborhood of $\zeta$, $N(\zeta)$, such that $\tau_{U(\omega)}[w(z, w_0)]$ is analytic and univalent in
$N(\zeta)$;

3° every point w of F has an antecedent with respect to (2.1) in $|z| < 1$;

4° any local determination of the inverse of $w(z, w_0)$, $z(w)$, in $U(\omega)$ for which
$\zeta = z(\omega, w_0)$ can be continued along any analytic arc lying in F whose initial
point is $\omega$.

Since F is not simply-connected, $w(z, w_0)$ is not univalent and hence by 2°,
3°, and 4° it is automorphic with respect to a group $\mathfrak{G}(F)$ of linear fractional
transformations z | Tz which map $|z| < 1$ onto itself. By 2° the transformations
T are either hyperbolic or parabolic, but never elliptic. To avoid circumlocutions,
we recall that a group $\mathfrak{G}$ consisting of linear fractional transformations
mapping $|z| < 1$ onto itself, which is properly discontinuous for $|z| < 1$,
is called Fuchsian if it has a finite number of generators, otherwise Fuchsoid [2].
Since we shall consider exclusively groups which have no elliptic transformations,
we shall employ the unqualified terms “Fuchsian group” and “Fuchsoid
group”, understanding that the groups considered are free from elliptic
transformations.
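
The trichotomy hyperbolic / parabolic / elliptic can be checked numerically. The following is a minimal sketch and not part of the original paper: it assumes the standard normal form T(z) = (az + b)/(b̄z + ā) with |a|² − |b|² > 0 for a linear fractional map of |z| < 1 onto itself, and classifies T by the normalized squared trace 4(Re a)²/(|a|² − |b|²). The function name `classify` and the tolerance `tol` are illustrative choices.

```python
import math


def classify(a, b, tol=1e-12):
    """Classify T(z) = (a z + b) / (conj(b) z + conj(a)).

    Assumes |a| > |b|, so that T maps the unit disc onto itself.
    Returns one of "identity", "elliptic", "parabolic", "hyperbolic".
    """
    a, b = complex(a), complex(b)
    # Determinant of the matrix [[a, b], [conj(b), conj(a)]].
    det = a.real ** 2 + a.imag ** 2 - b.real ** 2 - b.imag ** 2
    if det <= 0:
        raise ValueError("need |a| > |b| for a map of the unit disc onto itself")
    # T is the identity exactly when b = 0 and a is real.
    if abs(b) <= tol and abs(a.imag) <= tol:
        return "identity"
    # Squared trace after normalizing the determinant to 1.
    t2 = 4.0 * a.real ** 2 / det
    if abs(t2 - 4.0) <= tol:
        return "parabolic"
    return "hyperbolic" if t2 > 4.0 else "elliptic"


if __name__ == "__main__":
    samples = {
        "a = cosh 1, b = sinh 1": (math.cosh(1.0), math.sinh(1.0)),
        "a = 1 + i, b = i": (1 + 1j, 1j),
        "a = i, b = 0": (1j, 0),
    }
    for label, (a, b) in samples.items():
        print(label, "->", classify(a, b))
```

In this language, the groups considered in the text are those in which no element other than the identity is classified as elliptic.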

The group $\mathfrak{G}(F)$ being Fuchsian or Fuchsoid, it follows that to any point
$z_0$ of $|z| < 1$, there corresponds a neighborhood $N(z_0)$ such that every
simply-connected region g containing $z_0$ and contained in $N(z_0)$ satisfies

(2.2) $T_k g \cdot T_l g = 0 \qquad (T_k, T_l \in \mathfrak{G}(F),\ T_k \neq T_l)$

We now consider the space F* formed by identifying the points of $|z| < 1$
which are equivalent with respect to $\mathfrak{G}(F)$ [16, p. 31]. Neighborhoods $U^*(w_0^*)$
of a point $w_0^*$ of F* are defined to be those sets of points of F* which correspond
to regions of the type g under the identification, where g is to contain a point
$z_0$ of $|z| < 1$ in correspondence with $w_0^*$. It is readily seen that with this
definition F* is a surface. The maps which associate $U^*(w_0^*)$ with Tg ($T \in \mathfrak{G}(F)$)
under the identification define F* as a Riemann surface. The Riemann surfaces
F and F* are in (1, 1) correspondence and are conformally equivalent. We
shall denote F* by $(|z| < 1)$ (mod $\mathfrak{G}(F)$), and, in general, by E (mod $\mathfrak{G}$) we
shall denote the space formed by identifying points of a set E of the extended
z-plane which are equivalent with respect to a Fuchsian or Fuchsoid group $\mathfrak{G}$.





Suppose now that we have another map of the type (2.1) of $|z| < 1$ onto F.
This second map is automorphic with respect to a Fuchsian or Fuchsoid group
$\mathfrak{G}'(F)$. There exists a linear fractional transformation S mapping $|z| < 1$ onto
itself such that

(2.3) $\mathfrak{G}'(F) = S\,\mathfrak{G}(F)\,S^{-1}$.

Conversely, if S is an arbitrary linear fractional transformation mapping $|z| < 1$
onto itself, then the Riemann surface $(|z| < 1)$ (mod $S\,\mathfrak{G}(F)\,S^{-1}$) is conformally
equivalent to F. Thus there is associated with F a class of Fuchsian or Fuchsoid
groups, each group being obtained from another by taking the transform with
respect to an appropriately chosen linear fractional transformation S mapping $|z| < 1$
onto itself, such that when we identify $|z| < 1$ with respect to any one of these groups
in the manner indicated above, we obtain a Riemann surface conformally equivalent
to F.

On the other hand, if we start with a class of groups, consisting of a given
non-trivial Fuchsian or Fuchsoid group $\mathfrak{G}$, and its transforms, $S\,\mathfrak{G}\,S^{-1}$, where S
is an arbitrary linear fractional transformation mapping $|z| < 1$ onto itself,
there is associated with this class of groups by the above identification process
a class of conformally equivalent Riemann surfaces, $(|z| < 1)$ (mod $S\,\mathfrak{G}\,S^{-1}$).
These remarks will be significant in the present discussion.
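
That taking the transform by S does not change the type of a transformation can be illustrated with 2×2 matrices. The sketch below is an illustration under stated assumptions, not the paper's own argument: disc maps are represented by matrices ((a, b), (b̄, ā)), the helper names are hypothetical, and S⁻¹ is replaced by the adjugate of S, which differs from it only by the scalar factor det S and therefore defines the same linear fractional map.

```python
def disc_matrix(a, b):
    """Matrix ((a, b), (conj(b), conj(a))) of a map of |z| < 1 onto itself."""
    a, b = complex(a), complex(b)
    return ((a, b), (b.conjugate(), a.conjugate()))


def mul(M, N):
    """Product of two 2x2 matrices given as nested tuples."""
    (p, q), (r, s) = M
    (t, u), (v, w) = N
    return ((p * t + q * v, p * u + q * w), (r * t + s * v, r * u + s * w))


def adjugate(M):
    """Adjugate of M; equals det(M) times the inverse of M."""
    (p, q), (r, s) = M
    return ((s, -q), (-r, p))


def normalized_trace_sq(M):
    """trace(M)^2 / det(M); unchanged when M is multiplied by a scalar."""
    (p, q), (r, s) = M
    return ((p + s) ** 2 / (p * s - q * r)).real


def transform_by(S, T):
    """Matrix of S T S^{-1}, up to the scalar factor det(S)."""
    return mul(mul(S, T), adjugate(S))


if __name__ == "__main__":
    T = disc_matrix(2, 1)            # trace^2 / det = 16 / 3 > 4
    S = disc_matrix(2 + 1j, 1 - 1j)  # |a|^2 - |b|^2 = 3 > 0
    print(normalized_trace_sq(T), normalized_trace_sq(transform_by(S, T)))
```

Because the normalized squared trace is the same before and after the transform, a group free from elliptic transformations stays free from them under S.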

It may be shown [2, Chap. III] for a Fuchsian or Fuchsoid group $\mathfrak{G}$, which
is not cyclic, that the limit points of the set $\{Tz_0\}$, where $z_0$ is any point of $|z| < 1$
and $T \in \mathfrak{G}$, consist of either $|z| = 1$ or else of a perfect totally disconnected set
of points on $|z| = 1$. In the first case $\mathfrak{G}$ is said to be of the first kind, and in
the second case of the second kind.

3. The families $\Phi$ and $\Phi_0$

Let F be a given continuable Riemann surface satisfying the requirement
imposed in §1 and let $w_0$ denote a given point of F. By hypothesis there exists
a Riemann surface G and a (1, 1) directly conformal map of F onto a proper
subset of G. Let W = W(w) denote this correspondence and let $W_0$ denote
$W(w_0)$. Now let $w(z, w_0)$ and $W(Z, W_0)$ define uniformization maps of $|z| < 1$
and $|Z| < 1$ onto F and G respectively of the type (2.1) such that $w(0, w_0) = w_0$
and $W(0, W_0) = W_0$. The function Z = f(z) which is uniquely defined by the
requirements

(3.1) $f(0) = 0, \qquad W[f(z), W_0] = W[w(z, w_0)]$

has the following properties: 

$A_1$) Z = f(z) is single-valued, analytic and of modulus less than unity for $|z| < 1$.

$A_2$) If $T \in \mathfrak{G}(F)$, then $f(T) = U_T[f(z)]$ where $U_T \in \mathfrak{G}(G)$.

$A_3$) If $f(z_1)$ is equivalent to $f(z_2)$ with respect to $\mathfrak{G}(G)$, then $z_1$ is equivalent to $z_2$
with respect to $\mathfrak{G}(F)$.

We remark that we may fix $w(z, w_0)$ (which depends upon a single real parameter
$\theta$) once and for all and hence $\mathfrak{G}(F)$. This choice of $w(z, w_0)$ having been made,
we may choose $W(Z, W_0)$ so that

$A_4$) $f'(0) > 0$.

Observe that f(z) defines a homomorphism or an isomorphism of $\mathfrak{G}(F)$ into
$\mathfrak{G}(G)$. Two cases are to be distinguished: either the image of $\mathfrak{G}(F)$ under this
homomorphism is precisely $\mathfrak{G}(G)$, or else the image of $\mathfrak{G}(F)$ is a proper part of
$\mathfrak{G}(G)$. This latter situation is illustrated by the example of a Fuchsian group
$\mathfrak{G}$ of the second kind. The group $\mathfrak{G}$ is properly discontinuous in the extended
z-plane save at a perfect totally disconnected set of points E on $|z| = 1$. If $F_1$
is the Riemann surface $(|z| < 1)$ (mod $\mathfrak{G}$) and $F_2$ is the Riemann surface

[extended z-plane deleted in E] (mod $\mathfrak{G}$)

the latter is a continuation of the former. Upon examining the structure of the
homomorphism of $\mathfrak{G}(F_1)$ into $\mathfrak{G}(F_2)$ defined by f(z) of (3.1), we find that the
image of $\mathfrak{G}(F_1)$ is a proper subgroup of $\mathfrak{G}(F_2)$.

On the other hand, let there be given two groups $\mathfrak{G}$: {T} and $\Gamma$: {S}, Fuchsian
or Fuchsoid, and let there exist a function Z = f(z) which is analytic and of
modulus less than unity, and which satisfies in addition to the condition f(0) = 0
the conditions $A_1$-$A_4$, where $\mathfrak{G}$ replaces $\mathfrak{G}(F)$ and $\Gamma$ replaces $\mathfrak{G}(G)$. The Riemann
surface G = $(|Z| < 1)$ (mod $\Gamma$) is a continuation of the Riemann surface
F = $(|z| < 1)$ (mod $\mathfrak{G}$). This follows from $A_2$, $A_3$, and the analyticity of
f(z).

It is to be observed that only those elements S of $\Gamma$ occur in the homomorphic
image of $\mathfrak{G}$ defined by f(z) for which S0 is the image of some point of $|z| < 1$
under the map Z = f(z). Hence the strict homomorphic image of $\mathfrak{G}$, $\Gamma_0$,
defined by f(z) also yields a Riemann surface $G_0$ which is a continuation of F,
for f(z), $\mathfrak{G}$, $\Gamma_0$ satisfy the conditions $A_1$-$A_4$ with $\Gamma_0$ replacing $\Gamma$. We shall
denote the class consisting of F and all its continuations by $\Phi$, and by $\Phi_0$ that
subclass of $\Phi$ consisting of F and all its continuations $\{G_\lambda\}$ for which some f(z)
of (3.1) defines a strict homomorphic map of $\mathfrak{G}(F)$ onto $\mathfrak{G}(G_\lambda)$.

It is apparent from the conditions $A_1$-$A_4$ that, if F admits a continuation
$G \in \Phi$, it always admits a continuation $G_0 \in \Phi_0$.

4. The family $\Phi_0$ as an $\mathcal{L}$-space [3]

Let there be given a sequence of elements $\{G_k\}$ (k = 1, 2, ...) and an element
$G_0$ of $\Phi_0$. The sequence $\{G_k\}$ will be said to have the limit $G_0$, denoted by
$\mathcal{L}_{k\to\infty} G_k$, if to each $G_k$ there corresponds a function $f_k(z)$ satisfying the
conditions $A_1$-$A_4$ with $\mathfrak{G}(G)$ replaced by $\mathfrak{G}(G_k)$, and if to $G_0$ there corresponds a
function $f_0(z)$ satisfying $A_1$-$A_4$ with $\mathfrak{G}(G)$ replaced by $\mathfrak{G}(G_0)$ such that

(4.1) $\lim_{k\to\infty} f_k(z) = f_0(z)$

for $|z| < 1$, and each $f_k$ and $f_0$ define strict homomorphic mappings of $\mathfrak{G}(F)$
onto $\mathfrak{G}(G_k)$ and $\mathfrak{G}(G_0)$ respectively. With this definition of a limit, $\Phi_0$ becomes
an $\mathcal{L}$-space in the sense of Fréchet. Our first object will be to show that $\Phi_0$ is
compact. That is, given any sequence of elements $\{G_k\}$ of $\Phi_0$, there exists a
subsequence $\{G_{n(k)}\}$ and an element $G_0$ of $\Phi_0$ such that

(4.2) $\mathcal{L}_{k\to\infty} G_{n(k)} = G_0$.

As a preliminary step in the proof of the compactness of $\Phi_0$ we establish

(4.3) $\mu = \text{g.l.b. } f'(0) > 0$,

where f(z) is taken over the class of functions which satisfy the conditions
$A_1$-$A_4$ for some $\mathfrak{G}(G)$, $G \in \Phi_0$ (or $\Phi$). If the statement (4.3) were not true, there
would exist a sequence of functions $\{f_n(z)\}$ (n = 1, 2, ...) such that $f_n(z)$
satisfies $A_1$-$A_4$ for $\mathfrak{G}(G_n)$ and $\lim_{n\to\infty} f_n'(0) = 0$. We denote $f_n'(0)$ by $a_n$, and
consider in place of the functions $f_n(z)$, the functions

(4.4) $\varphi_n(z) = f_n(z)/a_n$ (n = 1, 2, ...).

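From $A_1$-$A_4$ we infer that the function $\varphi_n(z)$ satisfies the following conditions:

$B_1$) $\varphi_n(z)$ is analytic for $|z| < 1$, $\varphi_n'(0) = 1$.

$B_2$) If $f_n(T) = U_T^{(n)}[f_n(z)]$, where $T \in \mathfrak{G}(F)$, $U_T^{(n)} \in \mathfrak{G}(G_n)$, then
$\varphi_n(T) = V_T^{(n)}[\varphi_n(z)]$, where $V_T^{(n)} = \sigma_n^{-1} U_T^{(n)} \sigma_n$, $\sigma_n$ being the linear
transformation z | $a_n z$.

$B_3$) If $\varphi_n(z_1)$ is equivalent to $\varphi_n(z_2)$ with respect to $\sigma_n^{-1}\,\mathfrak{G}(G_n)\,\sigma_n$, then $z_1$ is
equivalent to $z_2$ with respect to $\mathfrak{G}(F)$.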
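
The passage from the second condition of the A list to the second condition of the B list is a one-line conjugation by the scaling Z | Z/a_n. The following LaTeX sketch spells it out; the explicit normal form of U with coefficients α and β is an assumption introduced here for illustration only and does not appear in the paper.

```latex
% \sigma_n : z \mapsto a_n z, so that \varphi_n = \sigma_n^{-1} f_n.
\begin{align*}
\varphi_n(Tz) = \sigma_n^{-1} f_n(Tz)
              = \sigma_n^{-1} U_T^{(n)}[f_n(z)]
              = \sigma_n^{-1} U_T^{(n)} \sigma_n [\varphi_n(z)]
              = V_T^{(n)}[\varphi_n(z)].
\end{align*}
% Assumed normal form: U(Z) = (\alpha Z + \beta)/(\bar\beta Z + \bar\alpha). Then
\[
(\sigma_n^{-1} U \sigma_n)(Z) = \frac{U(a_n Z)}{a_n}
  = \frac{\alpha Z + \beta / a_n}{a_n \bar\beta\, Z + \bar\alpha}.
\]
```

Under this assumed form, if the coefficients converge as a_n tends to zero (α to α_0, β/a_n to t, hence β to 0), the limit is the affine map Z | (α_0 Z + t)/ᾱ_0, which fixes Z = ∞; the identity and the translations correspond to α_0/ᾱ_0 = 1.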

The proper discontinuity of $\mathfrak{G}(F)$ coupled with condition $B_3$ implies that to
each point $z_0$ of $|z| < 1$ there corresponds a neighborhood $N(z_0)$ such that
every member of the sequence $\{\varphi_n(z)\}$ is univalent in $N(z_0)$. In particular,
since $\varphi_n(0) = 0$ and $\varphi_n'(0) = 1$, there exists a subsequence, $\{\varphi_{n(m)}(z)\}$, of $\{\varphi_n(z)\}$
which converges uniformly in N(0) and has as its limit function a function
univalent in N(0). However, since $\varphi_n(0) = 0$ (n = 1, 2, ...) and $\varphi_n(z)$ is
univalent in $N(z_0)$ for each $z_0$ in $|z| < 1$, the family $\{\varphi_n(z)\}$ is normal for
$|z| < 1$ [7, p. 34] and hence by Stieltjes' theorem [7, p. 28], the sequence
$\{\varphi_{n(m)}(z)\}$ converges continuously in the sense of Carathéodory for $|z| < 1$.
Let $\varphi_0(z)$ denote $\lim_{m\to\infty} \varphi_{n(m)}(z)$. We note $\varphi_0(0) = 0$, $\varphi_0'(0) = 1$. The linear
fractional transformation $V_T^{(n(m))}$ converges to the linear fractional transformation
$V_T$, which, a priori, is either the identity or else a translation of the Z-plane
preserving Z = ∞. The condition $B_2$ yields

(4.5) $\varphi_0(T) = V_T[\varphi_0(z)]$;

and the condition $B_3$ and the fact that $\varphi_0(0) = 0$, $\varphi_0'(0) = 1$ imply that, if $\varphi_0(z_1)$
is equivalent to $\varphi_0(z_2)$ with respect to the group $\{V_T\}$, then $z_1$ is equivalent to $z_2$ with
respect to $\mathfrak{G}(F)$. To see this, we note that, as a consequence of $\varphi_0(0) = 0$ and
$\varphi_0'(0) = 1$, $\varphi_0(z)$ is univalent in every $N(z_0)$ where each member of the sequence
$\{\varphi_n(z)\}$ is univalent. In particular, $\varphi_0(z)$ maps $N(z_2)$ (1, 1) and directly
conformally onto a region in the Z-plane containing $\varphi_0(z_2)$ in its interior. Since
$\varphi_0(z_1)$ and $\varphi_0(z_2)$ are equivalent with respect to $\{V_T\}$, let V denote that member
of $\{V_T\}$ for which

(4.6) $\varphi_0(z_2) = V[\varphi_0(z_1)]$.

Now $V = \lim_{m\to\infty} V^{(n(m))}$, where $V^{(n(m))} \in \mathfrak{G}(G_{n(m)})$. Hence for m sufficiently large,
$V^{(n(m))}[\varphi_0(z_1)]$ lies in the image of $N(z_2)$ under $\varphi_0(z)$, $\varphi_0[N(z_2)]$. Since
$\varphi_0(z_1) = \lim_{m\to\infty} \varphi_{n(m)}(z_1)$, $V^{(n(m))}[\varphi_{n(m)}(z_1)]$ also lies in $\varphi_0[N(z_2)]$ for m sufficiently large.
Further, for m sufficiently large and for $\rho$ (> 0) sufficiently small, the circle
$N_\rho(\varphi_0(z_2))$ with center $\varphi_0(z_2)$ and radius $\rho$ is covered by the images of $N(z_2)$ with
respect to $\varphi_{n(m)}(z)$. The sequence $\{V^{(n(m))}[\varphi_{n(m)}(z_1)]\}$ has for its limit as $m \to \infty$,
$\varphi_0(z_2)$, and hence for m sufficiently large $V^{(n(m))}[\varphi_{n(m)}(z_1)]$ lies in $N_\rho(\varphi_0(z_2))$ and
hence $V^{(n(m))}[\varphi_{n(m)}(z_1)]$ has an antecedent $\zeta_{n(m)}$ in $N(z_2)$ with respect to the map
$\varphi_{n(m)}(z)$. By $B_3$, $\zeta_{n(m)}$ is equivalent to $z_1$ with respect to $\mathfrak{G}(F)$. Replacing $N(z_2)$
by a sequence of neighborhoods of $z_2$, $\{M_p(z_2)\}$, $M_p(z_2) \subset N(z_2)$, (p = 1, 2, ...)
where the diameter of $M_p \to 0$ as $p \to \infty$, we find by repeating the above
argument

(4.7) $\lim_{m\to\infty} \zeta_{n(m)} = z_2$.

But $\zeta_{n(m)}$ is equivalent to $z_1$ with respect to $\mathfrak{G}(F)$. The proper discontinuity of
$\mathfrak{G}(F)$ implies $\zeta_{n(m)} = z_2$ for m sufficiently large. In other words, $z_2$ is equivalent
to $z_1$ with respect to $\mathfrak{G}(F)$ as we wished to prove.

The local univalence of $\varphi_0(z)$ guarantees the proper discontinuity of the group
$\{V_T\}$ in the finite Z-plane. Suppose, for example, that $\{V_T\}$ contained a
sequence $\{V_k\}$ ($V_k \neq Z$, k = 1, 2, ...) which converged to Z itself. Consider
any neighborhood of z = 0, N(0); its image under $\varphi_0(z)$ would contain a set of
points $\{Z_k\}$ ($Z_k \neq 0$) equivalent to Z = 0 with respect to $\{V_T\}$ which has
Z = 0 as a limit point. The antecedents of $Z_k$ in N(0) would then have z = 0
as a limit point and would be equivalent to z = 0 with respect to the group
$\mathfrak{G}(F)$. This contradicts the proper discontinuity of $\mathfrak{G}(F)$.

The group $\{V_T\}$ must necessarily be one of the following: the identity, a
simply-periodic group of translations, a doubly-periodic group of translations.

The system of relations (4.5) and the modified statement of $B_3$ with $\varphi_0$
replacing $\varphi_n$ and $\{V_T\}$ replacing $\mathfrak{G}(G_n)$, together with the stated restrictions on
$\{V_T\}$, imply that F admits the Riemann sphere or a closed Riemann surface of
genus one as a continuation. But this is contrary to our assumption on F
(cf. §1). Hence (4.3) is valid.

The compactness of $\Phi_0$ now follows. For let us consider a sequence of
functions $\{f_n(z)\}$ where $f_n(z)$ (n = 1, 2, ...) correlates the Riemann surfaces F
and $G_n$ in accordance with $A_1$-$A_4$, $f_n(z)$ replacing f(z) and $\mathfrak{G}(G_n)$ replacing $\mathfrak{G}(G)$.
The sequence $\{f_n(z)\}$ is bounded by unity and is hence normal for $|z| < 1$.
To each point $z_0$ of $|z| < 1$ there corresponds a neighborhood, $N(z_0)$, in which
$f_n(z)$ is univalent (n = 1, 2, ...). There exists a subsequence $\{f_{n(m)}(z)\}$ which
converges continuously for $|z| < 1$ as $m \to \infty$. Denote $\lim_{m\to\infty} f_{n(m)}(z)$ by
$f_0(z)$. The requirements $A_1$ and $A_4$ are obviously satisfied by $f_0(z)$. Since
$f_{n(m)} \to f_0$, $U_T^{(n(m))} \to U_T^{0}$, where, a priori, $U_T^{0}$ is either a linear fractional
transformation mapping $|Z| < 1$ onto itself, or else a constant of modulus unity.
This latter possibility is to be excluded since $f_0(0) = 0$. The set $\{U_T^{0}\}$
constitutes a group, $\mathfrak{G}_0$, since it is the homomorphic image of $\mathfrak{G}(F)$. Finally, by
virtue of (4.3) $f_0(z)$ is univalent in the $N(z_0)$ specified above for each $z_0$ of
$|z| < 1$. Hence the group $\mathfrak{G}_0$ must be properly discontinuous for $|Z| < 1$,
if the proper discontinuity of $\mathfrak{G}(F)$ is not to be violated. Obviously $\mathfrak{G}_0$ contains
no elliptic transformations. The requirements $A_2$ and $A_3$ are met by $f_0(z)$ and
$\mathfrak{G}_0$, the proof that $A_3$ is satisfied being analogous to the proof that $\varphi_0(z)$ satisfies
$B_3$ with $\{V_T\}$ replacing $\sigma_n^{-1}\,\mathfrak{G}(G_n)\,\sigma_n$. But then the Riemann surface $G_0 \equiv$
$(|Z| < 1)$ (mod $\mathfrak{G}_0$) is a continuation of F. We have

Theorem 4.1: The $\mathcal{L}$-space $\Phi_0$ is compact.

The theorem of Bochner [1, p. 415] is an easy corollary of Theorem 4.1. By
the compactness of $\Phi_0$, there exists a function $f_0(z)$ and a Fuchsian or Fuchsoid
group $\mathfrak{G}_0$ such that $A_1$-$A_4$ are satisfied with $f \equiv f_0$ and $\mathfrak{G}(G) \equiv \mathfrak{G}_0$, and $f'(0) = \mu$.
If the Riemann surface $(|Z| < 1)$ (mod $\mathfrak{G}_0$), which is a continuation of F, were
not maximal, there would exist a function h(z) and a Fuchsian or Fuchsoid
group $\mathfrak{H}$ fulfilling the requirements $A_1$-$A_4$ with $f \equiv h$, $\mathfrak{G}(G) \equiv \mathfrak{H}$ and $h'(0) < 1$.
Obviously we may take $\mathfrak{H}$ as the homomorphic image of $\mathfrak{G}_0$ under the mapping
defined by h(z). Consider the function

(4.8) $g(z) = h[f_0(z)]$

which defines a homomorphic mapping of $\mathfrak{G}(F)$ onto $\mathfrak{H}$ and satisfies in
conjunction with $\mathfrak{H}$ $A_1$-$A_4$, where the appropriate replacements are made. Hence
$(|Z| < 1)$ (mod $\mathfrak{H}$) belongs to $\Phi_0$ and is a continuation of F as well as of
$(|Z| < 1)$ (mod $\mathfrak{G}_0$). But $g'(0) = h'(0) f_0'(0) < \mu$. This is contrary to the
definition of $\mu$. The theorem of Bochner follows.
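
The inequality that closes this argument is the chain rule at the origin. A brief worked version follows, using only what the text states: the limit function vanishes at 0 and has derivative μ there, and 0 < h'(0) < 1. The normalization h(0) = 0 is taken over from (3.1) and is assumed here for h.

```latex
\begin{align*}
g(z)  &= h[f_0(z)], \qquad g(0) = h(f_0(0)) = h(0) = 0, \\
g'(0) &= h'(f_0(0))\, f_0'(0) = h'(0)\,\mu, \\
0 &< g'(0) = h'(0)\,\mu < \mu \qquad \text{since } 0 < h'(0) < 1 .
\end{align*}
```

Since g belongs to the class over which the greatest lower bound in (4.3) is taken, a value of g'(0) below μ is impossible, which is the contradiction used in the text.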

Theorem 4.2: Every continuable Riemann surface admits a maximal con¬ 
tinuation. 

We note in passing that the F satisfying the condition of §1 always admit
maximal continuations belonging to $\Phi_0$.

5. Necessary and sufficient conditions that F be continuable 

Let F denote a continuable Riemann surface satisfying the restrictions laid
down in §1. Up to the present we have made use of the functions f(z)
associated with the continuations of F to establish the compactness of $\Phi_0$.
However, we have considered only the grossest properties of f(z) without studying
the relation of the structure of the homomorphism defined by f(z) of $\mathfrak{G}(F)$ into
$\mathfrak{G}(G)$ to the nature of the continuations which F admits. We now propose to
investigate this structure in greater detail.

It is, a priori, evident that either for every continuation G of F and every
associated f(z) the mapping of $\mathfrak{G}(F)$ into $\mathfrak{G}(G)$ is strictly isomorphic, or else
there exists a continuation G* of F and an associated f(z) such that some element
of $\mathfrak{G}(F)$ other than the identity is mapped into the identity of $\mathfrak{G}(G^*)$. If the
first case occurs, then every f(z) defines a (1, 1) map of $|z| < 1$ onto a subregion
of $|Z| < 1$. This is a consequence of $A_3$ which states that

$f(z_1) = f(z_2)$

implies $z_1$ is equivalent to $z_2$ with respect to some element T, not the identity,
of $\mathfrak{G}(F)$. But this would imply

$f(Tz) = f(z)$,

which is contrary to our assumption on F.

If the second possibility is realized, then a normal subgroup, $\mathfrak{g}$, not reducing
to the identity, of $\mathfrak{G}(F)$ is mapped on the identity of $\mathfrak{G}(G^*)$ and f(z) is
automorphic with respect to $\mathfrak{g}$. Let R denote the image in $|Z| < 1$ of $|z| < 1$
under the map Z = f(z). Any determination of the inverse of f(z), say the
element at Z = 0 which carries Z = 0 into z = 0, can be continued analytically
throughout R by virtue of $A_3$. The inverse of f(z) takes on each value of
$|z| < 1$ precisely once. If R were simply-connected, the inverse of f(z) would
be single-valued in R, and this would contradict the fact that f(z) is
automorphic with respect to a group $\mathfrak{g}$ not the identity. Hence R is a
multiply-connected region lying in $|Z| < 1$ and Z = f(z) is a uniformization mapping
which defines $|z| < 1$ as the universal covering surface of R.

It may be possible to express the boundary of R, to be denoted by $\beta R$, as the
sum of two disjoint closed sets $\sigma_1$ and $\sigma_2$, one of which, say $\sigma_1$, lies wholly in
$|Z| < 1$ and is totally disconnected. If this is so, there exists a closed Jordan
curve c lying wholly in R with diameter as small as we please, such that the
interior of c contains points of $\beta R$, and no two distinct points of the closure of
the interior of c are equivalent with respect to $\mathfrak{G}(G^*)$. If this is not the case,
then there exists as a consequence of the multiple connectivity of R a continuum
K lying wholly in $|Z| < 1$, which is a maximal connected component of $\beta R$.
This continuum K does not contain a pair of points $Z_1$ and $Z_2$ with the property
that $Z_2 = UZ_1$, where $U (\neq I) \in \mathfrak{G}(G^*)$. Otherwise, K being maximal,

(5.1) $UK = K$,

and, U being either hyperbolic or parabolic, K would not be at a positive
distance from $|Z| = 1$. Thus there exists a positive number $\varepsilon$ for which the
$\varepsilon$-neighborhood of K contains no two distinct points which are equivalent with
respect to $\mathfrak{G}(G^*)$, and consequently there exists a closed Jordan curve k lying
in the intersection of R and the $\varepsilon$-neighborhood of K, such that K is in the
interior of k.

When it is noted that the region R is invariant under the transformations of
$\mathfrak{G}(G^*)$, the significance of the existence of the closed Jordan curves c or k becomes
clear. For it is readily verified that F is conformally equivalent to R
(mod $\mathfrak{G}(G^*)$). The choice of c or k was such that no two distinct points in the
closure of the interior of c or k were equivalent with respect to $\mathfrak{G}(G^*)$. Hence
the set of points in R (mod $\mathfrak{G}(G^*)$)

(5.2) $\sum_{U \in \mathfrak{G}(G^*)} R \cdot [\text{interior } c \text{ or } k]$ (mod $\mathfrak{G}(G^*)$)

is conformally equivalent to the intersection of R and the interior of c or k,
and is bounded in part by

(5.3) $\gamma = \sum_{U \in \mathfrak{G}(G^*)} U(c \text{ or } k)$ (mod $\mathfrak{G}(G^*)$),

which is a closed Jordan curve lying in R (mod $\mathfrak{G}(G^*)$). The retrosection $\gamma$
divides R (mod $\mathfrak{G}(G^*)$) into two disjoint open connected sets one of which is
(5.2). This implies that R (mod $\mathfrak{G}(G^*)$) and hence F which is conformally
equivalent have boundary elements of the first type [5, p. 164]. We have

Theorem 5.1: A necessary condition that there exists a continuation G* of F
such that some f(z) of (3.1) defines a homomorphism of $\mathfrak{G}(F)$ into $\mathfrak{G}(G^*)$ carrying
a normal subgroup $\mathfrak{g}$ ($\supset I$ but $\neq I$) into the identity of $\mathfrak{G}(G^*)$ is that F have a boundary
element of the first type.

The converse of this theorem, which is also true, may be demonstrated by a
device due to de Possel [12, p. 17]. If F has a boundary element of the first
type, then there exists a retrosection $\gamma$ of F such that $F - \gamma$ is the sum of two
disjoint open connected subsets of F, $F_1$ and $F_2$, one of which, say $F_1$, is planar
(i.e. homeomorphic to a plane region) and multiply-connected. As a
consequence of a theorem due to Koebe [6], $F_1$ may be mapped (1, 1) and directly
conformally onto a plane multiply-connected region $\mathcal{F}_1$ lying in the interior of the
unit circle $|z| = 1$ in the z-plane, such that under this map $\gamma$ and $|z| = 1$ are
in (1, 1) continuous correspondence. The Riemann surface F* formed from F
and $|z| < 1$ by identifying the corresponding points of $F_1$ and $\mathcal{F}_1$ is a
continuation of F. To show that some associated f(z) of (3.1) defines a homomorphism
of $\mathfrak{G}(F)$ into $\mathfrak{G}(F^*)$ carrying a normal subgroup $\mathfrak{g}$ ($\supset I$ but $\neq I$) into the identity
of $\mathfrak{G}(F^*)$, observe that there exists a closed Jordan curve $\Gamma$ lying in $\mathcal{F}_1$, which
contains in its interior a part of the boundary of $\mathcal{F}_1$. The image of $\Gamma$ in F
cannot be deformed to a point in F, whereas the image of $\Gamma$ in F* can be
deformed to a point in F*. Hence our assertion follows.

Theorem 5.2: If F has a boundary element of the first type, then F admits a
continuation F* such that some associated homomorphic mapping of $\mathfrak{G}(F)$ into
$\mathfrak{G}(F^*)$ carries a normal subgroup of $\mathfrak{G}(F)$, $\mathfrak{g}$ ($\supset I$ but $\neq I$), into the identity of $\mathfrak{G}(F^*)$.

Let us now consider necessary and sufficient conditions that a Riemann
surface F admit a continuation G such that some f(z) of (3.1) defines an
isomorphism of $\mathfrak{G}(F)$ into $\mathfrak{G}(G)$. We shall give two such conditions, one in terms
of the structure of $\mathfrak{G}(F)$, the other in terms of intrinsic properties of F.

Recall that, if an f(z) of (3.1) defines an isomorphic mapping of $\mathfrak{G}(F)$ into
$\mathfrak{G}(G)$, then f(z) defines a (1, 1) map of $|z| < 1$ onto a subregion of $|Z| < 1$.
Under these circumstances, $\mathfrak{G}(F)$ must be of the second kind. For suppose, on the
contrary, that $\mathfrak{G}(F)$ were of the first kind; that is, the set of points $\{\zeta\}$
defined by

(5.4) $T\zeta = \zeta \qquad (T \in \mathfrak{G}(F))$

would be dense on $|z| = 1$. If T is parabolic, let k denote an oricycle lying in
$|z| < 1$ save for the point $\zeta$ and tangent to $|z| = 1$ at $\zeta$; and if T is
hyperbolic, let k denote a circular arc lying in $|z| < 1$ except for its endpoints $\zeta$ and $\xi$
(the other fixed point of T). By $A_2$ when z tends to $\zeta$ along k, f(z) tends to a
fixed point $\eta$ of $U_T$, which is the unique fixed point of $U_T$, if $U_T$ or T is parabolic
(in this latter case it may be shown that $U_T$ is then necessarily parabolic);
otherwise, f(z) tends to that fixed point of $U_T$ which is $\lim_{|n|\to\infty} U_T^n$, the {n} being
the positive or negative integers depending upon whether the n are positive or
negative in the relation

$\lim_{|n|\to\infty} T^n = \zeta$.

It follows from a lemma due to Carleman [9, p. 65] that, when z tends radially
to $\zeta$, f(z) tends to the fixed point of $U_T$ prescribed above.

Since we are assuming that G is a non-trivial continuation of F (i.e. G is not
conformally equivalent to F), the boundary of R, where R is the image of
$|z| < 1$ under f(z), must contain points in $|Z| < 1$ and hence points in $|Z| < 1$
which are accessible from R. Let $\omega_Z$ denote an accessible boundary point of R
in $|Z| < 1$, and let $\alpha_Z$ denote a Jordan arc connecting Z = 0 and $\omega_Z$, lying
in R save for the endpoint $\omega_Z$. The antecedent with respect to f(z) of $\alpha_Z$ with
$\omega_Z$ deleted is such that when Z tends to $\omega_Z$ on $\alpha_Z$, its antecedent tends to a
unique point, $\omega_z$, on $|z| = 1$. Application of Carleman’s lemma shows that,
as z tends to $\omega_z$ radially, f(z) tends to $\omega_Z$. On the assumption that $\mathfrak{G}(F)$ is of
the first kind, there exists a sequence of arcs $\{\beta_n\}$ lying on $|z| = 1$ where
$\beta_n \supset \beta_{n+1}$ (n = 1, 2, ...), each $\beta_n$ contains $\omega_z$, the endpoints, $\zeta_n^{(1)}$ and $\zeta_n^{(2)}$,
of $\beta_n$ are fixed points of transformations of $\mathfrak{G}(F)$, and the length of $\beta_n$ tends to
zero monotonically. For each positive integer n, the point $\omega_Z$ is contained in
the Jordan region, $\sigma_n$, which is defined by the following two properties:

1° It is bounded by the images with respect to f(z) of the radii joining z = 0
to the points $\zeta_n^{(1)}$ and $\zeta_n^{(2)}$ respectively and one of the two arcs of $|Z| = 1$ with
endpoints the fixed points of transformations of $\mathfrak{G}(G)$ associated with $\zeta_n^{(1)}$ and
$\zeta_n^{(2)}$ in the manner indicated above.

2° $\sigma_n$ contains the image with respect to f(z) of the region bounded by the
radii joining z = 0 to $\zeta_n^{(1)}$ and $\zeta_n^{(2)}$ respectively and the arc $\beta_n$.

Consider the maximal connected component containing $\omega_Z$ of the intersection
of the boundary of R and the set of points C: $|Z - \omega_Z| \leq \rho$, where $\rho$ is positive
and is chosen so small that C lies in $|Z| < 1$; denote this component by $\Omega$.
I say that $\Omega \subset \sigma_n$ (n = 1, 2, ...). This is a consequence of the fact that
$\omega_Z \in \sigma_n$ (n = 1, 2, ...). Since the set $\Omega$ obviously does not consist solely of the
point $\omega_Z$, and since the set of points of the boundary of R which are accessible
from R are dense on the boundary of R, the set $\Omega$ contains a point $\omega_Z^*$ ($\neq \omega_Z$)
which is accessible from R. Repeating the argument employed before in
considering $\omega_Z$, we find that there exists a point $\omega_z^*$ on $|z| = 1$ such that, when z
tends to $\omega_z^*$ radially, f(z) tends to $\omega_Z^*$. But the point $\omega_z^*$ must lie on the arc $\beta_n$
for every positive integer n; and since the length of $\beta_n$ tends to zero as $n \to \infty$,
$\omega_z^*$ must necessarily coincide with $\omega_z$. The contradiction is manifest. Hence
$\mathfrak{G}(F)$ is of the second kind.

The converse is also true. Let F be a Riemann surface with $\mathfrak{G}(F)$ of the
second kind and let $\mathcal{E}$ denote the set of points on $|z| = 1$ where $\mathfrak{G}(F)$ is not
properly discontinuous. The Riemann surface,

(5.5) G = [Extended z-plane $- \mathcal{E}$] (mod $\mathfrak{G}(F)$),

is a continuation of F. It is very simple to exhibit an f(z) of (3.1) associated
with this continuation. Observe that the region P consisting of the extended
z-plane deleted in the set $\mathcal{E}$ is a smooth, unbounded, regular covering surface
of G. Let W(z) denote the mapping of P onto G defined by the identification
(5.5), and let $z = \varphi(Z)$ denote the uniformization mapping of $|Z| < 1$ onto P,
where $\varphi(0) = 0$ and $\varphi'(0) > 0$. Then $W[\varphi(Z)]$ defines $|Z| < 1$ as the
universal covering surface of G. Observe that W(z) defines $|z| < 1$ as the
universal covering surface of $(|z| < 1)$ (mod $\mathfrak{G}(F)$) which is conformally
equivalent to F. Now, proceeding as before, we see that f(z) may be defined by

(5.6) $W(z) = W\{\varphi[f(z)]\}$

where f(0) = 0. This implies, since $\varphi(0) = 0$, that the relation

(5.7) $z = \varphi[f(z)]$

holds for $|z| < 1$. The univalence of z itself implies the univalence of f(z).
Hence the mapping of $\mathfrak{G}(F)$ into $\mathfrak{G}(G)$ defined by f(z) is isomorphic. In résumé,
we have

Theorem 5.3: A necessary and sufficient condition that a Riemann surface F
admit a continuation G with the property that an associated f(z) of (3.1) defines an
isomorphic mapping of $\mathfrak{G}(F)$ into $\mathfrak{G}(G)$ is that $\mathfrak{G}(F)$ be of the second kind.

We may also characterize Riemann surfaces F for which $\mathfrak{G}(F)$ is of the second
kind intrinsically in terms of F. Recall that a transversal $\tau$ of an open surface
F is a (1, 1) continuous image lying in F, w(t), of the open unit interval
(0 < t < 1), and having the property that, whenever the sequence $\{t_k\}$ tends to
zero or one, the sequence $\{w(t_k)\}$ does not have any limit point in F. We shall
agree to call a transversal a $\tau$-transversal if the following conditions are fulfilled:

1° It separates F in such a manner that one of the connected components, $\mathfrak{R}$,
is simply-connected.

2° When $\mathfrak{R}$ is mapped (1, 1) and directly conformally onto the interior of the
unit circle, $|\zeta| = 1$, the length of the arc of $|\zeta| = 1$ corresponding to $\tau$ is less
than $2\pi$.



ON CONTINUATION OF A RIEMANN SURFACE 


291 


It is readily seen that the existence of a r-transversal on a Riemann surface F 
is a conformally invariant property of F. 

We now propose to show 

Theorem 5.4: A necessary and sufficient condition that ©(F) be of the second
kind is that F admit an r-transversal.

The necessity follows at once. Let e^{iθ} be a point of |z| = 1 where ©(F) is
properly discontinuous and let κ denote the intersection of |z| < 1 with the
circumference of a circle with center e^{iθ} and radius chosen so small that
T_m κ · T_n κ = 0 (T_m, T_n ∈ ©(F); T_m ≠ T_n). Then

(5.8) Σ_{T ∈ ©(F)} Tκ (mod ©(F))

is an r-transversal of (|z| < 1) (mod ©(F)), and this latter Riemann surface is
conformally equivalent to F.

The sufficiency of Theorem 5.4 may be demonstrated as follows. If F admits
an r-transversal, the antecedent of ℜ in |z| < 1 with respect to the
uniformization mapping of |z| < 1 onto F consists of the sum of disjoint
simply-connected regions. We prefer one of these, say r_0. Since r_0 is in (1,1)
directly conformal correspondence with ℜ under the uniformization mapping of
|z| < 1 onto F by virtue of the monodromy theorem, and since ℜ is in (1,1)
directly conformal correspondence with |ζ| < 1 in the manner indicated above,
by composing these two mappings, we find that there exists a univalent analytic
function, z(ζ), which maps |ζ| < 1 onto r_0 in such a manner that, whenever ζ
tends to any interior point of the arc of |ζ| = 1 complementary to the arc
corresponding to r, |z(ζ)| tends to unity. Application of Schwarz’s reflection
principle [4, p. 45] yields the conclusion that z(ζ) can be continued analytically
across the interior of the arc of |ζ| = 1 complementary to the arc corresponding to r.
So continued, z(ζ) maps a sufficiently small neighborhood N of an interior point
of the complementary arc (1,1) and directly conformally onto a region in the
z-plane containing a point of |z| = 1 in its interior. We take the neighborhood
N to be the interior of a circle with its center at the point of the complementary
arc in consideration. Hence the intersection of N and |ζ| < 1 is mapped by z(ζ)
onto a region on the z-plane lying in r_0 and bounded in part by an arc of |z| = 1.
But r_0 was so defined that no two distinct points of r_0 are equivalent with respect
to ©(F). Therefore there are points on |z| = 1 where ©(F) is properly
discontinuous. The group ©(F) must be of the second kind.¹


1 de Possel has introduced in his thesis [12, p. 15] a concept closely related to our
r-transversals. It is that of a simply-connected region D on F which is bounded in part by a
finite number or denumerable infinity of transversals of F; D has the property that, when 
it is mapped (1,1) and directly conformally onto the interior of the unit circle, the arcs of 
the circumference of the unit circle corresponding to the transversals form a set which is 
not of maximal type (l.c.). We have preferred, however, the concept of r-transversal to 
that of the regions D because the former concept is apparently the more primitive and 
simpler. 



292 


MAURICE H. HEINS 


The following theorem follows readily from the preceding four. 

Theorem 5.5: If ©(F) is of the second kind, then F is always continuable. If
©(F) is of the first kind, then F is continuable if and only if F admits boundary
elements of the first type.

We remark that, if ©(F) is of the first kind and if F is continuable, then the 
boundary elements of the first type of F may be characterized conformally by 
the fact that F admits no r-transversals. 

6. The exhibition of maximal continuations

If a given continuable Riemann surface F has no boundary elements of the 
first type, it is easy to exhibit a maximal continuation of F. Indeed, the 
Riemann surface G defined in (5.5) is a maximal continuation of F. To estab¬ 
lish this, it suffices to show that (a) ©(G) is of the first kind, and (b) G has no 
boundary elements of the first type (Theorem 5.5). 

The assertion (a) may be demonstrated as follows. With the region P
consisting of the extended z-plane deleted in & (§5) we associated its
uniformization mapping, φ(Z), which is automorphic with respect to a Fuchsoid
group, ©(P), which is of the first kind since & is totally disconnected. Now ©(G)
may be taken to be precisely the properly discontinuous group with respect to
which W[φ(Z)] is automorphic (§5). Hence, ©(P) being of the first kind,
©(G) is also and the assertion (a) follows.

Can G admit boundary elements of the first type? If such boundary elements
existed, then G would admit a retrosection γ separating G such that G − γ is
the sum of two disjoint, open, connected sets, one of which, G_1, is planar and
multiply-connected.

The surface G itself may be expressed as the sum of the following disjoint sets:
1° F_1 ≡ (|z| < 1) (mod ©(F)); 2° F_2 ≡ (1 < |z| ≤ ∞) (mod ©(F)); 3° (the
set of points on |z| = 1 complementary to &) (mod ©(F)). The set 3° consists,
a priori, of a system {σ_k} (finite or denumerable) of transversals and retrosections
of G. It cannot however contain any retrosections of G, since, if this were so,
F_1, which is conformally equivalent to F, would have boundary elements of the
first type.

It is clear that the region G_1 must contain points of both F_1 and F_2. If
G_1 ⊂ F_1, then F_1 would have boundary elements of the first type; similarly, if
G_1 ⊂ F_2, F_2, which is homeomorphic to F_1, would have boundary elements of the
first type. Since γ is compact on G, it has points in common with only a finite
number of the σ_k. The set G_1 deleted in the points of the σ_k in G_1 is the sum (finite
or denumerable) of planar regions, g_l, each of which must lie wholly in F_1 or
wholly in F_2, since a point of F_1 cannot be connected to a point of F_2 without
crossing some σ_k. Furthermore, each g_l must be simply-connected; otherwise
either F_1 or F_2 would have boundary elements of the first type.

Observe that at least one g_l is not compact on G. Otherwise, each g_l would
be bounded by a closed Jordan curve consisting of points of γ and of a compact
subset of the finite number of the σ_k having points in common with γ, this subset





being independent of the index l. It would follow that the intersection of G_1
with the σ_k is compact on G. Now G_1 itself is not compact on G; hence there
must exist a (1,1) continuous image of (0 ≤ t < 1), w(t) (⊂ G_1), such that,
whenever t_p → 1, {w(t_p)} is properly divergent. The arc w(t) must lie wholly
in F_1 or wholly in F_2 for t sufficiently near unity. Assume that for t ≥ t_0 > 0,
w(t) ⊂ F_1, the treatment of the other possibility being similar. Each g_l being
assumed compact, for t ≥ t_0, w(t) must have points in common with an infinite
number of distinct g_l and hence must cross γ for values of t arbitrarily near one.
This is manifestly impossible since γ is compact on G.

Let g_{l_0} denote therefore a g_l which is not compact on G and which lies in F_1.
The simply-connected region g_{l_0} is bounded in part by an arc of γ having
endpoints on distinct σ_k. The antecedent of g_{l_0} in |z| < 1 with respect to the
identification mod ©(F) consists of the denumerable sum of disjoint
simply-connected regions, every one of which is bounded by a closed Jordan curve
consisting of points of the antecedent of γ and of |z| = 1, this Jordan curve
containing an arc of |z| = 1 with a point where ©(F) ceases to be properly
discontinuous in its interior, since g_{l_0} is not compact on G. This violates the
monodromy theorem, since each component of the antecedent of g_{l_0} cannot
contain distinct points which are equivalent with respect to ©(F), and yet each
component of the antecedent of g_{l_0} must contain infinitely many distinct points
equivalent with respect to ©(F) in the neighborhood of the points on its boundary
where ©(F) ceases to be properly discontinuous. We infer

Theorem 6.1: If F is continuable and has no boundary elements of the first type, 
then G of (5.5) defines a maximal continuation of F. 

7. A special type of maximal extension 

In accordance with the theorem of Koebe already cited [6] it is possible to
map a planar Riemann surface F_0 (1,1) and directly conformally onto a plane
region. Furthermore, it is possible to map a plane region (1,1) and directly
conformally onto a region which is dense in the extended complex plane. This is
evident, if the region is simply-connected; on the other hand, if the region is
multiply-connected, it follows from a theorem of de Possel [13] which states that
any multiply-connected plane region may be mapped (1,1) and directly
conformally onto a plane region bounded by a totally disconnected set (possibly
vacuous) and a system of segments parallel to the real axis so that this image
region is dense in the extended plane.

Does an arbitrary Riemann surface F admit a maximal continuation G such 
that there is in G a (1,1) directly conformal image of F which is dense in G? 
This question was proposed to the author by Professor Bochner. As we shall
see, it is to be answered in the affirmative. Here, too, we shall assume that F 
does not admit the Riemann sphere or a closed Riemann surface of genus one 
as a maximal continuation (§1); the treatment of the former case has been 
indicated and the proof in the latter case may be readily supplied. 





We shall say that a Riemann surface G which is a continuation of a given 
continuable Riemann surface F and which has a dense subset (1,1) directly 
conformally equivalent to F, is a dense continuation of F. Let denote the 
class of Riemann surfaces G which are dense continuations of F . 

The class is not vacuous. Recall that, if F is continuable, then either F 
has boundary elements of the first type, or else admits a r-transversal (§5). In 
the first case, the existence of a G e SFo is assured by the proof of Theorem 5.2. 
In the second case, let d denote the simply-connected subregion of F which is
bounded in part by r. The region d may be mapped (1,1) and directly
conformally onto d_x, consisting of the interior of the unit circle in the
x (= x_1 + ix_2)-plane deleted in the set (0 ≤ x_1 < 1), in such a manner that the
transversal r corresponds to the circumference |x| = 1 deleted at the point x = 1.
Using the device of de Possel once again to identify points of |x| < 1 slit in
(0 ≤ x_1 < 1) and points of d which correspond under the conformal
transformation between the two regions being considered, we obtain a dense continuation
of F. Hence the class of dense continuations of F is not vacuous.

If there exists aCe^o which is maximal, the problem is settled. Otherwise, 
let v denote g.l.b./ «e 0 /'(0) where 0 O is the class of f(z) of (3.1) associated with 
the G of for which the corresponding map of F into G defines G as a dense 
continuation of F. Clearly v ^ /x (cf. (4.3)). A remark of importance is that 
f(z) of (3.1) belongs to 0 O , if and only if the image of | z | < 1 under f{z) is 
dense in | Z | < 1. This implies that >ko C $ 0 . 

Let Gi e * 0 and have the property that there is an associated /i(z) e 0 O for 
which f[( 0) < 3i>/2. Such a Gi and /i(z) exist. Since G\ is continuable, we 
apply the argument just employed for F to Gi . We let denote the class of 
dense continuations of Gi ; we carry over the uniformization of (3.1) taking 
| Z\ | < 1 as the universal covering surface of G \, letting Z\ = 0 correspond to 
the point of G\ which is the image of of F. The class 0i bears the same 
relation to Gi that 0o bears to F, and v\ replaces v . It is to be noted that 
vi ^ 2/3; else there would exist an / 6 0 O for which /'(0) < v. Proceeding as 
above, we infer the existence of a G 2 c and an/ 2 (zi) e 0i for which/ 2 (0) < 4vi/3. 
Advancing step by step, we apply the same argument to (j 2 , letting (0*, , 

Wo k) f Zk , vk) for k = 2 bear the same relation to this system for k = 1 that this 
system for k = 1 bears to (0o, 'J'o, Wo , z , v). The connotation of the symbolism 
is clear. We observe that ^ 3/4 and that there exists an fz(z 2 ) €^Sr 2 with 
fz{ 0) < 5v 2 /4. We carry out this procedure inductively obtaining at the n th 
stage the set (9 n , , Wo n) , z n , v n ) and/ n +i(z n ) € , where v n > (n + 1 )/{n + 2) 

and/l + i(0) < (n + 3 )v n /{n + 2). 

Now consider the sequence of functions, {h_n(z)}, defined by the inductive
relations

(7.1) h_1(z) = f_1(z), h_n(z) = f_n(h_{n−1}(z)) (n ≥ 2).





The function h_n(z) defines G_n as a dense continuation of F in accordance with
the conditions A_1-A_4 of §3. The sequence {h_n(z)} converges continuously in
the sense of Carathéodory for |z| < 1. This is established by observing that

(7.2) h_n'(0) > μ (n = 1, 2, ...)

in accordance with (4.3) and that

(7.3) |h_n(z)| ≤ |h_{n−1}(z)|

for |z| < 1 and n = 2, 3, ... by Schwarz’s lemma. As a consequence of the
proof of Theorem 4.1, h(z), the limit function of the sequence {h_n(z)}, defines
a homomorphism or isomorphism of ©(F) onto a Fuchsoid group ©* with the
property that G* = (|Z| < 1) (mod ©*) is a continuation of F. If we establish
that G* is a dense continuation of F and that G* is maximal, then the question
raised at the beginning of the present section is to be answered in the affirmative.

Lemma 7.1: The Riemann surface G* is a dense continuation of F.

Define the sequence {h_n^(k)(z_k)} by the relations

(7.1)^(k) h_1^(k)(z_k) = f_{k+1}(z_k), h_n^(k)(z_k) = f_{k+n}(h_{n−1}^(k)(z_k)) (k = 1, 2, ...; n = 2, 3, ...).

The argument applied to {h_n(z)} shows that the sequence {h_n^(k)(z_k)} converges
continuously for |z_k| < 1 as n → ∞. The limit function, h^(k)(z_k), defines an
isomorphism or homomorphism of ©(G_k) onto ©*, and G* is a continuation of
G_k (k = 1, 2, ...). We also have the further relations

(7.4) h^(k)[h_{k−l}^(l)(z_l)] = h^(l)(z_l) for l < k; h^(k)(h_k(z)) = h(z) (k ≥ 1).

Let G_0* denote the image of F in G* defined by the map of F into G* associated
with h(z) in accordance with §3; and, in general, let G_k* denote the image
of G_k in G* defined by the map of G_k into G* associated with h^(k)(z_k) (k =
1, 2, ...). Taking k = l + 1 in the first of the relations (7.4) and k = 1 in
the second, we infer

(7.5) G_0* ⊂ G_1* ⊂ G_2* ⊂ ··· .

We shall show that

(7.6) lim_{n→∞} G_n* = G*.

Note that

(7.7) lim_{k→∞} h^(k)(z) ≡ z.

This follows from the fact that

dh^(k)(z)/dz |_{z=0} = ∏_{i>k} f_i'(0),

coupled with the relation lim_{k→∞} ∏_{i>k} f_i'(0) = 1 (∏_{i=1}^∞ f_i'(0) = m). If (7.6)
were not true, then there would exist a point w* ∈ G*, not belonging to G_n* for





any whole number n. But then the functions h^(k)(z_k) (k = 1, 2, ...) would
omit for |z_k| < 1 the set of points {UZ_a}, where Z_a is an antecedent of w* in
|Z| < 1 and U ∈ ©*, and hence the relation (7.7) would be violated.

It remains to be shown that G* − G_0* is nowhere dense in G*. Note that
G* − G_0* is closed, being the complement of G_0* in G*, and that by (7.1) it is
equal to

(7.8) (G_1* − G_0*) + (G_2* − G_1*) + (G_3* − G_2*) + ··· .

In accordance with the relations (7.4), G_1* − G_0* is nowhere dense in G_1*, and
G_{k+1}* − G_k* is nowhere dense in G_{k+1}* for (k = 1, 2, ...). Hence G_1* − G_0* and
G_{k+1}* − G_k* (k = 1, 2, ...) are nowhere dense in G*. It follows that G* − G_0*
is of the first category and hence nowhere dense in G* since it is closed.

The validity of Lemma 7.1 is established. If G* is maximal, our original 
question is settled. 

Suppose, therefore, that G* is not maximal. Then there would exist a dense
continuation of G*, say H. Let f*(Z) have the corresponding significance for G*
and H that f_{n+1}(z_n) has for G_n and G_{n+1}. Since the Riemann surface H is a
dense continuation of G_n (n = 1, 2, ...), it follows that

(7.9) r[h (n \z n )]eQ „ (» - 1, 2, • • • )• 


But v n of 0 W satisfies v n > (n + 1 )/(n + 2), and hence by (7.7) 


df*(Z)/dZ |_{Z=0} = 1;

by Schwarz’s lemma f*(Z) ≡ Z, which is manifestly contrary to the assumption
made. We have

Theorem 7.1: A continuable Riemann surface always admits a maximal dense 
continuation . 


8. A remark 

We have left out of consideration those Riemann surfaces which admit the 
sphere or a closed Riemann surface of genus one as a continuation in order to 
preserve the unity of presentation. The treatment of these classes of Riemann 
surfaces follows from our discussion with appropriate modifications. 


The Institute for Advanced Study 

Bibliography 

1. S. Bochner, Fortsetzung Riemannscher Flächen, Math. Annalen, vol. 98 (1927), pp. 406-421.

2. L. R. Ford, Automorphic Functions, New York, 1929.

3. M. Fréchet, Sur quelques points du calcul fonctionnel, Thesis, Paris, 1906.

4. G. Julia, Principes géométriques d'analyse I, Paris 1930.

5. B. v. Kerékjártó, Vorlesungen über Topologie I, Berlin 1923.

6. P. Koebe, Über die Uniformisierung beliebiger analytischer Kurven III, Göttinger
Nachrichten, 1908, pp. 337-368.





7. P. Montel, Leçons sur les familles normales des fonctions analytiques, Paris 1927.

8. R. Nevanlinna, Ein Satz über offene Riemannsche Flächen, Annales Acad. Sci. Fenn.,
Ser. A T.54. No. 3. (1940).

9. R. Nevanlinna, Eindeutige analytische Funktionen, Berlin 1936.

10. R. de Possel, Sur le prolongement des surfaces de Riemann, C. R. Acad. Sci. de Paris,
vol. 186 (1927) pp. 1092-1095.

11. R. de Possel, Sur le prolongement des surfaces de Riemann, C. R. Acad. Sci. de Paris,
vol. 187 (1928) pp. 98-100.

12. R. de Possel, Quelques questions de représentation conforme, Thesis, Paris 1932.

13. R. de Possel, Zum Parallelschlitztheorem unendlich-vielfach zusammenhängender Gebiete,
Göttinger Nachrichten, 1931, pp. 199-202.

14. T. Radó, Ueber eine nicht-fortsetzbare Riemannsche Mannigfaltigkeit, Math. Zeitschrift,
vol. 20 (1924) pp. 1-6.

15. T. Radó, Ueber den Begriff der Riemannschen Fläche, Acta Szeged, vol. 2 (1923)
pp. 101-121.

16. H. Seifert and W. Threlfall, Lehrbuch der Topologie, Leipzig 1934.

17. H. Weyl, Die Idee der Riemannschen Fläche, Leipzig 1923.



Annals of Mathematics
Vol. 43, No. 2, April, 1942


LATTICE-ORDERED GROUPS 

By Garrett Birkhoff 
(Received December 10, 1941) 

1. Introduction 

We shall be concerned below with lattice-ordered groups, or l-groups,¹ in the
sense of the following definition.

Definition. An l-group is

(I) a group, on which is defined a binary inclusion relation which is “homogeneous”
in the sense that

(II) x ≥ y implies a + x + b ≥ a + y + b for all a, b,

and relative to which

(III) the group is a lattice.

This defines l-groups as abstract algebras; as such, we can (and shall) apply
to them such general algebraic concepts as l-subgroup (subalgebra), isomorphism,
homomorphism, and so on.

Three important topics will be included as special cases under the single
heading of l-groups: the additive and multiplicative groups of ordered fields,
which have long been studied by Hahn, Artin, and others, and are now
extensively used in valuation theory; the study of abstract number and ideal theory
initiated by Dedekind, and recently amplified by Krull, Ward, Lorenzen,
Clifford, Dilworth and others; and the semi-ordered function spaces studied very
recently by Riesz, Freudenthal, Kantorovitch, the author, Bohnenblust, Stone,
Kakutani, and others.

It should be stressed, however, that up to the present time only l-groups which
are commutative or simply ordered have been studied; and it came as a
considerable surprise to the author that the non-commutative case involved so few
new difficulties.

The material below breaks up rather naturally into several parts. First
(§§2-8) Postulates (I)-(III) are discussed, and various other equivalent systems
of postulates (together with numerous examples) are derived. Then, after
brief preliminaries on algebraic formalism, the general structure and
decomposition theory of l-groups is treated (§§9-13). After this, a complete classification
of commutative l-groups whose structure lattice has finite length is given
(§§14-20). After this, in §§21-26, special properties of complete l-groups are discussed.
Fifth, two important generalizations are taken up (§§27-28). The paper then
concludes with a list of sixteen unsolved problems (§§30-31), some of which
are fundamental.


1 We are adopting the convenient terminology of M. H. Stone, “A general theory of 
spectra. II.,” Proc. Nat. Acad. Sci. 27 (1941), pp. 83-87. 






2. Explanation of (I) 

The reader will be assumed to be familiar with the definitions of a group and
of a commutative group, and with the algebraic manipulation of the elements
of such groups under the additive notation. Thus 0 will denote the group
identity, −a the group inverse of a, a + b the result of combining a with b,
and na (n any integer) will denote the nth “power” of a in the cyclic subgroup
generated by a.

In the commutative case, the rules of manipulation may be summarized in
the statement that the group behaves like a vector space over the domain of
integers. As it may be shown that every element of an l-group is of infinite
order (that is, na = 0 implies n = 0 or a = 0), even the cancellation laws hold.
In addition,² an equation of the form nx = ma (n ≠ 0) has at most one solution.
If such a solution exists, it may be denoted (m/n)a, and regarded as a rational
scalar multiple of a. In particular (op. cit., §1) the correspondence (m/n)a →
(m/n) is, for any fixed a, an isomorphism of the set of rational multiples of a
and a subgroup (which always contains all integers) of the additive group of
all rational numbers. Thus the set of all rational scalar multiples of a may be
thought of as a generalized cyclic subgroup.

3. Explanation of (II) 

The concept of homogeneity applies to any binary relation on a group.

Definition. A binary relation ≥ on a group G is called left-homogeneous if
and only if x ≥ y implies, for all a ∈ G, that a + x ≥ a + y, and right-homogeneous
if and only if it implies x + a ≥ y + a. A relation which is both left- and
right-homogeneous is called homogeneous.

Theorem 1. Homogeneity is equivalent to the assertion that
(II') every group translation x → a + x + b is a lattice-automorphism. (I)³

Proof. The condition (II') is clearly sufficient. To prove its necessity,
recall first that all group translations are one-one. Second, not only does
x ≥ y imply a + x + b ≥ a + y + b, but conversely a + x + b ≥ a + y + b
implies

(−a) + a + x + b + (−b) ≥ (−a) + a + y + b + (−b)

or x ≥ y. That is, (II) implies that any group translation is an automorphism
with respect to the relation ≥, as asserted.
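
Theorem 1 can be spot-checked on a finite sample. The sketch below is only an illustration under stated assumptions: the model (integer pairs under componentwise addition, ordered componentwise) and the helper names `add`, `geq`, and `translation_is_order_automorphism` are hypothetical choices that do not come from the text.

```python
from itertools import product

# Assumed model (not from the text): the group Z x Z under componentwise
# addition, with (x1, x2) >= (y1, y2) meaning x1 >= y1 and x2 >= y2.
def add(u, v):
    return (u[0] + v[0], u[1] + v[1])

def geq(u, v):
    return u[0] >= v[0] and u[1] >= v[1]

def translation_is_order_automorphism(a, b, sample):
    """Check that x -> a + x + b preserves and reflects >= on the sample."""
    def translate(x):
        return add(add(a, x), b)
    return all(
        geq(x, y) == geq(translate(x), translate(y))
        for x in sample
        for y in sample
    )

sample = list(product(range(-2, 3), repeat=2))
print(translation_is_order_automorphism((3, -1), (-5, 2), sample))
```

A passing check on a sample is of course no proof; it only illustrates that the relation is both preserved and reflected by a translation, which is the content of (II').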

Theorem 2. Homogeneity is equivalent to the assertion that every transformation
of the form x → a − x + b is a dual automorphism.⁴ (I)

2 For the special properties of such groups, cf. Reinhold Baer, “Abelian groups without
elements of finite order,” Duke Jour. 3 (1937), pp. 68-122.

3 The postulate numbers in parentheses after the statement of a theorem refer to the
postulates which are needed to prove the theorem in question. Many of the theorems below
have a generality which far transcends the theory of l-groups.

4 Especially interesting are the “inversions” x → a − x + a, which are of period two and
have a for fixpoint.





(This means that the correspondence replaces the given homogeneous
relation by its converse.)

Proof. Assuming (II), x ≥ y is equivalent to

a + (−x) + x + (−y) + b ≥ a + (−x) + y + (−y) + b

by Theorem 1. But this is a − y + b ≥ a − x + b, and so the condition is
necessary. It is sufficient since it implies that x → 0 − ((−b) − x + (−a)) +
0 = a + x + b is the product of two dual automorphisms, hence an
automorphism.

Definition. An element a of an l-group G is called positive if a ≥ 0. The
set of all positive elements of G will be denoted G⁺.

Theorem 3. Homogeneity is equivalent to the assertion that, for some set of
“positive” elements invariant under all inner automorphisms x → −a + x + a,
x ≥ y if and only if x − y is positive. (I)

Proof. Assuming (II), clearly x ≥ y if and only if x − y ≥ y − y = 0;
moreover t ≥ 0 implies −a + t + a ≥ −a + 0 + a = 0 for all a. Conversely,
for any set S invariant under all inner automorphisms, the relation (x − y) ∈ S
is homogeneous since (a + x + b) − (a + y + b) = −(−a) + (x − y) + (−a)
is, for all a, b ∈ G, the transform of x − y under an inner automorphism.

Corollary. If G is commutative, homogeneity is equivalent to the assertion
that, for some set of positive elements, x ≥ y if and only if x − y is positive.
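
The passage from a set of positive elements to a homogeneous relation, as in Theorem 3 and its corollary, can be sketched as follows. The concrete cone (the lexicographic one on integer pairs) and all function names are assumptions made for illustration only.

```python
from itertools import product

# Assumed model (not from the text): Z x Z with the lexicographic positive cone.
def add(u, v):
    return (u[0] + v[0], u[1] + v[1])

def sub(u, v):
    return (u[0] - v[0], u[1] - v[1])

def in_cone(u):
    """Positive elements: first coordinate > 0, or it is 0 and the second is >= 0."""
    return u[0] > 0 or (u[0] == 0 and u[1] >= 0)

def geq(x, y):
    """Order defined from the cone as in Theorem 3: x >= y iff x - y is positive."""
    return in_cone(sub(x, y))

def is_homogeneous_on(sample):
    """Check that x >= y implies a + x + b >= a + y + b over the sample."""
    return all(
        geq(add(add(a, x), b), add(add(a, y), b))
        for a, b, x, y in product(sample, repeat=4)
        if geq(x, y)
    )

sample = list(product(range(-1, 2), repeat=2))
print(is_homogeneous_on(sample))
```

Because the group here is commutative, invariance of the cone under inner automorphisms is automatic, which is the situation of the corollary.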

4. Explanation of (III) 

Postulate III asserts that the inclusion relation x ≥ y satisfies the usual
conditions,

P1. For all x, x ≥ x,

P2. If x ≥ y and y ≥ x, then x = y,

P3. If x ≥ y and y ≥ z, then x ≥ z,

L'. Any two elements x and y have a l.u.b. x ∪ y,

L''. Any two elements x and y have a g.l.b. x ∩ y.

We recall that in any lattice,⁵ the three relations x ≥ y, x ∩ y = y, and
x ∪ y = x are mutually equivalent; indeed, this is even true in any “partially
ordered system” satisfying P1-P3. It follows that an automorphism with
respect to one of the relation or operations ≥, ∪, ∩ is necessarily an
automorphism with respect to all three. Hence we get as a corollary of Theorem 1,

Theorem 4. Left-homogeneity is equivalent to either of the dual left-distributive
laws⁶

(1) a + (x ∪ y) = (a + x) ∪ (a + y),

(1') a + (x ∩ y) = (a + x) ∩ (a + y);

right-homogeneity to either right-distributive law. (I, P1-P3)
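
As a small illustration of the distributive laws of Theorem 4, the following sketch tests them exhaustively on a finite sample. The model is an assumption: integer triples ordered componentwise, so that the join is the componentwise maximum and the meet the componentwise minimum; the function names are hypothetical.

```python
from itertools import product

# Assumed model (not from the text): integer triples, ordered componentwise,
# so that the join is the componentwise max and the meet the componentwise min.
def add(u, v):
    return tuple(p + q for p, q in zip(u, v))

def join(u, v):
    return tuple(max(p, q) for p, q in zip(u, v))

def meet(u, v):
    return tuple(min(p, q) for p, q in zip(u, v))

def distributive_laws_hold(a, x, y):
    """Laws (1) and (1') of Theorem 4 for one choice of a, x, y."""
    law_1 = add(a, join(x, y)) == join(add(a, x), add(a, y))
    law_1_dual = add(a, meet(x, y)) == meet(add(a, x), add(a, y))
    return law_1 and law_1_dual

sample = list(product(range(-1, 2), repeat=3))
print(all(distributive_laws_hold(a, x, y) for a, x, y in product(sample, repeat=3)))
```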

5 The terminology and notation are identical with that of the author’s book “Lattice
theory,” New York, 1940, although scant use will be made of the theorems proved there.

6 Discovered by Dedekind and independently Freudenthal; see footnote 13.





(In case L'-L'' do not hold, the existence of the join (meet) on one side of
an equation is intended to be equivalent to the existence of that on the other.)

We can prove from the left- and right-distributive laws just stated, and finite
induction, also the following more general finite distributive laws

a + ⋁_j y_j = ⋁_j (a + y_j) and ⋁_i x_i + a = ⋁_i (x_i + a),

⋁_i x_i + ⋁_j y_j = ⋁_{i,j} (x_i + y_j),

Σ_i (⋁_j x_{i,j}) = ⋁_f (Σ_i x_{i,f(i)}),

and their lattice duals.

Theorem 5. Homogeneity is equivalent to the “monotonicity law”:

(2) x ≥ x' and y ≥ y' imply x + y ≥ x' + y'. (I, P1, P3)

Proof. Applying homogeneity twice, we get

x + y ≥ x + y' ≥ x' + y',

whence (2) follows by P3. Conversely, assuming P1, we get as special cases of
(2), x + y ≥ x + y' and x + y ≥ x' + y, implying homogeneity.

Again, a permutation of the elements of a partially ordered system is a dual
automorphism if and only if it interchanges the operations ∪ and ∩. Hence,
from Theorem 2, we get

Theorem 6. Homogeneity is equivalent to the laws

(3) a − (x ∪ y) + b = (a − x + b) ∩ (a − y + b),

(3') a − (x ∩ y) + b = (a − x + b) ∪ (a − y + b). (I, P1-P3)

We note as a special case

(4) x ∩ y = −(−x ∪ −y) and dually.

From this we see that the lattice postulate L'' is redundant, in the sense that it
is implied by I, II, P1-P3, and L'.

5. Stone’s postulates 

But now P1-P3 and L' are equivalent by pure lattice theory to the assertion
that our system admits an idempotent, commutative and associative operation
x ∪ y, in which x ≥ y means x ∪ y = x. Hence splitting (1) in two parts
(right- and left-translations), we get as a corollary of Theorem 4 and the
redundance of L'',

Theorem 7 (Stone⁷). An l-group may be defined as a group, with a second


7 Stone assumed the group to be commutative, in which case one of the distributive laws
(1'') can be omitted (cf. Stone, op. cit.).





binary operation ∪ which is idempotent, commutative, and associative, and satisfies
the distributive laws

(1'') a + (x ∪ y) = (a + x) ∪ (a + y),

(x ∪ y) + b = (x + b) ∪ (y + b).

It is a curious fact that, in virtue of the duality principle, substitution of ∩
for ∪ in the above system of postulates should also define an l-group!
Not only can we delete L'' from our list of postulates, but we can even weaken
L'.

Definition. By the positive part a⁺ of an element a of an l-group, is meant
a ∪ 0; a⁻ = a ∩ 0 is dually called the negative part of a.

Using right-homogeneity, we get

(5) a ∪ b = (b − a)⁺ + a = (a − b)⁺ + b.

Combining (5) with the dualization law (3), we get

(6) a ∩ b = −(−a + (a − b)⁺) = −(a − b)⁺ + a.

There follows immediately

Theorem 8. The lattice hypotheses L'-L'' can be replaced by the condition that,
for all a, a ∪ 0 should exist. (I, II, P1-P3)
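
The way the join and meet are recovered from the positive part alone, via formulas (4), (5) and (6), can be illustrated on a concrete model. The sketch assumes integer pairs ordered componentwise; since this model is commutative, it does not exercise the order of the terms in the non-commutative formulas. All names are hypothetical.

```python
from itertools import product

ZERO = (0, 0)

# Assumed model (not from the text): Z x Z, ordered componentwise.
def add(u, v):
    return tuple(p + q for p, q in zip(u, v))

def neg(u):
    return tuple(-p for p in u)

def sub(u, v):
    return add(u, neg(v))

def join(u, v):
    return tuple(max(p, q) for p, q in zip(u, v))

def meet(u, v):
    return tuple(min(p, q) for p, q in zip(u, v))

def pos(a):
    """Positive part: the join of a with the group identity."""
    return join(a, ZERO)

def formulas_hold(a, b):
    """Formulas (4), (5), (6) for one pair a, b."""
    f4 = meet(a, b) == neg(join(neg(a), neg(b)))
    f5 = join(a, b) == add(pos(sub(b, a)), a) == add(pos(sub(a, b)), b)
    f6 = meet(a, b) == add(neg(pos(sub(a, b))), a)
    return f4 and f5 and f6

sample = list(product(range(-3, 4), repeat=2))
print(all(formulas_hold(a, b) for a in sample for b in sample))
```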

Also, a subgroup of an l-group is an l-subgroup if and only if it contains the
positive part of each of its members. Substituting in Theorem 7, we get a
further corollary.

Theorem 9. An l-group may be defined as a group with a unary operation
a → a* which satisfies

(7) 0* = 0, (8) c = c* − (−c)*,

(9) the operation (a − b)* + b is associative.

A worth-while problem would be to find a less clumsy form of (9). In this 
connection, (a*)* = a* might be a useful partial substitute. One might also 
try setting the middle letter of the associative law equal to 0. 
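
Conditions (7), (8), (9) can be tried out on the simplest case. The sketch below assumes the additive group of integers with a* taken to be max(a, 0); the names `star`, `op`, and `conditions_hold` are hypothetical, and the check covers only a finite window of integers.

```python
from itertools import product

def star(a):
    """Candidate unary operation on the integers: the positive part max(a, 0)."""
    return max(a, 0)

def op(a, b):
    """The binary operation (a - b)* + b of condition (9)."""
    return star(a - b) + b

def conditions_hold(sample):
    """Conditions (7), (8), (9) of Theorem 9, checked on a finite sample."""
    c7 = star(0) == 0
    c8 = all(c == star(c) - star(-c) for c in sample)
    c9 = all(
        op(op(a, b), c) == op(a, op(b, c))
        for a, b, c in product(sample, repeat=3)
    )
    return c7 and c8 and c9

print(conditions_hold(range(-4, 5)))
print(op(3, 7), op(7, 3))
```

In this model the operation of (9) is simply the maximum of a and b, which is why it comes out associative.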

6. Examples 

In the next two sections, we shall be using Theorem 3 as our main tool.
First, we note that in order to describe an l-group G up to isomorphism, it is
sufficient by Theorem 3 to describe the set of “positive” elements; indeed,
this principle is independent of Postulate III. We shall now describe some
important examples of l-groups in this way.

Example 1. G is the additive group of real numbers; G⁺ consists of all those
which are non-negative.

Example 2. G is the additive group of the integers; G⁺ is defined as in
Example 1.

Example 3. G is the group of all positive rational numbers under multiplication
(the integer one is the group identity); G⁺ is the set of all positive integers.





Example 4. G is the group of all vectors x = (x', x'') with two real
components; G⁺ contains x if and only if x' > 0, or x' = 0 and x'' ≥ 0.

Example 5. G is the additive group of all real functions defined on the
interval 0 ≤ x ≤ 1; G⁺ consists of all those which are non-negative (satisfy
f(x) ≥ 0 for all x).

Example 6. G is the additive group of functions of bounded variation on
0 ≤ x ≤ 1 with f(0) = 0; G⁺ defined as in Example 5.

Example 7. G as in Example 6; G⁺ consists of all “increasing” functions
(functions for which x ≥ y implies f(x) ≥ f(y)).
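
Example 3 lends itself to a short computation. In the sketch below, written multiplicatively, x ≥ y means that the quotient x / y is a positive integer; the formulas used for the join and meet (least common multiple of numerators over greatest common divisor of denominators, and dually) are an assumption of this illustration rather than something stated in the text, and the function names are hypothetical.

```python
from fractions import Fraction
from math import gcd

def lcm(m, n):
    return m * n // gcd(m, n)

def geq(x, y):
    """x >= y in Example 3: the quotient x / y is a positive integer."""
    return (x / y).denominator == 1

def join(x, y):
    """Least upper bound of two positive rationals given in lowest terms."""
    return Fraction(lcm(x.numerator, y.numerator), gcd(x.denominator, y.denominator))

def meet(x, y):
    """Greatest lower bound of two positive rationals given in lowest terms."""
    return Fraction(gcd(x.numerator, y.numerator), lcm(x.denominator, y.denominator))

x, y = Fraction(4, 3), Fraction(6, 5)
print(join(x, y), meet(x, y))
print(geq(join(x, y), x), geq(join(x, y), y), geq(x, meet(x, y)), geq(y, meet(x, y)))
```

In this commutative example the product of the join and the meet equals the product x · y, the multiplicative form of the familiar identity relating the sum of two elements to their join and meet.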

We shall now list some examples of non-commutative l-groups. The simplest
example consists of the two-parameter non-Abelian Lie group, lexicographically
ordered as follows.

Example 8. G consists of all couples (x, y) of real numbers, where addition
is defined by the formula

(x, y) + (x', y') = (x + x', e^{x'} y + y');

G⁺ consists of all those couples with x > 0 or x = 0, y ≥ 0.

Example 9. G has three generators of infinite order, and defining relations
a + b = b + a, a + c = c + b, b + c = c + a; G⁺ contains ma + m'b + nc
if and only if n > 0, or n = 0 while m ≥ 0 and m' ≥ 0.

Example 10. G consists of the x > 0 of any ordered field or skew-field⁷ᵃ
(division ring) under multiplication; G⁺ consists of all x ≥ 1.

For purposes of comparison, we shall also list various other examples which
satisfy Postulates I-II and part, but not all, of Postulate III.

Example 11. G is any group; $G^+$ consists of the identity 0 alone.

Example 12. G is the multiplicative group of all non-zero elements of any
algebraic number field; $G^+$ is the subset of all (algebraic) integers in G.

Example 13. G is the group of all elements of any integral domain of
characteristic infinity under addition; $G^+$ is the subset of all sums of squares.

7. Postulates of order reinterpreted 

It is trivial that the systems described in Examples 1-13 satisfy Postulates
I-II if $a \ge b$ is defined to mean $(a - b) \in G^+$ (cf. Theorem 3). We shall now
give simple tests for the validity of parts $P_1$-$P_3$ of Postulate III.

Lemma 1. The reflexive law $P_1$ is equivalent to: (9) The group identity is
positive. (I, II)

For $(a - a) \in G^+$ is equivalent to $0 \in G^+$ by group theory. This condition is
evidently satisfied in Examples 1-12 above.

Lemma 2. The transitive law $P_3$, the monotonicity law (2) of Theorem 5, and
the condition that

(10) Any sum of positive elements is positive, are mutually equivalent. (I, II, $P_1$)

Proof. By Theorem 5, $P_3$ implies (2) modulo I, II, $P_1$. Again (2) implies
the closure of $G^+$ as the special case: $a \ge 0$ and $b \ge 0$ imply $a + b \ge 0 + 0 = 0$.
Finally, since $(a - b) + (b - c) = (a - c)$, the closure of $G^+$ implies $P_3$ as in
Theorem 3.

* Cf. K. Reidemeister, “Grundlagen der Geometrie,” p. 40.

Corollary. The positive elements of any l-group form a semigroup.*

It is also a corollary that $P_3$ holds in Examples 1-13 above.

Lemma 3. The antisymmetric law $P_2$ is equivalent to asserting that (11) a and
$-a$ are both positive only if $a = 0$. (I, II)

Proof. If $(x - y) \in G^+$ and $(y - x) \in G^+$ imply $(x - y) = 0$, then $P_2$ holds.
Conversely, if $P_2$ holds, $z \ge 0$ and $-z \ge 0$ imply $z = 0$.

It may now be checked easily that $P_2$ holds in Examples 1-11 above, although
not in Examples 12-13.

A similar lemma, irrelevant here, is that the symmetric law ($a \ge b$ implies
$b \ge a$) is equivalent to asserting that $a \in G^+$ implies $-a \in G^+$. From this and
Lemmas 1-2 it follows that $G^+$ defines an equivalence relation if and only if
it is a subgroup of G. (I, II)

Finally, we can read off from Lemmas 1-2 and Theorem 8, the following not
very satisfactory result.

Lemma 4. The lattice hypotheses L'-L'' are equivalent to the following condition:
(12) Given a, there exists $a^+$ such that u and $(u - a)$ are both positive if and only
if $(u - a^+)$ is. (I, II, $P_2$, $P_3$)

From Theorem 3 and Lemmas 1-4, we conclude as the final theoretical result
of this section,

Theorem 10. An l-group may be defined as a group G with a subset $G^+$ of
“positive” elements which satisfies conditions (9)-(12).

We also conclude that Examples 1-10 above are l-groups. In Examples
1, 2, 4, 8, 10 this is true because the ordering is simple: for all a, either a or
$-a$ is in $G^+$; and so $a^+$ is a or 0 accordingly. In Example 3, $(m/n)^+$ is the
numerator of m/n when written in lowest terms; in Examples 5-6, $f^+$ is the “positive
part” of f as usually defined (equal, for all x, to the larger of f(x) or 0); in Example
7, $f^+$ is the “positive variation” of f. In Example 9, $(ma + m'b + nc)^+$ is
0 if $n < 0$, $(ma + m'b + nc)$ if $n > 0$, and $m^+ a + m'^+ b$ if $n = 0$.
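
The description of $(m/n)^+$ in Example 3 can be checked against condition (12) by brute force. The sketch below is only an illustration: the function names are ours, the group is written multiplicatively (so that $u - a$ becomes $u/a$), and the test ranges over a finite window of rationals u, which of course proves nothing in general.

```python
from fractions import Fraction

def is_positive(q):
    """Example 3: the positive elements are the positive integers."""
    return q > 0 and q.denominator == 1

def positive_part(q):
    """(m/n)+ is the numerator of m/n in lowest terms (Fraction normalizes)."""
    return Fraction(q.numerator)

def condition_12_holds(q, bound=30):
    """Check (12) multiplicatively for all u = k/j with 1 <= k, j <= bound:
    u and u/q are both positive if and only if u/q+ is positive."""
    q_plus = positive_part(q)
    for k in range(1, bound + 1):
        for j in range(1, bound + 1):
            u = Fraction(k, j)
            both = is_positive(u) and is_positive(u / q)
            if both != is_positive(u / q_plus):
                return False
    return True
```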

8. Fifth set of postulates 

We have characterized l-groups by four sets of postulates. Our definition
was in terms of the group operation and a binary relation; Theorem 7 in terms
of the group operation and a binary operation; Theorem 9 in terms of the group
operation and a unary operation; Theorem 10 in terms of the group operation
and a unary relation or set. We shall now give a fifth set of postulates for
l-groups which, oddly enough, is in terms of the group operation alone!

Evidently any l-group or other lattice has the following “Moore-Smith”
property:


* By a semigroup, we mean a system closed under an associative binary operation and
having an identity element, in which the laws of cancellation hold ($a + x = a + y$ implies
$x = y$ and so does $x + a = y + a$).





(13) Given a, b, there exists c with $c \ge a$ and $c \ge b$.

Lemma 1 (Clifford⁹). The Moore-Smith property is equivalent to the assertion
that

(14) Every element is a difference of positive elements. (I, II, $P_1$, $P_3$)

Proof. Assuming (13) with $b = 0$, we get $a = c - (-a + c)$, where $c \ge 0$
and $-a + c = -a + (c - a) + a \ge -a + 0 + a = 0$. Conversely, if $a = a' - a''$
and $b = b' - b''$, where $a', a'', b', b''$ are positive, then $c = a' + b'$
exceeds both a and b.

Now let A be any group with a relation $\ge$ satisfying II, $P_1$, $P_3$ and (14).
We shall show that A is determined to within isomorphism by the semigroup
$A^+$ of its positive elements. The proof is related to the general theory of the
extension of semigroups to groups.

Theorem 11. In the notation of the calculus of complexes, $a + A^+ = A^+ + a$,
for all $a \in A$.

Proof. Both sets consist of the elements containing a.

Corollary. Given a and x in $A^+$, there exist a unique $y \in A^+$ such that $a + x = y + a$
and z in $A^+$ such that $x + a = a + z$.

The existence follows from Lemma 1; the uniqueness from the cancellation
postulate defining semigroups.

Now observe that A consists by (14) of the differences $b - c$ of elements of
$A^+$, equated and combined by the rules

(15) $b - c = b' - c'$ if and only if t, u exist in $A^+$ such that $b + t = b' + u$
and $c + t = c' + u$,

(16) $(b - c) + (b' - c') = (b + b') - (c' + c'')$, where $c''$ is the unique solution
of $b' + c'' = c + b'$.

The sufficiency of (15) is clear; as regards the necessity, if we choose $s \ge b, b'$,
$t = -b + s$, $u = -b' + s$, then $b + t = s = b' + u$, while if $b - c = b' - c'$,
then $b + t - t - c = b' + u - u - c'$, whence $-t - c = -u - c'$ and
$c' + u = c + t$.

Clearly equations (15)-(16) describe the group structure of A in terms of that
of $A^+$. Moreover since

(17) $(b - c) \in A^+$ if and only if $b = c + t$ for some $t \in A^+$,

the lattice structure of A can also be described in terms of the group structure
of $A^+$. In fact, $b - c \ge b' - c'$ if and only if t, u exist such that $b + t \ge b' + u$
and $c + t \le c' + u$.
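
To make rules (15)-(16) concrete, here is a small Python sketch in the simplest special case, which is our choice and not the general setting of the text: S is the commutative semigroup of positive integers under multiplication (Example 3), so that a "difference" $b - c$ becomes the quotient b/c, stored as a pair (b, c). The names `equal` and `combine` are hypothetical. Commutativity is used twice: it supplies the witnesses $t = b'$, $u = b$ in (15), and it gives $c'' = c$ in (16).

```python
from fractions import Fraction

def equal(p, q):
    """Rule (15), multiplicatively: b/c = b'/c' iff b*t = b'*u and c*t = c'*u
    for some t, u in S. In the commutative case it suffices to test the
    witnesses t = b', u = b (the first equation then holds automatically)."""
    (b, c), (b2, c2) = p, q
    t, u = b2, b
    return b * t == b2 * u and c * t == c2 * u

def combine(p, q):
    """Rule (16), multiplicatively: (b/c)(b'/c') = (b*b') / (c'*c''), where
    b'*c'' = c*b'. With commutativity and cancellation, c'' = c."""
    (b, c), (b2, c2) = p, q
    return (b * b2, c2 * c)
```

For instance, `combine((2, 3), (9, 4))` returns the pair (18, 12), which `equal` identifies with (3, 2), in agreement with ordinary arithmetic of fractions.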

Conversely, suppose S is any semigroup in which, for all a, $a + S = S + a$
(in multiplicative language, such that the left-multiples of any element are all
right-multiples, and conversely). Then equations (15)-(16) may be shown to
define a group.¹⁰ We shall omit the details; one shows that (15) defines an
equivalence relation, which is a congruence relation with respect to the addition
defined by (16), and relative to which the latter is associative, and gives any
element $b - c$ an inverse $c - b$. Furthermore, under (17), the set of positive
elements forms a subset of A isomorphic with S.

9 A. H. Clifford, “Partially ordered Abelian groups,” Annals of Math. 41 (1940), pp.
465-473, esp. p. 467. From the equation $a = \tfrac{1}{4}((a + 1)^2 - (a - 1)^2)$, we see that (14)
holds in Example 13.

10 R. Baer has proved, in conversation, that a group can be constructed whenever, given
a and b, x and y can be found such that $a + x = b + y$.

It follows, by Theorem 10, that we get an l-group provided (11)-(12) hold.
But now (11) is clearly equivalent to

(17') If $a + b = 0$ in S, then $a = b = 0$.

Finally, if any two elements of S have a l.u.b. with respect to the definition

(17'') $b \ge c$ if and only if $b = c + t$ for some $t \in S$,

then for any $a = a' - a''$ of A ($a', a'' \in S$) there exists $a^+ = (a' \cup a'') - a''$,
which proves that condition (12) holds. There follows

Theorem 12 (von Neumann¹¹). An l-group may be defined as the extension
to a group of a (multiplicative) semigroup S, in which (i) $ab = 1$ implies $a = b = 1$,
(ii) $aS = Sa$ for all a, (iii) any two elements have a least common multiple. In
this group S consists of the positive elements.

Corollary¹². A commutative l-group may be defined as the extension to a
group of a commutative semigroup S, in which (i) and (iii) hold.

In fact, (i) is not really essential, if we are willing to introduce an equivalence
relation.


9. Distributive law; disjoint elements 

The following material belongs logically directly after §3, and is independent 
of the results of §§4-8 above. 

Theorem 13. In any l-group, we have for all a, b,

(18) $a - (a \cap b) + b = b \cup a$.

Proof. Substituting a for x and b for y in formula (3), Theorem 6, we get
(18) explicitly.

Corollary 1 (Dedekind¹³). In any commutative l-group,

(19) $a + b = (a \cap b) + (a \cup b)$ for all a, b.

In Example 3, the modular law (19) specializes to the celebrated identity
$ab = (a, b)[a, b]$ of number theory. It also specializes, setting $b = 0$, to

Corollary 2. For any a, $a = a^+ + a^-$.

In words, each element a is the sum of its positive part and its negative part
(so-called Jordan decomposition).
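
The specialization of (19) to Example 3 is easy to test numerically. In the sketch below (names ours), the meet and join of two positive rationals in the divisibility order are computed from numerators and denominators in lowest terms; this formula is our own elementary derivation from prime exponents, not something stated in the text. Setting the second argument equal to the identity 1 gives the multiplicative form of the Jordan decomposition.

```python
from fractions import Fraction
from math import gcd

def lcm(a, b):
    """Least common multiple of two positive integers."""
    return a * b // gcd(a, b)

def meet(p, q):
    """a ∩ b in Example 3: the greatest common divisor (a, b) of two rationals."""
    return Fraction(gcd(p.numerator, q.numerator), lcm(p.denominator, q.denominator))

def join(p, q):
    """a ∪ b in Example 3: the least common multiple [a, b] of two rationals."""
    return Fraction(lcm(p.numerator, q.numerator), gcd(p.denominator, q.denominator))
```

With `p = Fraction(6, 35)` one finds `join(p, Fraction(1)) == 6` and `meet(p, Fraction(1)) == Fraction(1, 35)`, whose product is p again.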

Theorem 14. Any l-group is a distributive lattice.¹⁴

11 This result was communicated orally to the author.

12 This result seems to have been known for ideals, but not in abstracto. The author has
been unable to find a precise reference; cf. Krull's “Idealtheorie.”

13 Discovered in 1897; cf. Ges. Werke, Brunswick, 1931, vol. II, p. 133, formula (13);
rediscovered by H. Freudenthal, “Teilweise geordnete Moduln,” Amsterdam Proc. 39
(1936), p. 642.

14 In the commutative case, discovered by Dedekind, op. cit., p. 135, formulas (18)-(19);
rediscovered by Freudenthal, op. cit., p. 642, formulas (3.2).





Proof. Bergmann has shown (“Lattice Theory”, p. 75) that a lattice is
distributive if and only if $a \cap x = a \cap y$ and $a \cup x = a \cup y$ imply $x = y$.
But they imply by (18),

$x = (a \cap x) - a + (x \cup a) = (a \cap y) - a + (y \cup a) = y$.

Theorem 15. In any l-group¹⁵, we have

(20') $a \cap b = 0$ and $a \cap c = 0$ imply $a \cap (b + c) = 0$,

(20'') $a \cup b = 0$ and $a \cup c = 0$ imply $a \cup (b + c) = 0$.

First Proof. By hypothesis and formula (1'), $c = (a \cap b) + c = (a + c) \cap (b + c)$.
Substituting,

$0 = a \cap c = a \cap (a + c) \cap (b + c) = a \cap (b + c)$,

since $a \le a + c$. The second conclusion follows by duality.

Second Proof. Since a, b, c are positive, clearly $a \cap (b + c) \ge 0$. But
by the distributive law (1'),

$0 = 0 + 0 = (a \cap b) + (a \cap c)$

$= (a + a) \cap (a + c) \cap (b + a) \cap (b + c) \ge a \cap (b + c)$,

proving (20'). Formula (20'') follows dually.

We can reword Theorem 15 in terms of the important concept of disjointness.

Definition. Two positive elements a and b will be called disjoint (in symbols,
$a \perp b$) if and only if $a \cap b = 0$.

In Example 3, this specializes to the concept of relative primeness. Theorem
15 asserts that the set of positive elements disjoint to any a is closed under
addition. Furthermore, if in Theorem 13 we assume $a \cap b = 0$ and apply
the commutative law to $b \cup a$, we get the

Lemma 1. Disjoint (positive) elements are permutable,

(21) If $a \cap b = 0$, then $a + b = b + a$.

Lemma 2. If $b \cap c = 0$, then $(b - c)^+ = b$ and $(b - c)^- = -c$.

Proof. By our preceding formulas, $(b - c) \cup 0 = (b \cup c) - c =
b - (b \cap c) + c - c = b$, and dually.

Lemma 3. If $na \ge 0$, then $a \ge 0$.

Proof. Expanding by the distributive law (1'), $n(a \cap 0) = na \cap
(n - 1)a \cap (n - 2)a \cap \cdots \cap a \cap 0$. But if $na \cap 0 = 0$, this equals
$(n - 1)a \cap (n - 2)a \cap \cdots \cap a \cap 0 = (n - 1)(a \cap 0)$. Now cancelling,
we get $a \cap 0 = 0$, as desired.

15 In the commutative case, observed by Dedekind, op. cit., p. 132; Proof 1 is Dedekind’s,
Proof 2 is von Neumann’s. Observe that in the proof, no restriction need be put on the
group operation (e.g., associativity); only distributivity is needed. Theorem 15 can be
generalized (§§27, 28).





Combining Lemma 3 with its dual, we get 

Theorem 16. In an l-group, every element is of infinite order except the identity.

Another corollary is the fact that, in any commutative l-group, $na \ge nb$
implies $n(a - b) \ge 0$, and so $a \ge b$. The author has been unable to prove the
plausible conjecture that this remains true in any l-group.

Lemma 4. The positive and negative parts of any element are disjoint; in
symbols,

(22) For any a, $(a \cup 0) \cap (-a \cup 0) = a^+ \cap (-a^-) = 0$.

Proof. Clearly $-(a \cap 0) = (-a \cup 0)$; hence the two left-hand terms are
equal. But now by the distributive law, $(a \cup 0) \cap (-a \cup 0) = (a \cap -a) \cup 0$,
so we need only show that $-(a \cap -a) = -a \cup a \ge 0$. But clearly
$a \cup -a \ge a \cap -a$; hence, subtracting, $(a \cup -a) - (a \cap -a) =
(a \cup -a) - (-(a \cup -a)) \ge 0$, or $2(a \cup -a) \ge 0$. Now use Lemma 3
with $n = 2$.

10. Free l-groups; absolute

Now let a be any element of any l-group, and set $b = a \cup 0$, $c = -a \cup 0$,
so that b and c are positive and disjoint, and $a = b - c$ (cf. Cor. 2 of Thm. 13
and Lemma 4 above). Further, by Theorem 15 and induction, $b \perp nc$ and
$mb \perp nc$ for all positive integers m and n. Further, by Lemma 1, b and c are
permutable, and so generate a commutative group, in which, for all integers
m, n, m', n',

$(mb + nc) \pm (m'b + n'c) = (m \pm m')b + (n \pm n')c$.

Finally, $(mb + nc)^+$ is $mb + nc$ unless m or n is negative, is 0 if m and n are
negative, is (by Lemma 2 above and the disjointness of positive integral
multiples of b and c) mb if n is negative but m is not, and is nc if m is negative
but n is not.

It follows that the $mb + nc$ form an l-subgroup, which is closed under lattice
and group operations (Theorem 8), and is homomorphic with the l-group of all
couples (m, n) of integers, in which $(m, n) \ge 0$ means that $m \ge 0$ and $n \ge 0$.
We shall (cf. §16) refer to this as the square of the l-group of the integers under
addition.

Theorem 17. The free l-group with one generator is isomorphic with the square
of the l-group of integers under addition.

In this group, a appears as the element $(1, -1)$, $a^+$ as $(1, 0)$, and $a^-$ as $(0, -1)$.
We can read off various corollaries from this representation.
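
The representation of Theorem 17 is simple enough to compute with directly. The following sketch (function names ours) models the square of the l-group of integers as pairs with coordinatewise order, so that join and meet are coordinatewise maximum and minimum, and recovers the positive and negative parts of the generator.

```python
ZERO = (0, 0)

def add(p, q):
    return (p[0] + q[0], p[1] + q[1])

def scale(n, p):
    return (n * p[0], n * p[1])

def join(p, q):
    """Least upper bound in the coordinatewise order."""
    return (max(p[0], q[0]), max(p[1], q[1]))

def meet(p, q):
    """Greatest lower bound in the coordinatewise order."""
    return (min(p[0], q[0]), min(p[1], q[1]))

def pos(p):
    """Positive part a+ = a ∪ 0."""
    return join(p, ZERO)

def neg_part(p):
    """Negative part a- = a ∩ 0."""
    return meet(p, ZERO)
```

With `a = (1, -1)` this gives `pos(a) == (1, 0)` and `neg_part(a) == (0, -1)`, as stated above, and the two parts are disjoint in the sense of Lemma 4.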

Theorem 18. In any commutative l-group A, the correspondence $a \to na$ is,
for any positive integer n, an isomorphism of A onto an l-subgroup of itself.

Proof. By pure group theory, it is a group homomorphism; by Theorem 16,
it is a group isomorphism; by Theorem 17, we get $(na)^+ = (n, -n)^+ = (n, 0) =
na^+$, and so it is isomorphic with respect to the unary operation of taking the
positive part; by formulas (4)-(5), it is therefore a lattice isomorphism.

Definition. By the absolute $|a|$ of an element a of an l-group, is meant
$a \cup -a$.





Theorem 19. In any l-group, we have identically:

(23) If $a \ne 0$, then $|a| > 0$, while $|0| = 0$,

(24) $|na| = |n| \cdot |a|$ for any integer n,

(25) $|a| = a^+ - a^-$,

(26) $|a - b| = (a \cup b) - (a \cap b)$,

(27) $|(a \cup b) - (a^* \cup b)| \le |a - a^*|$ and dually.


Proof. Formulas (23)-(25) are special cases of the representation of
Theorem 17. Again, using (25),

$|a - b| = ((a - b) \cup 0) - ((a - b) \cap 0) = ((a \cup b) - b) - ((a \cap b) - b)$,

from which (26) follows by group algebra. Finally, to prove (27), expand the
left-hand side by (26) to get $(a \cup b \cup a^*) - ((a \cup b) \cap (a^* \cup b))$, whence by the
distributive law, $|(a \cup b) - (a^* \cup b)| = ((a \cup a^*) \cup b) - ((a \cap a^*) \cup b)$.
This reduces (27) to the case $a \ge a^*$, or $a = a^* + t$ ($t \ge 0$). But
$(a^* + t) \cup b = (a^* \cup (b - t)) + t \le (a^* \cup b) + t$, which takes care of this
special case.
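
Formulas (26) and (27) can be spot-checked in a concrete commutative l-group. The sketch below uses integer vectors with coordinatewise order (our choice of model, in which join, meet and absolute are coordinatewise max, min and absolute value) and random samples from a seeded generator; it is a plausibility test, not a substitute for the proof.

```python
import random

def sub(p, q):
    return tuple(x - y for x, y in zip(p, q))

def join(p, q):
    return tuple(max(x, y) for x, y in zip(p, q))

def meet(p, q):
    return tuple(min(x, y) for x, y in zip(p, q))

def absolute(p):
    """|a| = a ∪ -a, computed coordinatewise."""
    return join(p, tuple(-x for x in p))

def leq(p, q):
    return all(x <= y for x, y in zip(p, q))

def check_theorem_19(trials=500, dim=4, seed=0):
    """Test (26), (27) and the dual of (27) on random integer vectors."""
    rng = random.Random(seed)
    vec = lambda: tuple(rng.randint(-9, 9) for _ in range(dim))
    for _ in range(trials):
        a, b, a_star = vec(), vec(), vec()
        if absolute(sub(a, b)) != sub(join(a, b), meet(a, b)):
            return False
        bound = absolute(sub(a, a_star))
        if not leq(absolute(sub(join(a, b), join(a_star, b))), bound):
            return False
        if not leq(absolute(sub(meet(a, b), meet(a_star, b))), bound):
            return False
    return True
```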

Remark. In a commutative l-group, we can also prove the triangle inequality
$|a + b| \le |a| + |b|$, but this does not seem to hold in general; also, the
author has been unable to generalize Theorem 7.8 of “Lattice Theory” to
l-groups which are not commutative.

Concerning the free l-group with two or more generators, much less can be
said. As an Abelian group, one can show that it has an infinite number of
disjoint independent elements. On the other hand, using the three distributive
laws (1), (1'), and that of Theorem 14, one can represent every element as a
finite meet of finite joins of finite sums

$\bigcap_i \bigcup_j \sum_k n_{ijk} a_k$

of the given generators $a_k$ and their inverses.¹⁶

11. l-ideals

It is well-known that the different homomorphic images of a given abstract
algebra can all be found by enumerating its different congruence relations.¹⁷
Also, with any group, the congruence relations correspond one-one with normal
subgroups: to each normal subgroup A of a group G corresponds the congruence
relation dividing G into the cosets of A. Therefore, the congruence relations
on an l-group are those decompositions into cosets of normal subgroups which
have the substitution property (S) for the two lattice operations; or
equivalently, by (4)-(5), make $a \equiv b$ imply $a^+ \equiv b^+$. But these are easy to describe.

16 The construction is identical with that used to prove Theorem 5.13 of “Lattice
Theory.”

17 By a “congruence relation” on an abstract algebra with binary operations is meant an
equivalence relation (i.e., reflexive, symmetric and transitive relation) denoted $\equiv$ which
has, if $\circ$ is any binary operation, the “substitution property”: (S) $a \equiv a'$ implies
$a \circ b \equiv a' \circ b$ and $b \circ a \equiv b \circ a'$.

Definition. By an l-ideal of an l-group G, is meant a normal subgroup of G
which contains with any a, also all¹⁸ x with $|x| \le |a|$.

Clearly G and 0 are l-ideals of G; they are called improper l-ideals; all other
l-ideals of G are called proper l-ideals.

It is a corollary that any l-ideal is a convex l-subgroup in the sense of
containing with any a and b, also $-a$, $a + b$, $a \cup b$, $a \cap b$, and every x between
$a \cap b$ and $a \cup b$. Indeed, if $a \cap b \le x \le a \cup b$, then

$|x| = x \cup -x \le (a \cup b) \cup -(a \cap b)$

$= a \cup b \cup -b \cup -a = |a| \cup |b| \le |a| + |b|$.

Theorem 20. The congruence relations on any l-group A are the partitions of A
into the cosets of its different l-ideals.

Proof. If N is the set of elements congruent to 0 under a congruence
relation, then $a \in N$ and $|x| \le |a|$ imply $a \cap -a \le x \le a \cup -a$; hence
$0 \cap 0 \le x \le 0 \cup 0$ mod N, and so $x \in N$. Conversely, if N is an l-ideal, then
$x \equiv x'$ mod N implies $|(x \cup y) - (x' \cup y)| \le |x - x'|$ by (27), and therefore
$x \cup y \equiv x' \cup y$ mod N. Using left-right symmetry and duality, we see that N
defines a congruence relation with respect to both lattice operations,
completing the proof.

Lemma 1. If $x \le a + b$, where x, a, b are positive, then $x = s + t$, where
$0 \le s \le a$, $0 \le t \le b$.

Proof. Set $t = x \cap b$; then $x = s + t$, where $0 \le s = x - (x \cap b) =
(x \cup b) - b \le (a + b) - b = a$ and $0 \le t \le b$, as desired.
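
The construction in this proof is effective: one simply takes $t = x \cap b$ and $s = x - t$. As an illustration only, the sketch below carries it out for integer vectors with coordinatewise order (our choice of model; the function names are ours).

```python
def sub(p, q):
    return tuple(u - v for u, v in zip(p, q))

def meet(p, q):
    return tuple(min(u, v) for u, v in zip(p, q))

def leq(p, q):
    return all(u <= v for u, v in zip(p, q))

def decompose(x, a, b):
    """Given positive x, a, b with x <= a + b, return (s, t) with
    x = s + t, 0 <= s <= a and 0 <= t <= b, following the proof of Lemma 1."""
    t = meet(x, b)   # t = x ∩ b
    s = sub(x, t)    # s = x - (x ∩ b)
    return s, t
```

For x = (3, 5, 0), a = (2, 1, 4), b = (2, 6, 0) this yields s = (1, 0, 0) and t = (2, 5, 0).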

Theorem 21. The l-ideals of any l-group form a complete distributive
sublattice of the (modular) lattice of all its normal subgroups.

Proof. Clearly, any intersection of l-ideals is itself an l-ideal. To prove
that the sum $S + T$ of any two¹⁹ l-ideals S and T is an l-ideal, suppose that
$s \in S$, $t \in T$, and $|x| \le |s + t|$. Then for some $s' \in S$, $t' \in T$, $-(s + t) = s' + t'$,
and so

$|s + t| = (s + t) \cup (s' + t') \le (0 \cup s \cup s') + (0 \cup t \cup t')$.

Hence $x^+ \le |x| \le |s + t| \le s'' + t''$ ($s'' \in S$, $t'' \in T$). Using Lemma 1, we
can show now that $S + T$ contains $x^+$, likewise $-x^-$, and so $x = x^+ + x^-$.
Therefore $S + T$ is an l-ideal.

18 The terminology is that of Stone (op. cit.); the concept is due to the author, who called
l-ideals “normal subspaces”; F. Riesz, “Sur la théorie générale des opérations linéaires”,
Annals of Math. 41 (1940), pp. 174-206, called them “familles presque complètes.” Another
good term would be “absolute (normal) subgroup.” Kakutani uses l-ideals in a slightly
different sense.

19 From this it follows that the sum of any number of l-ideals is an l-ideal, by the
general logical principle that for any “closure” involving only finite operations, the closure
of any family of “closed” sets is the set-union of joins of finite subfamilies of “closed” sets.





It remains to prove that if S, T, and U are l-ideals, then $S \cap (T + U) =
(S \cap T) + (S \cap U)$. But since, in any case, $S \cap (T + U) \ge
(S \cap T) + (S \cap U)$ by the lattice-theoretic semi-distributive law, and
$x = x^+ - (-x^-)$, it suffices to show that every positive x in $S \cap (T + U)$ is in
$(S \cap T) + (S \cap U)$. But $x \in S \cap (T + U)$ means that $x \in S$ and $x = t + u$
($t \in T$, $u \in U$). Hence, as above, $x = |t + u| \le t'' + u''$, where $t'' \in T$, $u'' \in U$
are positive. Therefore, by Lemma 1, $x = t' + u'$, where $t' = x \cap t''$ is in
$S \cap T$ and $u' \le x \cap u''$ is in $S \cap U$. This proves $x \in (S \cap T) + (S \cap U)$,
as desired.

Corollary. The congruence relations on any l-group form a complete
distributive lattice.²⁰

Remark. If A is any commutative l-group, and T is any l-ideal of an l-ideal
S of A, then T is itself an l-ideal of A: the property of being an l-ideal is thus
hereditary. This follows because any subgroup of a subgroup is itself a
subgroup, and by the transitivity of inclusion. However, as Example 9 illustrates,
the same law does not hold for all non-commutative l-groups, essentially
because a normal subgroup of a normal subgroup of a group G need not be normal
in G.

12. Disjoint l-ideals

The following sections, through §20, will deal with non-commutative l-groups
only incidentally. In the main, they will be devoted to obtaining a more
complete picture of the structure of commutative l-groups, including a
determination of all possible structure lattices of finite length, and of all those
“simple” commutative l-groups which have no proper l-ideals.

Definition. Two elements a and b of an l-group G are called disjoint if and
only if $|a| \cap |b| = 0$.

Theorem 22. The set $\{a\}^*$ of all elements disjoint from any fixed element a
is a subgroup which contains with any b, all x satisfying $|x| \le |b|$.

Proof. By (23), $\{a\}^*$ contains 0; by Theorem 15, it is closed under addition;
since $|-b| = |b|$, it contains with any element its group inverse; hence it is a
subgroup. The second assertion follows by the monotonicity law.

Corollary. In a commutative l-group, the set of all elements disjoint from
any fixed element is an l-ideal.

Example 9 shows that, in the non-commutative case, the set need not be a
normal subgroup.

We note also, since $\{a\}^*$ cannot contain a unless $a = 0$, either $a = 0$, $\{a\}^* = 0$,
or $\{a\}^*$ is a proper l-ideal. This suggests the concept of a weak unit.

Definition. An element a of an l-group is called a weak unit²¹ if the only
element disjoint to it is 0.

20 This is the “structure lattice” of the l-group in the sense of the author, “On the
structure of abstract algebras,” Proc. Camb. Phil. Soc. 31 (1935), p. 450. It describes the
structure (in the usual sense) of the l-group. Theorems 20-21 are due to the author.

21 The concept is due to Freudenthal, op. cit.; the useful terms “weak unit” and “strong
unit” (infra) to Bohnenblust. We note that any separable Banach lattice has a weak unit.





13. Simply ordered groups 

A partially ordered set is called “simply ordered” when, of any two elements,
one includes the other, so that

$P_4$. Given x, y, either $x \ge y$ or $y \ge x$.

This automatically implies L'-L''.

Definition. A simply ordered group²² is an l-group in which $P_4$ holds.

We note without proof the following trivial results. An l-group is simply
ordered if and only if, for any a, either a or its inverse $-a$ is positive. An
l-group is simply ordered if and only if every subgroup is an l-subgroup. The
structure lattice of any simply ordered l-group is itself simply ordered (a chain).²³

Definition. Two l-ideals of an l-group are called disjoint if and only if their
intersection is 0.

It is easy to show that this is the case if and only if every element of the first
ideal is disjoint from every element of the second.

Theorem 23. A commutative l-group has two disjoint proper l-ideals unless it
is simply ordered.

Proof. Unless the l-group is simply ordered, it has an element a which is
neither positive nor negative, so that neither $a^+$ nor $a^-$ is 0. Hence $S = \{a^+\}^*$
will be a proper l-ideal containing $a^-$ but not $a^+$. Moreover the set $S^*$ of all
elements disjoint from all elements of S will contain $a^+$ but not $a^-$.
Furthermore, being an intersection of l-ideals, it will be an l-ideal. Finally, every
element of S is disjoint from every element of $S^*$.

Corollary. The structure lattice of a commutative l-group A is simply ordered
if and only if A is simply ordered.

Example 9 shows that the hypothesis of commutativity is essential in the
preceding results.

Digression. We have seen (Theorem 16) that in an l-group, every element
is of infinite order. We shall now show that, in the commutative case, this is
the only group-theoretic restriction implied by being an l-group.

Theorem 24 (F. Levi²⁴). Any abstract commutative group whose elements are
all of infinite order, is the additive group of a simply ordered l-group.

Proof. Let A be any Abelian group without any element of finite order
except the identity. By a well-ordered rational basis for A, we mean a
well-ordered (finite or infinite) subset of elements $a_\alpha$ of A such that every non-zero
element of A is a finite rational combination $n_1 a_{\alpha(1)} + \cdots + n_r a_{\alpha(r)}$
($\alpha(1) < \cdots < \alpha(r)$) of the $a_\alpha$, while $\sum n_i a_{\alpha(i)} = 0$ implies every $n_i = 0$; or
equivalently, $\sum (m_i/n_i) a_{\alpha(i)} = 0$ implies that every $(m_i/n_i) = 0$. The
existence of a well-ordered rational basis can be proved directly by transfinite
induction, just as in the case of vector spaces.

Moreover relative to such a basis, any element of A not the identity may be
called positive or negative according as its first non-zero coefficient is positive
or negative. This “lexicographic” ordering of A clearly defines from it a simply
ordered group (commutative l-group).

21 (continued) In fact, if $\{x_i\}$ is any everywhere dense countable set of positive
elements, and $\lambda_i = 1/(2^i \|x_i\|)$ for all i, then $e = \sum \lambda_i x_i$ is a weak unit.

22 Often called an “ordered group”; this is consistent with the terminology
“semi-ordered group” for what we have called a “partially ordered group.”

23 For if the l-ideal S contains an element not in the l-ideal T, then the absolute of this
element must exceed (not being included in) the absolute of every element of T, so that
$T \le S$.

24 “Arithmetische Gesetze im Gebiete diskreter Gruppen,” Rendic. Palermo 35
(1913), pp. 225-236.

Corollary. A commutative group is the additive group of an l-group if and 
only if it is without elements of finite order except the identity . 
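
The lexicographic ordering used in the proof of Theorem 24 is easy to realize when the rational basis is finite. The sketch below makes that simplifying assumption (the theorem itself allows an infinite well-ordered basis) and represents an element by its tuple of rational coefficients; the function names are ours.

```python
from fractions import Fraction

def sign(coeffs):
    """Sign of an element: the sign of its first non-zero coefficient
    relative to the ordered basis (0 for the identity)."""
    for c in coeffs:
        if c != 0:
            return 1 if c > 0 else -1
    return 0

def leq(p, q):
    """p <= q if and only if q - p is positive or zero."""
    return sign([b - a for a, b in zip(p, q)]) >= 0
```

Since every non-zero element has a first non-zero coefficient, exactly one of a, $-a$ is positive, which is the simple ordering asserted in the theorem.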

14. Archimedean l-groups

A gross way of comparing the magnitude of elements of l-groups is given by
the following

Definition. An element a of an l-group is called incomparably smaller than
a second element b (in symbols, $a \ll b$) if and only if $na < b$ for any integer n.

Otherwise stated, $a \ll b$ means that b is an upper bound for the entire cyclic
subgroup generated by a. Thus in Example 4, $(0, 1) \ll (1, 0)$. It is easily
verified that the relation $\ll$ is antisymmetric and transitive; it is closely related
to the concept of an Archimedean l-group.
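
The relation $(0, 1) \ll (1, 0)$ in Example 4 can be illustrated with Python tuples, whose built-in comparison is lexicographic and so agrees with the order of Example 4 on pairs. The check below (names ours) necessarily runs over a finite range of n only, so a `True` result is evidence for $a \ll b$ rather than a proof, while a `False` result does refute it.

```python
def multiple(n, p):
    """The n-fold multiple n*p of a pair p."""
    return (n * p[0], n * p[1])

def incomparably_smaller(a, b, n_range=range(-1000, 1001)):
    """Finite test of a << b: n*a < b for every n in n_range."""
    return all(multiple(n, a) < b for n in n_range)
```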

Definition. An l-group is called Archimedean if and only if $a \ll b$ implies
$a = 0$. (I, II, $P_1$-$P_3$)

The independence of the Archimedean property just stated from the lattice
property L'-L'' is illustrated by the easily proved fact that any subgroup of an
Archimedean l-group is itself Archimedean with respect to the same order
relation, whether it is an l-subgroup or not.

The Archimedean property can be formulated in other ways. It amounts to
asserting that the l-group has no bounded subgroups except 0. It is equivalent
to requiring that if the set of all positive multiples of a has an upper bound,
then $a \le 0$ (Clifford). In the case of l-groups, using Cor. 2 of Thm. 13, it is
equivalent to the apparently weaker requirement that if $a > 0$, then the
sequence $a, 2a, 3a, \cdots$ has no upper bound.

In a simply ordered group, the Archimedean property is thus equivalent to
the traditional condition that for any $e > 0$ and any b, $ne > b$ for all sufficiently
large positive integers n. This means that if we let U denote the set of all
rational numbers m/n such that $nb \le me$, and L the set of those such that
$nb \ge me$ (n positive), we get non-void sets. Moreover L and U together
include all elements (by $P_4$), and have at most one element in common. Hence
they are the two halves of a Dedekind cut. Again, no two distinct elements b
and b' can determine the same cut, or we would have $(b - b') \ll e$. Finally,
by the monotonicity law (2), addition of elements is isomorphic to the addition
of cuts. We conclude





Theorem 25. Any simply ordered Archimedean l-group is isomorphic to a
subgroup of the additive group of all real numbers, and so is commutative.²⁵
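
The Dedekind cut attached to an element b relative to a unit $e > 0$ can be computed by bisection. The sketch below is an illustration under assumptions of our own: the group is the simply ordered Archimedean group of numbers $p + q\sqrt{2}$ with integer p, q, stored as pairs (p, q), whose signs can be decided exactly in integer arithmetic. The function `cut` returns rationals lo in L and hi outside L, so that the real number corresponding to b lies between them.

```python
from fractions import Fraction

def sign(p, q):
    """Exact sign of p + q*sqrt(2) for integers p, q."""
    if p == 0 and q == 0:
        return 0
    if p >= 0 and q >= 0:
        return 1
    if p <= 0 and q <= 0:
        return -1
    if p > 0:  # here q < 0: positive iff p*p > 2*q*q
        return 1 if p * p > 2 * q * q else -1
    return 1 if 2 * q * q > p * p else -1  # here p < 0 < q

def in_lower(b, e, r):
    """r = m/n (n > 0) lies in L iff n*b >= m*e, i.e. n*b - m*e >= 0."""
    m, n = r.numerator, r.denominator
    return sign(n * b[0] - m * e[0], n * b[1] - m * e[1]) >= 0

def cut(b, e, steps=20):
    """Bracket the cut of b relative to e; assumes e > 0 and b >= 0."""
    lo, hi = Fraction(0), Fraction(1)
    while in_lower(b, e, hi):  # terminates by the Archimedean property
        hi *= 2
    for _ in range(steps):
        mid = (lo + hi) / 2
        if in_lower(b, e, mid):
            lo = mid
        else:
            hi = mid
    return lo, hi
```

For example, `cut((0, 1), (1, 0))` brackets $\sqrt{2}$ between two rationals whose difference is $2^{-19}$.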

Theorem 26. An Archimedean l-group may have a non-Archimedean
homomorphic image.

Proof. Consider the l-quotient-group of the l-group of all functions on the
interval $0 \le x < +\infty$, modulo the l-ideal of bounded functions. In this,
$x^2 > 0$, yet $x^2 \ll x^4$.

Digression. We have seen that in any l-group, for any element a, the
equation $nx = ma$ ($n \ne 0$) has at most one solution, which we can denote $(m/n)a$
if it exists. It is worth remarking now that in any Archimedean l-group, we can
define uniquely scalar products $\lambda a$ of a by any real number $\lambda$. To see this,
suppose a positive; there is at most one x such that $(m/n)a < x$ for all $m/n < \lambda$
and $(m/n)a > x$ for all $m/n > \lambda$. (If two, x and x', then $x - x' \ll a$.) This x
we may denote $\lambda a$, and prove that, whenever all terms exist, the usual laws of
the vector calculus hold.

15. Strong units: principal l-ideals

We have just seen that in any Archimedean simply ordered l-group, to any
$e > 0$ and b corresponds a positive integer n such that $ne > b$. This may be
generalized.

Definition. By a strong unit of an l-group A, is meant²⁶ an element $e \in A$
such that for any $b \in A$, $ne > b$ for some positive integer n.

Thus a strong unit must be positive. Many l-groups do not have any strong
unit. For example, the l-group of all continuous real functions on the domain
$0 \le x < +\infty$ has the weak unit $f(x) = 1$ but no strong unit; this is a weak
corollary of the Theorem of du Bois-Reymond.²⁷ On the other hand, in the
l-group of all bounded real functions on any domain, the function $f(x) = 1$ is a
strong unit. We also note

Lemma 1. Any strong unit is a weak unit.

Proof. For any e, $e \cap a = 0$ implies $ne \cap a = 0$ for all n (Thm. 22). But
if e is a strong unit, $ne > a$ for some n and so $e \cap a = 0$ implies $a = ne \cap a = 0$,
whence e is a weak unit.

Even in l-groups without strong units, l-ideals may have strong units. In
fact, in any commutative l-group, every positive element is a strong unit for an
appropriate l-ideal.

Theorem 27 (F. Riesz²⁸). In a commutative l-group, for any $a > 0$, the
set J(a) of all b such that $|b| \le na$ for some positive integer n forms an l-ideal
having a as strong unit. Moreover J(a) is the smallest l-ideal which contains a.

25 This result is due to H. Cartan, “Un théorème sur les groupes ordonnés,” Bull. Sci.
Math. 63 (1939), 201-206.

26 The concept goes back to Archimedes; the term to Bohnenblust.

27 Cf. for instance, G. H. Hardy, “Orders of Infinity,” Cambridge Tracts, 2d ed., 1924,
p. 8. In this example, our relation $a \ll b$ is practically the usual relation $f = o(g)$.

28 F. Riesz, op. cit., p. 188.





Proof. If $|b| \le ma$ and $|c| \le na$, then clearly $|b \pm c| \le (m + n)a$;
while if $|b| \le ma$ and $|x| \le |b|$, then $|x| \le ma$; hence J(a) is an l-ideal.
Obviously, a is a strong unit of J(a). Finally, any l-ideal containing a must
contain every na and so all b with $|b| \le na$.

Corollary. Any commutative non-Archimedean l-group has a proper l-ideal.

For if $a \ll b$ for some $a \ne 0$, then $J(|a|)$ is an l-ideal which fails to contain b,
yet contains $a \ne 0$, and so is proper.

Definition. An l-ideal of an l-group will be called principal if and only if it 
has a strong unit . 

Theorem 28. If the structure lattice of a commutative l-group has finite 
length r , every l-ideal is principal. 

Proof. Let J be an l-ideal of such an l-group A. The case J = 0 is trivial.
If J > 0, choose any a_1 ≠ 0 in J and form J(|a_1|). If J > J(|a_1|), choose
any a_2 in J but not in J(|a_1|) and form J(|a_1| + |a_2|). After repeating this
process at most r times, we will get a principal l-ideal equal to J.

Theorem 29. In any commutative l-group A, the principal l-ideals form a 
topologically dense sublattice of the structure lattice of A. 

Proof. It can be proved easily that

J(a ∧ b) = J(a) ∧ J(b) and J(a + b) = J(a) + J(b),

hence they form a sublattice. This sublattice is dense in the structure lattice
of A, since any l-ideal J is the supremum (in fact, set-union) of the finite joins
⋁_{i ∈ σ} J(a_i) of the principal l-ideals contained in J, and these form an ascending
directed set of principal l-ideals which thus converges to its supremum in the
sense of Moore-Smith.


16. Extension problem 

It is natural to say that an l-group is simple if and only if it has no proper
l-ideals or, equivalently, no proper congruence relations. Analogy with pure
group theory then suggests the program 29 of first determining all simple l-groups,
and then showing how the most general l-group whose “structure” is of
finite length can be built up from its simple quotient-l-groups.

The first problem has been solved in the commutative case. Indeed, a simple
commutative l-group must be simply ordered (by Theorem 23) and Archimedean
(by the Cor. of Thm. 27). Hence (by Theorem 26) we have

Theorem 30. The only commutative simple l-groups are the subgroups of the 
additive group of real numbers. 

The second problem involves in particular the specific task of enumerating
all the l-groups having a given l-ideal J and l-quotient-group A/J (“Extension
Problem”). While we do not attempt a complete solution of this, some fragmentary
results may be stated.

29 The logical outline is the same, but the technique is very different. Cf. O. Schreier,
“Über die Erweiterungen der Gruppen,” Monats. Math. u. Phys. 34 (1926), p. 165, and
Hamb. Abh. 4 (1927), pp. 321-346.





Given two l-groups S and T, one can form the l-group ST of all couples
(s, t) (s ∈ S, t ∈ T), where both the group operation and the lattice operations
are performed on the S-components and T-components independently, so that

(s, t) ∘ (s′, t′) = (s ∘ s′, t ∘ t′)

where ∘ is +, ∧, or ∨. This is the direct union of S and T in the sense of
universal algebra; we shall call it the cardinal product ST of S and T. The
elements (0, t) form an l-ideal of ST = A isomorphic with T, and the
l-quotient-group A/T is isomorphic with S. Hence the extension problem always has at
least one solution.

One can also form the lexicographic or ordinal product S ∘ T of any two
l-groups. This consists of the couples (s, t) (s ∈ S, t ∈ T) just as before. But
the set of positive elements consists of those couples (s, t) with s > 0, or s = 0
and t ≥ 0, instead of those with s ≥ 0 and t ≥ 0 as in the case of cardinal
products. In any case, S ∘ T is a partially ordered group. If S is simply
ordered, it is an l-group, in which the elements (0, t) form as before an l-ideal
isomorphic with T, whose l-quotient-group is isomorphic with S. Hence if S
is simply ordered, the extension problem has at least two solutions.
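
The difference between the two products can be seen on the simplest case, where both factors are the additive group of the integers. The sketch below is only an illustration; the function names are hypothetical, and the ordinal meet uses the fact that the lexicographic order on pairs of integers is a simple order, which Python's tuple comparison happens to implement.

```python
# Sketch: positive cones and meets in the cardinal product ZZ and the
# ordinal product Z o Z of the integers with themselves (names are illustrative).

def positive_cardinal(s, t):
    """(s, t) >= 0 in the cardinal product: both components are >= 0."""
    return s >= 0 and t >= 0

def positive_ordinal(s, t):
    """(s, t) >= 0 in the ordinal product: s > 0, or s = 0 and t >= 0."""
    return s > 0 or (s == 0 and t >= 0)

def meet_cardinal(x, y):
    """Meet taken on each component independently."""
    return (min(x[0], y[0]), min(x[1], y[1]))

def meet_ordinal(x, y):
    """The lexicographic order is simple, so the meet is the smaller couple."""
    return min(x, y)   # Python compares tuples lexicographically

print(positive_cardinal(1, -5), positive_ordinal(1, -5))              # False True
print(meet_cardinal((1, -5), (0, 3)), meet_ordinal((1, -5), (0, 3)))  # (0, -5) (0, 3)
```

Both orders live on the same group of couples and both have the couples (0, t) as an l-ideal, which is why they give two different solutions of the same extension problem.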

17. Direct decompositions 

In the cardinal product ST = A, both S and T correspond to l-ideals. Moreover
they correspond to complementary l-ideals, in the usual sense that
S ∧ T = 0 and S + T = A. Just as in the case of pure group theory, the
converse also holds.

Theorem 31. An l-group A is isomorphic to the cardinal product ST if and 
only if it contains complementary l-ideals isomorphic with S and T respectively. 

Proof of Converse. 31 Suppose A has l-ideals S and T. Then by group
theory, each element a ∈ A has a unique representation a = s + t (s ∈ S, t ∈ T),
while group operations are performed on the S- and T-components independently.
As regards order, s + t ≥ s′ + t′ if and only if

(−s′ + s) ≥ (t′ − t) ≥ −|t′ − t|,

whence (−s + s′) ≤ |t′ − t| ∧ |−s + s′| ∈ T ∧ S = 0, and likewise
t − t′ ≥ 0. This means s ≥ s′ and t ≥ t′, q.e.d.

From the preceding result, Theorem 21, and the general theory of distributive 
lattices, we obtain just as in “Lattice Theory,” Theorem 5.15, the following 
corollaries. 

Theorem 32. Any two representations of an l-group as a cardinal product have 
a common refinement . 

30 For the general significance of cardinal and ordinal products, cf. the author’s article
“Generalized arithmetic,” to appear in the Duke Journal of Mathematics. It is shown
there that the ordinal product of two lattices is itself a lattice if and only if the left-factor
is simply ordered, or the right-factor has universal bounds.

31 For a brief proof, relying more heavily on principles of universal algebra, cf. also
“Lattice Theory,” p. 110, below Theorem 7.11.





Corollary. If the structure lattice of an l-group A has finite length, then A
has a unique representation as the cardinal product of indecomposable factors.

18. Main structure theorem 

In the present section, we shall show that the structure of a commutative
l-group is of a very special kind. Indeed, by Theorem 23, any commutative
l-group in which the l-ideal 0 is meet-irreducible (or “prime”), is simply ordered.
From this (cf. footnote 23) we conclude

Lemma 1. The structure lattice of a commutative l-group in which the l-ideal 0 
is meet-irreducible, is a chain. 

But now if J is any l-ideal of an l-group A, the l-ideals of A which contain J
form a lattice isomorphic with the structure lattice of A/J; indeed, this is a
principle of universal algebra, holding for all congruence relations. Combining
this result with Lemma 1, we get

Lemma 2. The elements of the structure lattice of any commutative l-group 
which contain any meet-irreducible element, form a chain (simply ordered set). 

If we apply Lemma 2 to the general representation theory of finite distributive 
lattices (“Lattice Theory,” Theorem 5.3), we get a conclusive result. 

Any distributive lattice L of finite length may be described in terms of the
partially ordered set X of its meet-irreducible elements a_i. Every element
c ∈ L is the meet ⋀ a_i of the set S_c of the meet-irreducible elements which contain
c. Moreover, as in “Lattice Theory,” Theorem 5.3, the correspondence
c → S_c is a dual isomorphism between L and the “J-closed” subsets of X,
i.e., the subsets of X which contain with any a_i all a_j ≥ a_i.
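
The representation just described can be checked by brute force on a small distributive lattice. The sketch below uses the divisors of 12 ordered by divisibility (so that the meet is the greatest common divisor); this choice of lattice and all function names are assumptions made for illustration only.

```python
# Sketch: meet-irreducible elements and closed subsets in the divisor lattice of 12.
from itertools import combinations
from math import gcd
from functools import reduce

N = 12
L = [d for d in range(1, N + 1) if N % d == 0]      # lattice elements

def leq(a, b):
    """a <= b in the divisor lattice means a divides b."""
    return b % a == 0

def is_meet_irreducible(m):
    """m is not the top and is not the meet of two strictly larger elements."""
    if m == N:
        return False
    above = [x for x in L if leq(m, x) and x != m]
    return all(gcd(x, y) != m for x, y in combinations(above, 2))

X = [m for m in L if is_meet_irreducible(m)]

def S(c):
    """The set S_c of meet-irreducible elements containing c."""
    return frozenset(m for m in X if leq(c, m))

def is_closed(sub):
    """sub contains, with any a, every b in X with b >= a."""
    return all(b in sub for a in sub for b in X if leq(a, b))

closed = {frozenset(s) for r in range(len(X) + 1)
          for s in combinations(X, r) if is_closed(s)}

print(X)                                            # [3, 4, 6]
print({S(c) for c in L} == closed)                  # True
print(all(reduce(gcd, S(c), N) == c for c in L))    # True
```

The last line confirms that each c is recovered as the meet of S_c, the empty meet being the greatest element 12.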

This clearly applies to the structure lattice of any l-group, provided it has
finite length. Moreover if the l-group is commutative, Lemma 2 restricts X
greatly.

Definition. A partially ordered system X is called a semitree if, for any
element a ∈ X, the set of all x ≤ a is a chain; it is a tree if it has a least element 0.
The dual of a tree (semitree) is called a root (semiroot). 32
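
For finite partially ordered systems the definition can be tested directly. The following sketch is illustrative only; the function names and the use of divisibility as the sample order are assumptions of the example.

```python
# Sketch: testing the semitree and semiroot conditions on a finite ordered set.

def is_semitree(elements, leq):
    """True if, for every a, the set of all x <= a is a chain."""
    for a in elements:
        below = [x for x in elements if leq(x, a)]
        if not all(leq(x, y) or leq(y, x) for x in below for y in below):
            return False
    return True

def is_semiroot(elements, leq):
    """Dual notion: for every a, the set of all x >= a is a chain."""
    return is_semitree(elements, lambda x, y: leq(y, x))

def divides(x, y):
    return y % x == 0

print(is_semitree([1, 2, 3], divides), is_semiroot([1, 2, 3], divides))  # True False
print(is_semitree([2, 3, 6], divides), is_semiroot([2, 3, 6], divides))  # False True
```

The first system has a least element and is a tree; the second has a greatest element and is a root.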

We have shown that the meet-irreducible (“prime”) l-ideals of any l-group form
a semiroot. But now it is easy to show that any finite semiroot is the sum of
the subsets contained in its different maximal elements: the elements underneath
its different maximal elements form components having no connection
with each other (no common subelements or superelements).

Consequently, either the structure lattice contains complemented elements
(namely, the meets of the sets of elements under the different maximal meet-irreducible
elements), or the set X of meet-irreducible elements has an I. In the
first case, the l-group is directly decomposable, by Theorem 31. In the second
case, the J-closed subset consisting of I alone is a least non-void J-closed subset,

32 The Hasse diagram of any “tree” looks like a tree, and that of a “root” like a root
(tree upside down). Further, the graph of a “tree” (or root!) is a tree in the technical
sense of the theory of graphs. G. Kurepa has studied roots extensively, under the name of
“tableaux ramifiés.”





which thus corresponds under our dual isomorphism to a greatest proper l-ideal.
We can state our result as follows.

Theorem 33. A commutative l-group whose structure lattice has finite length,
either (i) is a cardinal product, or (ii) has a unique maximal proper l-ideal.

19. Solution of extension problem 

We shall now show that a commutative l-group A with a unique maximal
proper l-ideal J is a kind of mixed ordinal product of A/J and J. This will
give us a method for constructing, by successive extensions, all commutative
l-groups having finite structure lattices.

First, an element a of A not in J must be either positive or negative. For
consider the sum of the l-ideals generated by a⁺ and a⁻; it contains a, hence is
not contained in J, hence it is A. But this expresses A as a sum of disjoint
l-ideals; by hypothesis, A is join-irreducible; hence one of the l-ideals is A and
the other (being disjoint) is 0, and a⁺ or a⁻ is 0, as desired.

Second, a is positive or negative in A according as it is positive or negative
in A/J, since a homomorphism carries positive elements into positive elements
and dually. Hence A is determined to within isomorphism by its group structure,
the order structure of J, and the order structure of A/J. The positive
elements of A are those which have their (A/J)-component greater than zero,
or have their (A/J)-component equal to zero and their J-component positive.

This definition gives, conversely, from any abstract Abelian group A which
has a lattice-ordered subgroup J and simply ordered quotient-group A/J, an
l-group which may be called a mixed ordinal product of J and A/J. Clearly the
mixed ordinal products of J and A/J correspond one-one to the solutions of the
group-theoretic extension problem of finding all Abelian groups A with a subgroup
isomorphic with J and a quotient-group isomorphic with A/J. In case A
is the direct union of J and A/J, we get the pure ordinal product; otherwise,
we get something different.

Now by Theorem 33, and induction (cf. the last Remark of §11) on the
length of the structure lattice, we get

Theorem 34. Any commutative l-group whose structure lattice has finite length 
can be built up from simple l-groups by forming successive cardinal products and 
mixed ordinal products. 

This result can be applied directly to vector lattices. It is known 33 that the
additive group of real numbers is the only simple vector lattice. Moreover it
can be shown that for finite-dimensional vector lattices, the only group-theoretic
solution of the extension problem is given by the direct union. We conclude

Corollary 1. Any vector lattice of finite dimension can be built up from the
group of real numbers under addition by repeated formation of cardinal and ordinal
products.


33 Mr. Murray Mannos, a graduate student at Harvard University, is writing a dissertation
on vector lattices of finite dimension which includes this and many other results.





We can state this somewhat cabalistically, using the generalized arithmetic
notation of the author, as

Corollary 2. The most general vector lattice of finite dimension is ^Y R#, where
R# denotes the additive group of real numbers, and Y denotes the most general
semiroot. 34

Incidentally, the structure lattice of ^Y R# is B^Y′, where Y′ denotes the semitree
dual to Y, ordinal exponentiation is replaced by cardinal exponentiation,
and B is the chain of two elements.

Going back to Lemma 2 of §18, the discussion of distributive lattices following
it, and using Corollary 2 for the converse, we get a final result.

Theorem 35. A lattice of finite length is the structure lattice of a commutative
l-group, if and only if it can be written B^Y, where Y is the most general semitree.

20. Subdirect decompositions 

We shall now consider the representations of commutative l-groups as l-subgroups
of cardinal products of smaller l-groups, or, as we shall say for short,
as subdirect products.

Just as in the case of groups (cf. “Lattice Theory,” p. 52) it may be shown
that the representations of an l-group as a subdirect product correspond one-one
to choices of sets of l-ideals having 0 for meet. In the case of structure lattices
of finite length, we can thus show that commutative l-groups are subdirect
products of l-groups in which 0 is meet-irreducible, and hence (§18, Lemma 1)
of simply ordered l-groups. We shall now show that the restriction to the case
of structure lattices of finite length is unnecessary.

Lemma 1. Let a be any non-zero element of a commutative l-group A. There
exists an l-ideal J in A such that a ∉ J yet J/J is meet-irreducible in A/J.

Proof. By transfinite induction, we can construct a maximal l-ideal J which
does not 35 contain a. It follows that any l-ideal of A which properly contains J
will contain J(a); hence J is meet-irreducible. We infer that the meet of all
meet-irreducible l-ideals of A is 0 in any case; hence that A is a subdirect product
of l-quotient-groups A/J in which 0 is meet-irreducible, and so which are simply
ordered.

Theorem 36. Any commutative l-group is isomorphic with an l-subgroup of a
cardinal product of simply ordered l-groups. 36

34 It has been pointed out to the author by A. H. Clifford and I. Kaplansky that Theorem
34 and its corollaries may be looked on as generalizing to the lattice-ordered case, the basic
results of H. Hahn (“Über die nichtarchimedischen Grössensysteme,” S.-B. Wiener Akad.
Math.-Nat. Klasse Abt. IIa, 116 (1907), pp. 601-653) on the classification of simply ordered
groups.

35 The construction is identical with that used by Stone in constructing prime ideals in
Boolean rings; it has been used so often that it will not be repeated here. Since an l-ideal
J is meet-irreducible if and only if |a| ∧ |b| ∈ J implies |a| ∈ J or |b| ∈ J, there is justification
for calling the meet-irreducible l-ideals prime l-ideals.

36 This is closely related to Satz 14 of P. Lorenzen, “Abstrakte Begründung der multiplikativen
Idealtheorie,” Math. Zeits. 45 (1939), pp. 533-553.





A special problem is that of trying to make the simply ordered l-groups
Archimedean, so as to get a representation by means of real functions. For this
to be possible, the original l-group must certainly be Archimedean; this condition
would also be sufficient if it were not for Theorem 26. Much work has been
done in attacking special cases of the problem. 37

21. Effect of chain condition 

We shall now turn our attention to l-groups in which all bounded sets have
l.u.b. and g.l.b. A special case is furnished by l-groups which satisfy the chain
condition.

Definition. An l-group will be said to satisfy the chain condition 38 if and
only if

(C) every non-void set of positive elements includes a minimal member.

Any element which covers 0 will be called a prime. 

Lemma 1. Any two primes are permutable.

This is a corollary of Lemma 1 of §9. It is a corollary that the primes generate
an Abelian subgroup, consisting of all elements which can be expressed as sums
n_1 p_1 + ⋯ + n_s p_s of a finite number of distinct primes.

Now let a > 0 be given, and consider all those differences a − Σ n_i p_i which
are positive. By the chain condition, one of these must be minimal, and so
cannot contain any prime q (otherwise a − (Σ n_i p_i + q) would be smaller).
Again by the chain condition, every positive element b except 0 contains a prime,
namely, some minimal x such that 0 < x ≤ b. Hence our minimal difference
must be 0, so that a = Σ n_i p_i.

But every element can be expressed as a difference of positive elements:
c = c⁺ − (−c)⁺ for all c; hence

Lemma 2. Any element not 0 can be expressed as a sum of integral multiples
of a finite number of distinct primes, as a = n_1 p_1 + ⋯ + n_s p_s.

Putting Lemmas 1-2 together, we infer that our l-group is commutative.
Now if we distinguish positive and negative coefficients, we get an expression
for any a ≠ 0 as

a = m_1 p_1 + ⋯ + m_r p_r − n_1 q_1 − ⋯ − n_s q_s.   (m_i, n_j > 0)

Clearly a cannot be positive unless q_j ≤ m_1 p_1 + ⋯ + m_r p_r for all j. But
since distinct primes are disjoint, by Theorem 15 q_j is disjoint from Σ m_i p_i;
hence a cannot be positive unless no negative coefficients occur.

Lemma 3. In Lemma 2, a is positive if and only if every n_i is positive.

37 Cf. F. Bohnenblust, op. cit.; S. Kakutani, “Weak topology, bicompact set, and the
principle of duality,” Proc. Imp. Acad. Tokyo 16 (1940), pp. 63-67, Thm. 6; Stone, op. cit.;
M. and S. Krein, Doklady 27 (1940), pp. 427-430; and K. Yosida, “On vector lattice with a
unit,” Proc. Imp. Acad. Tokyo 17 (1941), pp. 121-124.

38 Ore uses the word “Archimedean” to mean the same thing, but our terminology is
more common. In the simply ordered case, (C) implies that every non-void set of positive
elements has a least member (well-ordering condition), so that the integers form the only
simply ordered l-group satisfying the chain condition.






It is a corollary that a is zero (positive and negative) if and only if every
n_i is positive and negative, which is absurd. It is a corollary that the representation
of Lemma 2 is unique; for if a had two different representations, their
formal difference would give a representation of 0. We can summarize.

Theorem 37. Let A be any l-group which satisfies the chain condition. Then
A is commutative, and each non-zero element of A can be expressed uniquely as a
sum of integral multiples of distinct primes. 39 Such a sum is positive if and only if
no coefficient is negative.

It is a corollary that A is determined to within isomorphism by the cardinal 
number of the set of its primes. 
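
A familiar instance of Theorem 37, offered here only as an illustration and not taken from the text, is the multiplicative group of positive rationals ordered by divisibility, in which q is positive exactly when it is an integer and the primes are the prime numbers. The function names in the sketch are hypothetical; trial division is used for simplicity.

```python
# Sketch: unique prime representation in the positive rationals under
# multiplication, ordered by divisibility (an assumed example of Theorem 37).
from fractions import Fraction

def prime_exponents(q):
    """Exponents n_p in q = product of p**n_p, for a positive rational q."""
    exps = {}
    for part, sign in ((q.numerator, 1), (q.denominator, -1)):
        p = 2
        while part > 1:
            while part % p == 0:
                exps[p] = exps.get(p, 0) + sign
                part //= p
            p += 1
    return exps

def is_positive(q):
    """q is positive in the divisibility order, i.e. q is an integer."""
    return q.denominator == 1

q = Fraction(45, 28)
print(prime_exponents(q))   # {3: 2, 5: 1, 2: -2, 7: -1}
print(is_positive(q))       # False: some coefficients are negative
```

Written additively, the exponents are the integral coefficients of the theorem, and an element is positive if and only if none of them is negative.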

22. Application to ideal theory 

This suggests an approach to the so-called “fundamental theorem of ideal
theory” quite different from the modern approach, 40 and much nearer to the
classical one. Let F be any field, and let H be any subring of “integers” of F
which contains unity. By an ideal in F, we mean a subset which contains with
any two elements their sum and difference, and with any element all its integral
multiples. Multiplication of ideals is according to the usual definition.

It is clear that the non-zero ideals form a lattice with respect to set-inclusion, 
which in many important cases can be proved by extremely general arguments 
to satisfy the chain condition. 41 

It is also clear that multiplication of ideals is commutative and associative,
and that ideal multiplication is distributive on addition (the lattice-join).
Therefore we have all of the postulates of Theorem 7 satisfied except the existence
of inverses.

It follows that, in the most important cases, in order to establish the unique 
factorization of ideals into primes, we need only supplement general arguments 
by proving that every ideal has an ideal inverse —or equivalently, that the product 
of every ideal by a suitable ideal gives a principal ideal. 

23. Completeness 

Many important 1-groups are complete, in the sense of the following 

Definition. An l-group A is called complete (σ-complete) if and only if every
non-void (resp. countable) bounded set has a g.l.b. and a l.u.b.

Remark 1. By Theorem 2, the existence of g.l.b. implies that of l.u.b.; and 

39 In the commutative case, this result is essentially well-known. Cf. for example M.
Ward, “Residuated distributive lattices,” Duke Jour. 6 (1940), pp. 641-651; also A. Clifford,
“Arithmetic and ideal theory of abstract multiplication,” Bull. Am. Math. Soc. 40 (1934),
p. 329, Thm. 2.

40 For the modern treatment of E. Noether, cf. van der Waerden’s “Moderne Algebra,”
1st ed., vol. 2, pp. 98-102. For the classical treatment cf. D. Hilbert, “Théorie des corps de
nombres algébriques,” Paris, 1913. Remarks much like ours are made on p. 13 of Krull’s
“Idealtheorie.”

41 Cf. van der Waerden, op. cit., §80. By a “positive” ideal, we mean one which contains
H, which is an identity for multiplication. The “negative” ideals are thus the ideals which
are integral, in the usual terminology.





using Theorem 1, one can even show that it is enough to require that every non-void
set of positive elements have a g.l.b.

Remark 2. The chain condition implies completeness. For if S is a non-void
set of positive elements s_α, then the finite meets ⋀ s_α(i) include a minimal
member a by the chain condition. But every s_α ∧ a, being itself a finite meet
and so not properly contained in a, will be a. Thus a is a lower bound for S;
it obviously contains every lower bound.

Next, let x be any element of any l-group A. If s is any upper bound for the
set {nx}, then by Theorem 1 so are s + x and s − x. It follows that {nx}
cannot have a least upper bound unless x ≥ 0 and x ≤ 0.

Lemma 1. The set of all integral multiples of a non-zero element cannot have a
l.u.b. (I, II, P2)

Corollary. Unless A = 0, any l-group A contains a countable set without a
least upper bound.

It also follows that, if A is σ-complete, the set {nx} cannot have an upper
bound (or it would have a l.u.b.). In other words,

Theorem 38. Any σ-complete l-group is Archimedean.

Corollary. If an l-group can be embedded group- and order-isomorphically
in a complete l-group, then it is Archimedean.

Conversely, Clifford (op. cit.) has proved that any commutative Archimedean
l-group 42 can be completed by cuts in the sense of Dedekind-MacNeille, to give
a complete commutative l-group. Combining, we have

Theorem 39 (Clifford). A commutative l-group can be embedded in a complete
l-group if and only if it is Archimedean. (I, II, P1-P3, (14))

24. Infinite distributivity 

It was proved (Theorem 1) that in an l-group, any group translation is a
lattice automorphism. Consequently, it carries infinite joins and meets into
infinite joins and meets, respectively. The formulas expressing this fact appear
as the infinite distributive laws

(28)    a + ⋁ x_α = ⋁ (a + x_α),    a + ⋀ x_α = ⋀ (a + x_α),
        ⋁ x_α + b = ⋁ (x_α + b),    ⋀ x_α + b = ⋀ (x_α + b).

Similarly, since every correspondence of the form x → a − x is a dual automorphism,
we have the formal laws

(29)    a − ⋁ x_α = ⋀ (a − x_α) and dually.

Now let v = ⋁ x_α. Then, for all a and α,

0 ≤ (a ∧ v) − (a ∧ x_α) ≤ v − x_α by (27).


42 Actually, L'-L" may be replaced for this purpose by the far weaker condition (14)
(Moore-Smith property). In the present case, the cuts appear as so-called v-ideals; cf.
Krull’s v-Gruppensatz, “Idealtheorie,” p. 120.





But ⋀ (v − x_α) = v − ⋁ x_α = v − v = 0 by (29); and by what we have just
seen, 0 ≤ ⋀ [(a ∧ v) − (a ∧ x_α)] ≤ ⋀ (v − x_α); hence

0 = ⋀ [(a ∧ v) − (a ∧ x_α)] = (a ∧ v) − ⋁ (a ∧ x_α).

Transposing, we get the first of the further infinite distributive laws

(30)    a ∧ ⋁ x_α = ⋁ (a ∧ x_α) and a ∨ ⋀ x_α = ⋀ (a ∨ x_α);

the second follows by duality. Summarizing, we have

Theorem 40 (Kantorovitch 43 ). The infinite distributive laws (28)-(30) hold in
any complete l-group.
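
The laws of Theorem 40 can be spot-checked on finite families, where joins and meets always exist. The sketch below does this in the l-group of integer triples with componentwise order; the l-group, the sample data and the function names are assumptions of the example, and a finite check of course proves nothing about infinite families.

```python
# Sketch: checking the distributive laws (28)-(30) on a finite family in Z^3.
from functools import reduce

def add(x, y):
    return tuple(a + b for a, b in zip(x, y))

def sub(x, y):
    return tuple(a - b for a, b in zip(x, y))

def meet(x, y):
    return tuple(min(a, b) for a, b in zip(x, y))

def join(x, y):
    return tuple(max(a, b) for a, b in zip(x, y))

def big_join(xs):
    return reduce(join, xs)

def big_meet(xs):
    return reduce(meet, xs)

a = (2, -1, 4)
xs = [(0, 3, 5), (7, -2, 1), (1, 1, 1)]

law_28 = add(a, big_join(xs)) == big_join([add(a, x) for x in xs])
law_29 = sub(a, big_join(xs)) == big_meet([sub(a, x) for x in xs])
law_30 = meet(a, big_join(xs)) == big_join([meet(a, x) for x in xs])
law_30_dual = join(a, big_meet(xs)) == big_meet([join(a, x) for x in xs])
print(law_28, law_29, law_30, law_30_dual)   # True True True True
```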


25. Closed l-ideals

In a complete commutative l-group, the complemented l-ideals may also be
characterized in terms of closure properties. To see this, let us define for any
set S of elements of an l-group G, the polar 44 S* of S as the set of all elements
disjoint from every element of S.

If S is a complemented l-ideal with complement T, then y ∈ T implies, for
all x ∈ S, that

| |x| ∧ |y| | = |x| ∧ |y| ≤ |x| and |y|.

Hence |x| ∧ |y| ∈ S ∧ T = 0, and x ⊥ y, proving y ∈ S*. Conversely, if
z = x + y (x ∈ S, y ∈ T) is in S*, then

0 = |z| ∧ |x| = (|x| + |y|) ∧ |x| = |x|,

whence z = y is in T. This shows T = S*; by symmetry, S = T* = (S*)*.
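
The polar operation can be made concrete in the l-group of integer triples, where two elements are disjoint exactly when their supports do not overlap. The sketch below works inside a finite window of that l-group; the window `BOX`, the sample set `S` and the function names are assumptions made for illustration.

```python
# Sketch: polars S* and (S*)* computed inside a finite window of Z^3.
from itertools import product

BOX = list(product(range(-2, 3), repeat=3))   # finite window into Z^3

def disjoint(x, y):
    """x is disjoint from y, i.e. |x| ^ |y| = 0 componentwise."""
    return all(min(abs(a), abs(b)) == 0 for a, b in zip(x, y))

def polar(S, universe=BOX):
    """Elements of the window disjoint from every element of S."""
    return [y for y in universe if all(disjoint(x, y) for x in S)]

S = [(1, 0, 0), (-2, 0, 0)]
S_star = polar(S)
S_star_star = polar(S_star)
print(len(S_star), len(S_star_star))   # 25 5
```

Within the window, S* consists of the triples with first coordinate zero and (S*)* of the triples supported on the first coordinate, so the two are complementary in the sense of the text.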

But now for any subset T, the set T* is an l-ideal by Theorem 22, provided G
is commutative. Further, by (30), if G is complete, then T* is a closed l-ideal
in the sense of the following definition.

Definition. An l-ideal J of a complete l-group G is called closed 45 if and only
if J contains with any bounded subset {x_α}, also ⋁ x_α.

Remark. Since the correspondence x → −x leaves J setwise invariant and
inverts order, it follows that J also contains ⋀ x_α. Further, since any l-ideal
is convex (§11), closure in the sense of the preceding definition is equivalent to
topological closure in the intrinsic topology. 46

43 “Lineare halbgeordnete Räume,” Math. Sbornik, 2 (44) (1937), pp. 121-168, esp.
Theorems 10-21. Kantorovitch assumed commutativity, but this does not play an essential
role.

44 It follows from the general theory of relations (cf. “Lattice Theory”, §32), since the
relation of disjointness is symmetric and anti-reflexive, that (i) if we denote (S*)* by S̄,
then the operation S → S̄ is a closure operation, (ii) if we call S “closed” when S = S̄, then
any intersection of “closed” sets is itself closed, (iii) 0 is closed, (iv) the correspondence
S → S* is a dual automorphism of the lattice of “closed” sets.

45 Closed l-ideals are the “familles complètes” of F. Riesz, op. cit.; Riesz proved Theorem
42 for principal l-ideals. Condition (ii) below shows the concept also specializes to that of
a “v-ideal” (Krull).

46 As defined on p. 32 of the author’s “Lattice Theory.”





We have seen that any complemented l-ideal is the polar of its complement,
and that the polar of any subset of a complete commutative l-group is a closed
l-ideal; we shall now complete the circle of reasoning by showing that any
closed l-ideal is complemented, yielding

Theorem 41 (Riesz). For any l-ideal J of a complete commutative l-group,
the following assertions are equivalent: (i) J is complemented, (ii) J = (J*)*,
(iii) J is closed. If (i) holds, then J* is the complement of J.

Completion of Proof. If J is a closed l-ideal of any complete l-group G,
then for any positive a ∈ G we can form the J-component a_J of a, as

a_J = ⋁_{x ∈ J, 0 ≤ x ≤ a} x = ⋁_{x ∈ J, x ≥ 0} (x ∧ a).

Since G is complete, and 0 ≤ x ∧ a ≤ a for all x ≥ 0, a_J exists. Moreover
since J is closed and every x ∧ a is in J, a_J is in J. Hence for all positive
z ∈ J, since (z + a_J) is positive and in J,

a_J ≤ (z + a_J) ∧ a ≤ ⋁_{x ∈ J, x ≥ 0} (x ∧ a) = a_J.

But now by the distributive law (1),

(z + a_J) ∧ a = z ∧ (a − a_J) + a_J,

whence, cancelling, z ∧ (a − a_J) = 0. Thus a − a_J is in J*. It follows that
J + J* includes all positive elements a = a_J + (a − a_J) of G, and hence all
elements of G by Cor. 2 of Thm. 13, so that J + J* = G. But evidently
J ∧ J* = 0, which shows that J is complemented with complement J*, as
asserted in the Theorem.

Corollary. Any intersection of complemented l-ideals of a complete commutative
l-group is itself complemented.

26. Weak units and direct decompositions 

Since any intersection of closed l-ideals is itself closed, it is natural to try to
describe explicitly the intersection of all closed l-ideals which contain a fixed
positive element a; in other words, the closed l-ideal generated by a.

We can answer this question (in complete commutative l-groups) by direct
appeal to Theorem 41. Using condition (ii), we see that (a*)* is the smallest
closed l-ideal which contains a; further, it is the largest closed l-ideal having a
for weak unit. We can also describe (a*)* in another way, using Theorem 27.
Clearly any closed l-ideal which contains a will contain all x such that x =
⋁ na ∧ x = ⋀ na ∧ x; but conversely, the set of all such x is a closed l-ideal
containing a. It is of course the topological closure of the principal l-ideal
generated by a.

Now let A be any l-group with weak unit e. If A can be represented as the
cardinal product A_1 ⋯ A_n of smaller l-groups, then the components of e in the
different A_i are disjoint elements whose sum is e.

Definition. By a decomposition of a positive element e of an l-group A, is
meant a set of disjoint elements e_i whose sum is e. By a component of e is meant
an element e′ such that 47 e′ ∧ (e − e′) = 0.

Theorem 42. The components of any positive element of any l-group form a
Boolean algebra.

Proof. They are the elements x of the distributive lattice of all elements
0 ≤ x ≤ e which have complements, by definition and Theorem 13. These
form a Boolean algebra, by Theorem 6.2 of the author’s “Lattice Theory.”
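
For a concrete check of Theorem 42, one may list the components of a positive element in the l-group of integer triples by brute force. The element e = (2, 1, 3) and the function names below are assumptions of this sketch; a component is taken to be an element x with x ∧ (e − x) = 0.

```python
# Sketch: the components of e = (2, 1, 3) in Z^3 with componentwise order.
from itertools import product

def meet(x, y):
    return tuple(min(a, b) for a, b in zip(x, y))

def join(x, y):
    return tuple(max(a, b) for a, b in zip(x, y))

def sub(x, y):
    return tuple(a - b for a, b in zip(x, y))

e = (2, 1, 3)
zero = (0, 0, 0)
interval = list(product(*(range(c + 1) for c in e)))   # all x with 0 <= x <= e

# x is a component of e when x and e - x are disjoint
components = [x for x in interval if meet(x, sub(e, x)) == zero]
print(len(interval), len(components))   # 24 8
```

The eight components are the triples agreeing with e on some set of coordinates and vanishing elsewhere; they are closed under meet and join, and the complement of x is e − x.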

Theorem 43. Let A be any complete commutative l-group with weak unit e.
Then the direct decompositions of A correspond one-one with the decompositions of e.

Proof. We have already seen that the components of e under any direct
decomposition of A create a decomposition of e. But conversely, let e =
e_1 + ⋯ + e_n be any decomposition of e, and let A_i denote the closed l-ideal
generated by e_i. Since the e_i are disjoint, we will have e_i* ⊇ A_j if i ≠ j, and
so A_i ⊥ A_j or, what comes to the same thing, A_i ∧ A_j = 0. But the sum
of the A_i contains e_1 + ⋯ + e_n = e, and is a closed l-ideal; hence it contains

A = 0* = (e*)*.

This completes the proof; we note in passing that the example of the ordinal
product of the additive group of the integers, with the cardinal product of the
additive group of the integers with itself (in symbols, J ∘ (JJ)), shows that the
hypothesis of completeness is not redundant.

27. Residuated lattices 

The concept of l-group can be generalized in two ways: one can weaken either
the group or the lattice postulates. The least essential group postulate seems
to be the one requiring the existence of inverses. If this is dropped, we arrive
at something very close to the usual concept of a residuated lattice. 48

Definition. Let G be any (additively written) groupoid, or associative system
with identity 0. If G is also a lattice, and satisfies

(28)    a + ⋁ x_α = ⋁ (a + x_α) and ⋁ x_α + b = ⋁ (x_α + b),

it will be called an l-groupoid. In any l-groupoid, the left-residual a:b of b by a
is defined as the join of all x such that x + b ≤ a. The right-residual a::b of b by a
is defined as the join of all y such that b + y ≤ a.

Clearly every l-group is an l-groupoid, in which a:b is a − b and a::b is
−b + a. Also, by (28), (a:b) + b ≤ a and b + (a::b) ≤ a. The concept
of l-groupoid is not self-dual; in any l-groupoid we have the monotonicity law
(2), but not the dual of (28), even for finite meets.

Much of the importance of 1-groupoids stems from 

47 The concepts just defined, together with Theorems 42-43, are due essentially to
Freudenthal, op. cit.

48 This concept, and (implicitly) that of l-groupoid, are due to M. Ward and R. P. Dilworth
(“Residuated lattices,” Trans. Am. Math. Soc. 45 (1939), pp. 335-354, and
“Non-commutative residuated lattices,” ibid. 46 (1939), pp. 426-444). The main contribution of
the present section is to show that the concept applies to important systems other than
ideals.





Theorem 44 (Ward). The ideals of any ring form an l-groupoid if inclusion
is taken to mean set-inclusion and if ideal multiplication is taken as the group
operation.

We shall omit the proof, which is immediate. A special further property of
ideals is x + y ≤ x ∧ y; this is not a consequence of the postulates for an
l-groupoid, and implies x ≤ 0 ∧ x, or 0 ≥ x for all x, as a special case. Conversely,
if every x ≤ 0, then x + y ≤ 0 + y = y and similarly x + y ≤ x,
whence x + y ≤ x ∧ y, for all x, y.

Definition. A residuated lattice is an l-groupoid in which x + y ≤ x ∧ y
for all x, y, or equivalently, in which every element is negative.

No l-group is a residuated lattice. However, we have

Theorem 45. Let G be any l-group, and S any set of negative elements of G
which contains 0 and is closed under + and ∨. Then S is a residuated lattice.

Corollary. The set of all negative elements of any l-group or l-groupoid is a 
residuated lattice. 

For instance, by Theorem 45, the non-positive non-increasing real functions 
on any interval, the non-positive convex functions on any interval, and the 
non-positive subharmonic functions on any plane region, form residuated 
lattices. 49 

Theorem 46. An abstract lattice L is residuated when $\wedge$ is taken as the group
operation, if and only if the dual of L is a Brouwerian logic. In this case, the
residuation operation : specializes to the implication operation $\to$.

Proof: Compare the definitions given above with Theorem 8.4 of “Lattice 
Theory.” 

Theorem 47 (J. W. Duthie 50). The binary relations on any set form an
l-groupoid, if the relative product is taken as the group operation, while the lattice
operations are given their usual significance.

We note also that the V -ideals of any commutative groupoid form a residu¬ 
ated lattice. 

Proof. The different postulates defining an l-groupoid are proved in
Schröder’s “Algebra der Logik,” vol. III, esp. formula (29), p. 100, and formula
(6), p. 79. The proof can be supplied by anyone familiar with the definitions.

The relations form a Boolean algebra under inclusion; and it has been proved 
by Ward-Dilworth (op. cit., Thm. 7.4) that the only way to make a Boolean 
algebra residuated is to take lattice-meet as the group operation; hence we know 
in advance that relations cannot form a residuated lattice. 
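
A small brute-force check can make Theorem 47 and the remark above concrete. The sketch below is an illustration under stated assumptions, not the author's argument: it enumerates all 16 binary relations on a hypothetical two-point set, uses the relative product as the operation, and computes the left-residual directly from its definition as a join. The names `compose`, `left_residual`, `RELATIONS` and `IDENTITY` are invented for this example.

```python
from itertools import combinations

POINTS = (0, 1)
PAIRS = [(p, q) for p in POINTS for q in POINTS]
# All 16 binary relations on the two-point set, as frozensets of ordered pairs.
RELATIONS = [
    frozenset(chosen)
    for size in range(len(PAIRS) + 1)
    for chosen in combinations(PAIRS, size)
]
# The identity relation is the identity element for the relative product.
IDENTITY = frozenset((p, p) for p in POINTS)


def compose(r, s):
    """Relative product: p is related to q if p r m and m s q for some m."""
    return frozenset((p, q) for (p, m) in r for (n, q) in s if m == n)


def left_residual(a, b):
    """Join (union) of all relations x whose product with b lies inside a."""
    result = frozenset()
    for x in RELATIONS:
        if compose(x, b) <= a:
            result |= x
    return result
```

The relative product distributes over unions on both sides, so condition (28) holds, but some relations are not contained in the identity relation, so not every element is negative and the system is not a residuated lattice.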

49 These and other function-theoretic examples of the same type were signalized in §133 of
“Lattice Theory,” where however the connection with residuated lattices was not remarked.

50 Communicated to the author orally; this result was not mentioned by O. Ore in his
Colloquium Lectures on relations. For the definitions of relative product, join, and meet,
for binary relations, cf. A. Tarski, “Introduction to Logic,” New York, 1941, pp. 90-93, or
E. Schröder, “Algebra der Logik.” It is interesting that the conversion operator $a \to \breve{a}$
should act as an involution on the algebra of relations.

LATTICE-ORDERED GROUPS

We note that in every l-groupoid, all left-residuals $a:a$ of elements with themselves
are idempotent: $(a:a) + (a:a) = a:a$. For by the definition of left-residual,
we have

$(a:a) + (a:a) + a \le (a:a) + a \le a,$

whence $(a:a) + (a:a) \le a:a$. Conversely, $a + 0 = a$, whence $0 \le a:a$, and so
$(a:a) = (a:a) + 0 \le (a:a) + (a:a)$, completing the proof. In residuated
lattices, $a:a = 0$, and the result just proved is trivial.

Finally (cf. Dilworth, op. cit.), we can prove

(20'') $a \vee b = a \vee c = 0$ implies $a \vee (b + c) = 0$

in any l-groupoid; in fact, the proof of Theorem 15 applies as it stands!

28. Riesz’ Interpolation Property 51

The most basic of the lattice postulates (see §4) seem to be the reflexive law
P1 and the transitive law P3; in general, a system with a reflexive and transitive
relation is called a quasi-ordered set.

Definition. A quasi-ordered group is a group G with a homogeneous reflexive
and transitive relation. If the relation is also anti-symmetric (satisfies P2), then
G is called a partially ordered group (or semiordered group).

It is well-known (“Lattice Theory”, Thm. 1.2) that in any quasi-ordered set,
if $a \sim b$ is defined to mean that $a \le b$ and $a \ge b$, we get an equivalence
relation, and that we can consistently identify “equivalent” elements to get a
partially ordered set. It is easily shown that in a quasi-ordered group, the $x \sim 0$
form a normal subgroup N, while the other equivalence classes form the cosets
of N. This gives

Theorem 48. The algorithm of identifying a and b whenever $a \le b$ and $a \ge b$,
yields from any quasi-ordered group a partially ordered group.

Existence postulates such as the Interpolation Properties to be discussed
below and the lattice postulates L'-L'' apply to quasi-ordered groups just as
well as to partially ordered groups; only uniqueness properties are lost.
However, by Theorem 48, no real generality is lost if we restrict ourselves to partially
ordered groups. 52

Definition. Let m, n be any cardinal numbers. A partially ordered set will
be said to have the (m, n) Interpolation Property if and only if, given $x_1, \dots, x_m$
and $y_1, \dots, y_n$, such that $x_i \le y_j$ for all i, j, we can find a z such that
$x_i \le z \le y_j$ for all i, j.

Special Cases. The reflexive law makes the (m, 1) and (1, n) Interpolation
Properties trivial. The (0, 2) Interpolation Property is the Moore-Smith
property discussed in §8; it implies the (0, n) Interpolation Property for all finite n.

51 The author is greatly indebted to conversations with George Mackey and John von
Neumann for material of the present section; the basic ideas go back to F. Riesz, op. cit.

52 Partially ordered groups can be bizarre enough. For instance, consider the additive
group of real numbers, and let the “positive” elements be those which exceed unity; na > 0
need not imply a > 0.





The $(0, \beta)$ Interpolation Property for all $\beta$ is equivalent to the existence of a
universal element I, and can hold in no partially ordered group except 0 (§22,
Lemma 1, Cor.). Again, the $(\alpha, \beta)$ Interpolation Property for all non-zero
cardinals is equivalent to conditional completeness: the condition that every
non-void set bounded above have a least upper bound, and dually. To see this,
given any bounded set of elements $x_i$, form the non-void set $y_j$ of upper bounds
to the $x_i$; then $x_i \le y_j$ identically, so that z will exist with $x_i \le z \le y_j$ for all i, j;
by definition, $z = \bigvee x_i$. Similarly, the $(\alpha, \beta)$ Interpolation Property for all
cardinals, zero included, is equivalent to completeness.

But, algebraically speaking, the most interesting case is the (2, 2) Riesz
Interpolation Property. By induction, this implies every (m, n) Interpolation
Property with m, n finite and not zero. It is clearly weaker than the lattice property,
since if $x_i \le y_j$ for all i, j, then $x_i \le \bigvee x_i \le \bigwedge y_j \le y_j$ for all i, j.
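
The definition of the (m, n) Interpolation Property can be tested by exhaustive search on a finite partially ordered set. The following sketch is illustrative only: the function `has_interpolation` and the two example posets (a four-element "bowtie" and the divisors of 6 under divisibility) are assumptions introduced here, not examples from the text.

```python
from itertools import product


def has_interpolation(elements, leq, m, n):
    """Check the (m, n) Interpolation Property by exhaustive search."""
    for xs in product(elements, repeat=m):
        for ys in product(elements, repeat=n):
            # Only families with every x below every y need an interpolant.
            if not all(leq(x, y) for x in xs for y in ys):
                continue
            found = any(
                all(leq(x, z) for x in xs) and all(leq(z, y) for y in ys)
                for z in elements
            )
            if not found:
                return False
    return True


# "Bowtie" poset: a and b both lie below c and d, with no other relations.
BOWTIE = ("a", "b", "c", "d")
BOWTIE_ORDER = {(p, p) for p in BOWTIE} | {
    ("a", "c"), ("a", "d"), ("b", "c"), ("b", "d")
}


def bowtie_leq(p, q):
    return (p, q) in BOWTIE_ORDER


# Divisors of 6 ordered by divisibility form a lattice.
DIVISORS = (1, 2, 3, 6)


def divides(p, q):
    return q % p == 0
```

The bowtie has the (1, 2) and (2, 1) properties by reflexivity but fails the (2, 2) property, since nothing lies between the pair a, b and the pair c, d; the divisor lattice satisfies it, with the join of the lower pair serving as interpolant.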

For example (F. Riesz, op. cit.), the polynomials, and also the rational
functions with non-vanishing denominator, on any bounded closed region, form
partially ordered groups which have the Riesz Interpolation Property but are not
lattices.

Theorem 49. The following conditions on any partially ordered group G are
equivalent:

(i) The Riesz Interpolation Property,

(ii) The condition of Lemma 1, §11.

If G is commutative, they are both equivalent to

(iii) The condition that if $a_1 + a_2 = b_1 + b_2$, and $a_1, a_2, b_1, b_2$ are positive, then
there exist positive elements $c_{11}, c_{12}, c_{21}, c_{22}$, such that $\sum_{j=1}^{2} c_{ij} = a_i$ and
$\sum_{i=1}^{2} c_{ij} = b_j$. (Riesz Refinement Postulate)

Proof. First, (i) implies (ii). For if $0 \le a, b$ and $0 \le x \le a + b$, then $0 \le x, a$ and
$x - b \le x, a$ (transposing); hence if (i) holds, there exists s with $0 \le s \le a$,
$s \le x$ whence $x = s + t$ ($t \ge 0$), and $x - b \le s$ whence $s + t = x \le s + b$
and so $t \le b$. Conversely, (ii) implies (i). By right-homogeneity, it suffices
to prove that if $0, x \le y, x + b$ then there exists s with $0, x \le s \le y, x + b$.
But indeed, $0 \le x + b \le y + b$ since $x \le y$; hence $x + b = s + t$ ($0 \le s \le y$,
$0 \le t \le b$), whence $x + t \le x + b = s + t$ and so $s \ge x$, $s \le x + b$.

Finally, if G is commutative, then (ii) and (iii) are equivalent. For
$0 \le x \le a + b$ ($a \ge 0$, $b \ge 0$) is equivalent to $a + b = x + (a + b - x)$, where all
four summands are positive. To say that under these circumstances $x = s + t$
($0 \le s \le a$, $0 \le t \le b$) is equivalent to saying that $(a + b - x) =
(a - s) + (b - t) = s' + t'$, where $a \ge s' \ge 0$, $b \ge t' \ge 0$, whence $s, s', t, t'$
behave as $c_{ij}$ for (iii).
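
For the simply ordered group of integers the refinement in (iii) can be written down explicitly. The sketch below is only an illustration under that assumption; the function name `riesz_refinement` and the particular choice $c_{11} = \min(a_1, b_1)$ are not taken from the text, and "positive" is read as "non-negative", as in the paper.

```python
def riesz_refinement(a1, a2, b1, b2):
    """Return ((c11, c12), (c21, c22)) with row sums a1, a2 and column sums b1, b2.

    Assumes non-negative integers with a1 + a2 == b1 + b2.
    """
    if min(a1, a2, b1, b2) < 0 or a1 + a2 != b1 + b2:
        raise ValueError("need non-negative integers with a1 + a2 == b1 + b2")
    c11 = min(a1, b1)   # the largest common part of a1 and b1
    c12 = a1 - c11      # remainder of a1 goes to the second column
    c21 = b1 - c11      # remainder of b1 comes from the second row
    c22 = a2 - c21      # whatever is left of a2
    return ((c11, c12), (c21, c22))
```

For instance, with $a_1 = 5$, $a_2 = 2$, $b_1 = 3$, $b_2 = 4$ this choice gives rows (3, 2) and (0, 2), whose row sums are 5 and 2 and whose column sums are 3 and 4.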

Theorem 50. In any partially ordered group which has the Riesz Interpolation
Property, we know

(20') $a \wedge b = 0$ and $a \wedge c = 0$ imply $a \wedge (b + c) = 0$.





Proof. Suppose $0 \le x \le a, b + c$. Then a and b are upper bounds to 0 and
$x - c$, since $x - c \le x \le a$ and $b - (x - c) = (b + c) - x \ge 0$. Hence an
element can be inserted between $0, x - c$ and $a, b$. But since $a \wedge b = 0$, this
element must be 0. Hence $x - c \le 0$, $x \le c$ as well as $x \le a$, and $x = 0$.

By duality (20") holds also; for the further study of commutative partially 
ordered groups with the Riesz Interpolation Property, with especial emphasis 
on the linear functionals on such groups, see F. Riesz, op . cit . 

29. Unsolved problems, general case 

We shall conclude this paper with a list of problems of varying degrees of
interest and difficulty. For the purpose of classification, these will be divided
into those which involve general l-groups, and those which relate primarily to
commutative l-groups.

Problem 1. Show that na > nb for one positive n implies a > b. 

Suggestions. This is easy in the simply ordered case, or if a and b are 
permutable (see §9, Lemma 3). 

Problem 2. Show that if a > 0 and b > 0, then

$-a - b + a + b \ll a + b.$

In words, the commutator of a and b is incomparably smaller than a + b.

Suggestion. If the commutator is in the center, then

$n(a + b) \ge -na - nb + n(a + b) = \binom{n+1}{2}(-a - b + a + b).$

Hence if the conjecture of Problem 1 can be proved, we have $(a + b) \ge
\tfrac{1}{2} n(-a - b + a + b)$ for every even integer n, giving the desired result.
This method, with the aid of finite induction, might be successfully applied at
least to hypercentral l-groups.
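
The purely group-theoretic part of this suggestion, namely that $-na - nb + n(a + b)$ equals $\tfrac{1}{2}n(n+1)$ times the commutator when the commutator is central, can be checked numerically. The sketch below uses the integer Heisenberg group, written additively, as a stand-in example of a group with central commutators; this choice, and the helper names `add`, `neg`, `times` and `commutator`, are assumptions made for illustration. No ordering is modelled, so only the equality is tested, not the inequality.

```python
ZERO = (0, 0, 0)


def add(g, h):
    """Operation of the integer Heisenberg group, written additively."""
    (x1, y1, z1), (x2, y2, z2) = g, h
    return (x1 + x2, y1 + y2, z1 + z2 + x1 * y2)


def neg(g):
    """Inverse element: add(g, neg(g)) == ZERO."""
    x, y, z = g
    return (-x, -y, -z + x * y)


def times(n, g):
    """n-fold sum g + g + ... + g for a non-negative integer n."""
    result = ZERO
    for _ in range(n):
        result = add(result, g)
    return result


def commutator(a, b):
    """The element -a - b + a + b."""
    return add(add(neg(a), neg(b)), add(a, b))
```

With a = (1, 0, 0) and b = (0, 1, 0) the commutator is (0, 0, 1), which commutes with every element, and the identity can be compared term by term for small n.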

Problem 3. Prove that every Archimedean l-group is commutative.

This result would be a corollary of the result conjectured in Problem 2.
Using Theorem 38, we would infer as a second corollary that every complete
l-group was commutative. Hence to disprove the conjectures of Problems 3-4,
it would be enough to find a complete non-commutative l-group, or an
Archimedean non-commutative l-group.

Problem 4. Prove that a complete l-group either satisfies the chain
condition or has at least the cardinal number of the continuum.

Problem 5. Find an l-group without proper l-ideals which is non-commutative.

Suggestions. By Theorem 30, it would suffice to find a simple l-group which
was non-Archimedean or not simply ordered. By Theorem 25, this is also
necessary, so that if the author’s conjecture is correct, either a non-Archimedean
or a non-simply ordered simple l-group must exist. The author conjectures that
the former is certainly the case.

Problem 6. Find all l-group orderings (homogeneous lattice orderings) of
the free group with two generators. Is the commutator-subgroup necessarily
an l-ideal?

Suggestion. See Problem 2. 

Problem 7. Find a necessary and sufficient condition that an abstract group
be isomorphic with the additive group of an l-group.

Suggestion. By Theorem 16, it is necessary that every element be of
infinite order; by Theorem 24, this is sufficient in the commutative case; the author
conjectures that it is also sufficient in the hypercentral case.

Problem 8. Suppose that in an l-groupoid $0:(0:x) = x$ and $0::(0::x) = x$
for all x. What can be inferred?

Suggestions. The correspondence $x \to 0:x$ will then be a lattice involution,
so that the dual of (28) also holds. What about the commutative case? Will
$0:(x + y) = 0:x + 0:y$?

30. Unsolved problems, commutative case 

Any commutative group without elements of finite order whose cardinal
number is at most that of the continuum, is isomorphic with an additive
subgroup of the ordered group of real numbers under addition (proof by rational
bases), and so is isomorphic with an Archimedean l-group.

Problem 9. Is every commutative group without elements of finite order
isomorphic with the additive group of an Archimedean l-group?

Problem 10. Find a necessary and sufficient condition that a commutative
partially ordered group be group- and order-isomorphic with an additive subgroup
of a cardinal product of simply ordered Archimedean l-groups, or equivalently,
by real functions. 53

Problem 11. Given l-groups B and C, reduce the problem of finding all
l-groups A having an l-ideal J isomorphic with C and l-quotient-group A/J
isomorphic with B to a problem in pure group extension, in the commutative
case.

This problem was implicitly solved in special cases in §19; the special cases 
B simple and C simple might well be attacked first. 

Problem 12. Find all Lie l-algebras: Lie algebras over the real field which
are vector lattices relative to a set of positive elements which is invariant under
all inner automorphisms.

Suggestions. Use the known classification of vector lattices with finite basis
(Cors. 1-2 of Theorem 35). The author conjectures that a Lie algebra can be
made into a Lie l-algebra only if it is solvable (or “integrable”).

Problem 13. Find all Lie l-groups in the large.

The problem in the small is contained in Problem 12; the fact that all elements
have infinite order should simplify it.

Problem 14. Construct a theory of l-rings.

53 The work of von Neumann (unpublished), Stone (cf. footnote 35) et al. shows that any
such Archimedean group is isomorphic with a homomorphic image of such a cardinal
product.





The only postulates known (Stone, op. cit.) cover only a very special case:
subrings of cardinal unions of simply ordered l-rings, corresponding to rings of
functions. Cf. also A. A. Albert, “On ordered algebras”, Bull. Am. Math.
Soc. 46 (1940), pp. 521-522.

Problem 15. Find a more direct substitute for condition (8) in Theorem 9:
i.e., a simple condition or set of simple conditions on the operation $a \to a^{+}$
necessary and sufficient to make the operation $(a - b)^{+} + b$ associative.

Harvard University 



Annals of Mathematics
Vol. 43, No. 2, April, 1942


OPERATOR METHODS IN CLASSICAL MECHANICS, II

By Paul R. Halmos and John von Neumann 
(Received December 23, 1941) 

Introduction 

The purpose of this paper is two-fold: to map all measure spaces for which 
this is possible on the unit interval, and to apply such mapping theorems to the 
study of ergodic measure preserving transformations with a pure point spectrum. 

“Mappings” between two measure spaces may be interpreted in two ways,
as set mappings and as point mappings, and accordingly we give below two sets
of necessary and sufficient conditions for the existence of a mapping from a given
space to the interval. The first of these, the set mapping or algebraic
isomorphism theorem, seems to be known, and although it has never been explicitly
stated in the literature there are many proofs of special cases of it on record.
We give an explicit proof of it and use a construction of the proof in proving the
second, point mapping or geometric isomorphism, theorem. This second
theorem depends on the new concept of normal measure space: a seemingly artificial
concept which is, however, useful for two reasons. First, it is purely measure
theoretic (and not topological) in character, and hence is applicable to the
measure spaces usually discussed in probability theory; second, it is hereditary
under all the usual operations on measure spaces (such as the formation of
direct products, decomposition into direct sums, etc.).

Using the concepts and results of the mapping theorems just described, and
of the Pontrjagin duality theorem concerning compact and discrete abelian
groups, we are able to show that every ergodic measure preserving
transformation with a pure point spectrum is isomorphic to a rotation on a compact abelian
group. This is a “normal form” theorem for a certain class of measure
preserving transformations and can be used to answer many questions, such as the
existence of square roots, commutative transformations, etc., concerning such
transformations.

Although this paper is a continuation of an earlier work of one of us 1 it is to a
large extent independent of this earlier work. The proofs of the main theorems
mentioned above are logically complete here; only in some of the applications,
as for example in discussing the relation between point mappings and set
mappings, do we make use of the results of (I).

1. General measure spaces; the algebraic isomorphism theorem 

Let X be any set, and $\mathcal{X}$ any Borel field of subsets of X; let m be a non
negative, countably additive, finite measure defined on $\mathcal{X}$. The system
$\{X, \mathcal{X}, m\}$, which we shall usually denote by X, or, if necessary to indicate its

1 See John von Neumann, Zur Operatorenmethode in der klassischen Mechanik, Annals of
Mathematics, vol. 33, (1932), pp. 587-642. In the sequel we shall refer to this paper as (I).



dependence on $\mathcal{X}$ and m, by $X(\mathcal{X}, m)$, is called a measure space. Sets $E \in \mathcal{X}$
are called measurable; we shall use also the usual terminology of the Lebesgue
theory in describing functions as measurable, integrable, etc. A measure space
is complete if every subset of a measurable set of measure zero is itself measurable
(and has, of course, measure zero). Since it is always possible to extend the
definition of m to a Borel field $\mathcal{X}' \supset \mathcal{X}$ so that $X(\mathcal{X}', m)$ is complete, we shall
lose no generality, and gain somewhat in simplicity, by assuming completeness.

In any measure space X we shall write $\mathfrak{B} = \mathfrak{B}(\mathcal{X})$ for the Boolean algebra
of measurable sets modulo sets of measure zero. We shall make use of the
notations of set theory ($\subset$, +, etc.) in $\mathfrak{B}$, and of the fact that we may consider
m as defined on $\mathfrak{B}$.

We discuss now the concept of separability in measure spaces. A Borel
field $\mathcal{X}$ (or a measure space $X(\mathcal{X}, m)$) is strictly separable if it contains a
countable collection of sets such that the smallest Borel field containing all of them
(the Borel field spanned by them) is $\mathcal{X}$ itself. Two sub Borel fields, $\mathcal{C}$ and $\mathcal{D}$,
of the Borel field $\mathcal{X}$ of measurable sets in a measure space $X(\mathcal{X}, m)$ are equivalent
if to every set E in either one of them there corresponds a set F in the other
such that the symmetric difference $(E - F) + (F - E)$ has measure zero. A
measure space is separable if there exists a strictly separable Borel field $\mathcal{C}$
contained in and equivalent to $\mathcal{X}$. 2 A concept which lies logically between
separability and strict separability, and is more useful than either of these, is proper
separability. A measure space $X(\mathcal{X}, m)$ is properly separable if there exists a strictly
separable Borel field $\mathcal{C} \subset \mathcal{X}$, such that to every $E \in \mathcal{X}$ there corresponds an
$F \in \mathcal{C}$ with $E \subset F$ and $m(F - E) = 0$. 3 We observe that this definition is
self dual: by applying the condition to $X - E$ we readily obtain a set $F \in \mathcal{C}$
with $F \subset E$ and $m(E - F) = 0$. We shall make use of the fact that if X is
separable (or properly separable) and $\mathcal{C}$ is the strictly separable Borel field
described in the definitions above then $\mathfrak{B}(\mathcal{X}) = \mathfrak{B}(\mathcal{C})$. In the case of (properly)
separable measure spaces it will be necessary to indicate in the notation the
strictly separable Borel field used; we shall write $X = X(\mathcal{X}, \mathcal{C}, m)$. We shall
call sets of $\mathcal{C}$ Borel sets, and functions measurable ($\mathcal{C}$) Baire functions. (A real
valued function f(x) is measurable ($\mathcal{C}$) if the inverse image under f of every real
Borel set S, i.e. the set $\{x \mid f(x) \in S\}$, belongs to $\mathcal{C}$.)

2 This is not the usual form in which this definition is given. Cf., for example, J. L.
Doob, One-parameter families of transformations, Duke Mathematical Journal, vol. 4,
(1938), p. 753. That our definition is, however, equivalent to the usual one is proved by
Paul R. Halmos, The decomposition of measures, Duke Mathematical Journal, vol. 8, (1941),
p. 387. We observe X is separable if and only if the Boolean algebra $\mathfrak{B}(\mathcal{X})$ has a countable
number of generators.

3 The concept of proper separability, first introduced by W. Ambrose and S. Kakutani,
Structure and continuity of measurable flows, Duke Mathematical Journal, vol. 9, (1942),
pp. 25-42, is fundamental in measure theory. Although it is possible to give examples of
separable but not properly separable measure spaces, these examples are all of a more or
less pathological kind. One such example is the unit interval, with the Borel field of all
sets of Lebesgue measure zero and their complements in the role of $\mathcal{X}$.





A measurable set E in the measure space $X(\mathcal{X}, m)$ is indecomposable if it
contains no proper measurable subsets other than the empty set; an element
$E \in \mathfrak{B}(\mathcal{X})$ is an atom if it contains no proper subelements, other than 0, in $\mathfrak{B}(\mathcal{X})$.
A measure space is non atomic if $\mathfrak{B}(\mathcal{X})$ has no atoms: in other words if every
measurable set of positive measure contains measurable subsets of smaller
positive measure. From the point of view of a study of the structure of measure
spaces indecomposable sets and atoms are uninteresting: we shall generally
assume that the former consist of exactly one point and the latter are absent.
More specifically our assumption will be described in the following terms.

A countable sequence, $A_1, A_2, \dots$, of subsets of X is a separating sequence
if to every pair of points, $x \ne y$, we may find an integer n with $x \in A_n$,
$y \in X - A_n$. If there exists in X a separating sequence of measurable sets,
an indecomposable set contains exactly one point. We shall now show that the
assumption of the existence of a separating sequence of measurable sets has a
similar effect on atoms. Let E be a set of positive measure which contains no
measurable subsets of smaller positive measure. It follows that for each n one
of the two sets, $EA_n$ and $E(X - A_n)$, has measure zero and the other one has
measure m(E). By a slight change of notation we may assume $m(EA_n) = m(E)$
for $n = 1, 2, \dots$. If we write $\prod_{n=1}^{\infty} A_n = A$, then we have $m(EA) = m(E)$;
since, however, A can contain at most one point, this implies that for some
point $x \in E$ we have $m(E - x) = 0$. In other words the existence of a
measurable separating sequence implies that the weight of an atom is concentrated at
one point; if, for example, we assume that the measure of a point is always zero,
we may infer that the space is non atomic. Since in a measure space, which has
by definition finite measure, there can be at most a countable set of points of
positive measure, and since their measure theoretic structure is clear, we shall
generally assume non-atomicity explicitly.

If $X_1(\mathcal{X}_1, m_1)$ and $X_2(\mathcal{X}_2, m_2)$ are measure spaces, a set isomorphism between
$X_1$ and $X_2$ is a measure preserving isomorphism between the Boolean algebras
$\mathfrak{B}(\mathcal{X}_1)$ and $\mathfrak{B}(\mathcal{X}_2)$. More specifically a set isomorphism is a one to one mapping
T from $\mathfrak{B}(\mathcal{X}_1)$ on $\mathfrak{B}(\mathcal{X}_2)$ which is such that

$T(X_1 - E) = X_2 - TE,$

$T\left(\sum_{n=1}^{\infty} E_n\right) = \sum_{n=1}^{\infty} TE_n,$

$m_1(E) = m_2(TE).$

If such a mapping T exists, $X_1$ and $X_2$ are set isomorphic.

After one more comment on notation we shall be ready to state and prove our
first result. Since the unit interval plays a fundamental role in our
investigations and is used as a yardstick with which to compare other measure spaces,
we find it convenient to introduce a special notation for it. We shall denote
the unit interval by $\bar{X}$, the collection of Lebesgue and Borel measurable sets by
$\bar{\mathcal{X}}$ and $\bar{\mathcal{C}}$ respectively, and Lebesgue measure by $\bar{m}$. In our terminology
$\bar{X} = \bar{X}(\bar{\mathcal{X}}, \bar{\mathcal{C}}, \bar{m})$ is a properly separable measure space. 4 5

Theorem 1. A necessary and sufficient condition that a measure space of total
measure one be set isomorphic to the unit interval is that it be separable and
non-atomic.

Proof. Since the unit interval is separable and non-atomic and since these
properties are evidently invariant under set isomorphisms, the necessity of our
conditions is clear. To prove their sufficiency, let $X(\mathcal{X}, \mathcal{C}, m)$ be the given
measure space, m(X) = 1, and let $A_1, A_2, \dots$ be a countable sequence of Borel
sets which span $\mathcal{C}$. We may assume (by adding a superfluous set to the $\{A_n\}$
if necessary) that $\sum_{n=1}^{\infty} A_n = X$. Then we may make correspond to every
rational number r, $0 \le r \le 1$, a set $B_r$ such that

(i) $\{A_n\}$ and $\{B_r\}$ span the same field;

(ii) $r < s$ implies $B_r \subset B_s$;

(iii) $\prod_{r > s} B_r = B_s$;

(iv) $\prod_r B_r = 0$, $\sum_r B_r = X$.

We now define, for every real number a, $0 \le a \le 1$, a set $B_a$ by $B_a = \prod_{r > a} B_r$.
It is clear that this definition of $B_a$ is consistent with its previous definition in
case a is rational, and that the family of sets $\{B_a\}$ satisfies the conditions (ii),
(iii), (iv) (where in (iii) and (iv) we extend the products and sums over an
arbitrary countable set of real numbers r for which inf r = s in (iii), inf r = 0
and sup r = 1, respectively, in (iv)). Moreover, condition (i) implies that
$B_a \in \mathcal{C}$ for all a and that the Borel field spanned by the $B_a$ is $\mathcal{C}$ itself.

Given now the family $B_a$ we may find a (uniquely determined) function f(x),
defined for $x \in X$, $0 \le f(x) \le 1$, for which $\{x \mid f(x) \le a\} = B_a$; we may, for
example, define

(1) $f(x) = \inf \{a \mid x \in B_a\}.$

The class of all sets of the form

(2) $f^{-1}(\bar{E}) = \{x \mid f(x) \in \bar{E}\},$

where $\bar{E}$ is an arbitrary Borel set in the unit interval, is a Borel field contained
in $\mathcal{C}$; since it contains all $B_a$, and therefore all $A_n$, it coincides with $\mathcal{C}$.

Let $F(a) = m\{x \mid f(x) \le a\} = m(B_a)$ be the distribution function of f(x):
F(a) is monotone non-decreasing from 0 to 1 as a ranges between 0 and 1, and
is continuous from the right. (This much is always true, of an arbitrary
distribution function.) In our special case we assert that F(a) is continuous. For
if $a = a_0$ is a discontinuity of F(a), then $\{x \mid f(x) = a_0\}$ is a set of positive
measure which therefore (non-atomicity) has Borel subsets of smaller positive
measure. Such a subset cannot be put in the form $f^{-1}(\bar{E})$, contrary to what
we have already proved.

4 In the sequel we shall sometimes use the notation $\bar{X}(\bar{\mathcal{X}}, \bar{\mathcal{C}}, \bar{m})$ for the perimeter of the
unit circle in the complex plane: it is clear that this space has the same measure theoretic
structure as the unit interval. We shall always make it clear whether the symbol $\bar{X}$ has
its real or its complex meaning.

5 Cf. (I), p. 602; see also J. L. Doob, Stochastic processes with an integral valued parameter,
Transactions of the American Mathematical Society, vol. 44, (1938), p. 91.

For any $\bar{x}$, $0 \le \bar{x} \le 1$, we define $\bar{f}(\bar{x}) = \inf \{a \mid F(a) \ge \bar{x}\}$. It is well known
(and easily verified) that $\bar{f}(\bar{x})$ is a strictly monotone increasing (not necessarily
continuous) function of $\bar{x}$, which increases from 0 to 1 as $\bar{x}$ does, and which is
continuous on the left. Moreover the distribution function of $\bar{f}(\bar{x})$ is again F(a).
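
The function $\inf \{a \mid F(a) \ge \bar{x}\}$ defined here can be approximated numerically by bisection when F is continuous and non-decreasing. The sketch below is an illustration under an assumed example, not part of the proof: it takes the function $x^2$ on the unit interval with Lebesgue measure, whose distribution function is $F(a) = \sqrt{a}$, and the names `F` and `quantile` are hypothetical.

```python
import math


def F(a):
    """Distribution function of x**2 on [0, 1]: the measure of {x : x**2 <= a}."""
    clipped = min(max(a, 0.0), 1.0)
    return math.sqrt(clipped)


def quantile(dist, x_bar, tol=1e-12):
    """Approximate inf {a in [0, 1] : dist(a) >= x_bar} by bisection.

    Assumes dist is non-decreasing on [0, 1] with dist(1) >= x_bar.
    """
    lo, hi = 0.0, 1.0
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        if dist(mid) >= x_bar:
            hi = mid   # mid already satisfies the condition; look lower
        else:
            lo = mid   # the condition fails at mid; the infimum lies above
    return hi
```

For this choice of F the computed values agree, up to the tolerance, with $\bar{x}^2$, which is strictly increasing from 0 to 1 as the text asserts.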

For any Borel set $\bar{E} \subset \bar{X}$, consider the set $\bar{f}^{-1}(\bar{E})$: we assert that the collection
of all sets of this form (which clearly forms a Borel field) coincides with $\bar{\mathcal{C}}$.
This is true since the increasing character of $\bar{f}(\bar{x})$ implies that every interval
$(0, \bar{x})$ has the form $\bar{f}^{-1}(\bar{E})$, where $\bar{E}$ can even be chosen as an interval.

Suppose that it ever happens that $f^{-1}(\bar{E}_1) = f^{-1}(\bar{E}_2)$. (We shall now make
use of the fact that for an arbitrary Baire function g(x), $0 \le g(x) \le 1$, the
correspondence $\bar{E} \to g^{-1}(\bar{E}) = \{x \mid g(x) \in \bar{E}\}$ is a homomorphism of $\bar{\mathcal{C}}$ into $\mathcal{C}$,
i.e. that $g^{-1}(\bar{X} - \bar{E}) = X - g^{-1}(\bar{E})$, and $g^{-1}(\bar{E}_1 + \bar{E}_2 + \cdots) =
g^{-1}(\bar{E}_1) + g^{-1}(\bar{E}_2) + \cdots$.) If we write $\bar{E}' = (\bar{E}_1 - \bar{E}_2) + (\bar{E}_2 - \bar{E}_1)$ for the
symmetric difference between $\bar{E}_1$ and $\bar{E}_2$, then it follows from the equality of
the distributions of f(x) and $\bar{f}(\bar{x})$, that $m\{f^{-1}(\bar{E}')\} = \bar{m}\{\bar{f}^{-1}(\bar{E}')\} = 0$.
Conversely, of course, $\bar{f}^{-1}(\bar{E}_1) = \bar{f}^{-1}(\bar{E}_2)$ implies the same result.

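Consequently the correspondence $f^{-1}(\bar{E}) \leftrightarrow \bar{f}^{-1}(\bar{E})$ is one to one, not
necessarily between $\mathcal{C}$ and $\bar{\mathcal{C}}$, but certainly between $\mathfrak{B} = \mathfrak{B}(\mathcal{X}) = \mathfrak{B}(\mathcal{C})$ and
$\bar{\mathfrak{B}} = \mathfrak{B}(\bar{\mathcal{X}}) = \mathfrak{B}(\bar{\mathcal{C}})$. It is clear that this correspondence preserves measure, and
the homomorphic nature of the mappings $\bar{E} \to f^{-1}(\bar{E})$ and $\bar{E} \to \bar{f}^{-1}(\bar{E})$ shows
that it is also an algebraic isomorphism.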

This concludes the proof of Theorem 1. 

2. Normal spaces; the geometric isomorphism theorem 

If $X_1(\mathcal{X}_1, m_1)$ and $X_2(\mathcal{X}_2, m_2)$ are measure spaces, a point isomorphism
between $X_1$ and $X_2$ is a one to one mapping T from almost all of $X_1$ on almost
all of $X_2$ such that $E_1 \in \mathcal{X}_1$ if and only if $E_2 = TE_1 \in \mathcal{X}_2$, and then $m_1(E_1) =
m_2(E_2)$. If such a mapping T exists, $X_1$ and $X_2$ are point isomorphic. Our
problem in this section is to find necessary and sufficient conditions in order
that a measure space be point isomorphic to the unit interval. The
fundamental concept in this connection is that of a normal space.

Definition 1. A measure space is proper if it is complete, properly separable,
and non-atomic, and if it contains a separating sequence of Borel sets.

Definition 2. A proper measure space is normal if to each real valued univalent
Baire function f(x) there corresponds a set $X_0$ of measure zero such that the range,
$f(X - X_0)$, is a Borel set.

The following lemmas concerning proper and normal spaces will be useful 
in the sequel. 

Lemma 1. On every proper measure space $X(\mathcal{X}, \mathcal{C}, m)$ there exist real valued
bounded univalent Baire functions.





Proof. Since X is certainly separable and non-atomic the construction of
the proof of Theorem 1 applies. We assert that the real valued bounded Baire
function f(x) defined by (1) is univalent. For if the set $\{x \mid f(x) = a\}$ contained
more than one point, then the intersection of this set with a Borel set separating
two of its points could not be expressed in the form $\{x \mid f(x) \in \bar{E}\}$. Since,
however, the proof of Theorem 1 establishes that every Borel set has this form,
f(x) must be univalent.

Lemma 2. If $X(\mathcal{X}, \mathcal{C}, m)$ is a proper measure space with the property that the
condition of Definition 2 is satisfied by every bounded function then X is normal.

Proof. Let f(x) be any univalent Baire function, and let G(y) be any
continuous function which maps the infinite interval, $-\infty < y < +\infty$, in a one
to one way on a finite interval. Then g(x) = G(f(x)) is a Baire function which
is univalent and bounded, hence, by hypothesis, there is a set $X_0$ of measure
zero such that $g(X - X_0)$ is a Borel set. The image of this Borel set under the
one to one continuous mapping $G^{-1}(y)$ is the range $f(X - X_0)$ which is therefore
also a Borel set.

Lemma 3. If $X(\mathcal{X}, \mathcal{C}, m)$ is a normal space, $B \subset X$ is a Borel set, and f(x)
is a real valued univalent Baire function, then there is a set $B_0 \subset B$ of measure zero
such that $f(B - B_0)$ is a Borel set. $B_0$ can even be chosen in the form $BX_0$, where
$X_0$ is a Borel set of measure zero, depending on f but not on B.

Proof. We shall carry out the proof in three steps, first establishing the
existence of a suitable $B_0$ corresponding to a fixed B, then showing that $B_0$
may even be chosen as a Borel set, and, finally, proving on the basis of our
separability hypotheses, that we may choose $B_0$ in the form described in the
statement of the lemma.

(i) We observe that the first statement asserts, essentially, that a Borel set
in a normal space is itself a normal space. Accordingly, using Lemma 2, we
may assume that f(x) is bounded. Let f'(x) be a bounded univalent Baire
function on X (Lemma 1); by appropriate linear transformations of f(x) and of
f'(x) we can secure

$0 \le f(x) \le 1 < f'(x)$

throughout X. Then the function f*(x), defined to be equal to f(x) on B and
to f'(x) on $B' = X - B$, is a univalent Baire function on X, hence for a suitable
set $X_0$ of measure zero, $f^*(X - X_0)$ is a Borel set. The intersection of this
Borel set with the closed interval (0, 1) is also a Borel set: this intersection is,
however, precisely $f(B - B_0)$, where $B_0 = BX_0$.

(ii) Let $B_1$ be a Borel set of measure zero, $B_1 \supset B_0$. Applying the result of
(i) to $X - B_1$ we may find a set $B_2 \supset B_1$ of measure zero such that $f(X - B_2)$
is a Borel set. We proceed similarly by induction, choosing $B_3 \supset B_2$ to be a
Borel set of measure zero, choosing $B_4 \supset B_3$ so that $f(X - B_4)$ is a Borel set,
and so on. We have $B_0 \subset B_1 \subset B_2 \subset B_3 \subset \cdots$; all $B_n$ are of measure zero;
for n odd $B_n$ is a Borel set; for n even $f(X - B_n)$ is a Borel set. We write
$B_0^* = \sum_{n=0}^{\infty} B_n$. Then $B_0^*$ has measure zero, and, because of the monotone



338 


PAUL R. HAMLOS AND JOHN VON NEUMANN 


character of the sequence {B n }, -Bo = ]^n-o B 2n +i, so that B* is a Borel set. 
Similarly X - B 0 * = X - En-o B 2n = ]I«-o (X - B 2n ), so that/(X - B 0 *) is a 
Borel set, and also B(X — B?) = (B — B 0 )(X — B?), so that/(B(X — B?)) = 
/(B — B 0 )/(X — B*) is a Borel set. We may accordingly change notation and 
denote by B 0 the intersection of B and B* : this new B 0 is a Borel set of measure 
zero with the property that /(B — B 0 ) is a Borel set. 

(iii) Let $A_1, A_2, \cdots$ be a sequence which spans $\mathcal{C}$, and apply the
result of (ii) to find, for each $n$, a Borel set $A_n^0 \subset A_n$, of measure zero,
such that $f(A_n - A_n^0)$ is a Borel set. We write $A^0 = \sum_{n=1}^{\infty} A_n^0$, and
we apply (ii) once more, this time to $X - A^0$, to find a Borel set $X_0 \supset A^0$,
of measure zero, such that $f(X - X_0)$ is a Borel set. Let us write
$A_n' = A_n - A_n^0$, and let $\mathcal{C}'$ be the Borel field ($\subset \mathcal{C}$)
spanned by the $A_n'$. Then we have $(X - X_0)A_n = (X - X_0)A_n'$ for all $n$, and we
see, moreover, that to every Borel set $B$ (i.e. to every set $B \in \mathcal{C}$) there
corresponds a set $B' \in \mathcal{C}'$ such that $(X - X_0)B = (X - X_0)B'$. Since
$f(A_n')$ is a Borel set, and since the collection of sets $A$ for which $f(A)$ is a
Borel set is clearly a Borel field (because $f$ is univalent), it follows that for
every $B' \in \mathcal{C}'$, $f(B')$ is a Borel set. Consequently for every
$B \in \mathcal{C}$

$$f(B - BX_0) = f(B(X - X_0)) = f(B'(X - X_0)) = f(B') f(X - X_0),$$

so that $f(B - BX_0)$ is a Borel set, and the proof of the lemma is complete.

Lemma 4. If $X(\mathcal{X}, \mathcal{C}, m)$ is a proper measure space, and if for a
single real valued univalent Baire function $g(x)$ we can find a set $X_0$ of measure
zero such that $g(B - BX_0)$ is a Borel set whenever $B$ is, then $X$ is normal and,
moreover, this same set $X_0$ will satisfy the condition of definition 2 for any real
valued univalent Baire function $f(x)$.

Proof. We write $Y = g(X - X_0)$; for every $y_0 \in Y$, $y_0 = g(x_0)$, we define
$F(y_0) = f(x_0)$. $F(y)$ is then a real valued univalent function of the real variable
$y \in Y$. Since

$$\{y \mid F(y) < \alpha\} = g[\{x \mid f(x) < \alpha\}(X - X_0)], \tag{3}$$

and since the right member is a Borel set by hypothesis, $F(y)$ is a Baire function.
Since $f(x) = F(g(x))$, we have $f(X - X_0) = F(Y)$, and therefore $f(X - X_0)$
is a Borel set. 6

An important class of measure spaces is the class of m-spaces. An m-space
is a complete measure space $X(\mathcal{X}, m)$ on which a metric is defined so that,
topologically, it is a complete separable space, and which satisfies the following
two conditions:

(i) the measure of a non-empty open set is positive;

(ii) for every measurable set $E$, $m(E) = \inf \{m(O) \mid E \subset O,\ O \text{ open}\}$.

With the Borel field $\mathcal{C}$ of Borel sets (in the usual topological sense of the
word) $X = X(\mathcal{X}, \mathcal{C}, m)$ becomes a proper measure space; it is a known
result of topology that it is even normal in our sense of the word, and that the
exceptional set $X_0$ of measure zero may even be chosen as the empty set. 7

6 See F. Hausdorff, Mengenlehre, Berlin, 1935, p. 266.

We shall use m-spaces later; at present we mention them only as examples of
normal spaces. The following theorem, the main theorem of the present section,
applies to m-spaces (since they are normal), and shows that, measure
theoretically, they are isomorphic to the unit interval.

Theorem 2. A necessary and sufficient condition that a measure space of total 
measure one be point isomorphic to the unit interval is that it be normal. 

Proof. The necessity of our condition is obvious: the unit interval is normal
and normality is invariant under point isomorphism. Before giving a proof of
sufficiency we remark on the hypotheses. Since the various conditions in the
definition of a proper space are logically independent, they are obviously
indispensable for a sufficiency proof. It is possible that the condition of normality
could be replaced by a weaker one, but examples seem to indicate that it is the
best way of expressing that the space is “measurable in itself.”

For the proof of sufficiency we use the notations of the proof of Theorem 1;
in particular we use the functions $f(x)$ and $\bar{f}(\bar{x})$ that we defined there.

We denote by $D$ and $\bar{D}$ the ranges of $f(x)$ and $\bar{f}(\bar{x})$ respectively.
By omitting from $X$ a set of measure zero we may, by normality, assume that $D$ is a
Borel set; $\bar{D}$ is also a Borel set. (We observe that the omission of a set of
measure zero does not change the distribution of $f$ and hence does not change
$\bar{f}$ at all.) Form the set $R = (D - \bar{D}) + (\bar{D} - D)$. Since
$\bar{f}^{-1}(D - \bar{D}) = \bar{f}^{-1}(D) - \bar{f}^{-1}(\bar{D})$ lies entirely in the
complement of $\bar{f}^{-1}(\bar{D})$, and since this complement is empty,
$\bar{f}^{-1}(D - \bar{D})$ is empty. Since $f^{-1}(D - \bar{D})$ has the same measure as
$\bar{f}^{-1}(D - \bar{D})$, this proves that the measure of $f^{-1}(D - \bar{D})$ is zero.
Similarly we can prove that the measure of both $\bar{f}^{-1}(\bar{D} - D)$ and
$f^{-1}(\bar{D} - D)$ is zero (and, in fact, the latter is empty). Hence if we omit
from both $X$ and $\bar{X}$ a Borel set, namely $f^{-1}(R)$ and $\bar{f}^{-1}(R)$
respectively, of measure zero, on the remainder $f$ and $\bar{f}$ are univalent Baire
functions with identical (Borel measurable) ranges.

If to every $x \in X$ (after the omission, as described, of a set of measure zero),
we make correspond the point $\bar{f}^{-1}(f(x)) \in \bar{X}$, the correspondence is one
to one. Moreover if $B$ is any Borel set in $X$, and $B' = f(B)$, then $B'$ is a Borel
set and $f^{-1}(B') = B$. Consequently, considered as an element of the Boolean algebra
$\mathfrak{B}(\mathcal{X})$, the correspondent, under the set mapping described in the
proof of Theorem 1, of $B$ is $\bar{f}^{-1}(B') = \bar{B} = \bar{f}^{-1}(f(B))$, so that the
point mapping just described induces precisely the same set isomorphism between
$\mathfrak{B}$ and $\bar{\mathfrak{B}}$. It follows that this point correspondence is
measure preserving. This concludes the proof of Theorem 2.

3. The relation between set transformations and point transformations 

If $T$ is a measure preserving transformation (i.e. a point isomorphism) of a
measure space $X(\mathcal{X}, m)$ on itself, then $T$ induces a set mapping (of
$\mathfrak{B} = \mathfrak{B}(\mathcal{X})$ on itself) by making correspond to every set
$E \in \mathcal{X}$ the set $TE \in \mathcal{X}$. It is known that in an m-space the
converse is true: every set isomorphism is induced in this way by a point
isomorphism. 8 Motivated by this we give the following definition.

7 See Hausdorff, op. cit., p. 269.

Definition 3. A measure space $X(\mathcal{X}, m)$ has sufficiently many measure
preserving transformations if every set isomorphism of $\mathfrak{B}$ on itself is
induced by a point isomorphism of $X$ on itself.

It follows from Theorem 2 that every normal space has sufficiently many 
measure preserving transformations. In between the two concepts (normal 
spaces and spaces with sufficiently many measure preserving transformations) 
there is, however, room for a pathological occurrence which we shall describe in 
this section. We begin by proving some auxiliary results. 

Lemma 5. If two point mappings, on a measure space $X$ which contains a
separating sequence $E_1, E_2, \cdots$ of measurable sets, induce the same set mapping
on $\mathfrak{B}$ then they differ on at most a set of measure zero.

Proof. It is sufficient to consider the case where one of the transformations
is the identity. If then $TE_n$ and $E_n$ differ only on a set of measure zero, for
$n = 1, 2, \cdots$, it follows that all $T^k E_n$ differ from each other only on sets of
measure zero. Hence the invariant set

$$F_n = \sum_{k=-\infty}^{\infty} T^k E_n - \prod_{k=-\infty}^{\infty} T^k E_n$$

has measure zero. We form the invariant set $X'$ by omitting from $X$ the set
$\sum_{n=1}^{\infty} F_n$ of measure zero. If now $x \ne Tx$, then some $E_n$ contains one
but not both of $x$ and $Tx$, and therefore $x$ is contained in one but not both of
$E_n$ and $T^{-1}E_n$. Consequently $x \in F_n$, so that $x \notin X'$.

Lemma 6. Let $X(\mathcal{X}, m)$ be a measure space and let $X' \subset X$ be any (not
necessarily measurable) subset of $X$. Let $\mathcal{X}'$ be the collection of all sets
of the form $E' = X'E$, with $E \in \mathcal{X}$; for every $E' \in \mathcal{X}'$,
$E' = X'E$, define $m'(E') = m(E)$. With these definitions $m'$ is uniquely determined
(so that $X'(\mathcal{X}', m')$ is a measure space) if and only if the outer measure of
$X'$ in $X$ is equal to the measure of $X$. 9

Lemma 7. If $\{\varphi_n(x)\}$, $n = 1, 2, \cdots$, is a complete orthonormal set of
functions in $L_2(X)$, where $X(\mathcal{X}, m)$ is a measure space which contains a
separating sequence, $E_1, E_2, \cdots$, of measurable sets, then there is a set
$N \in \mathcal{X}$ of measure zero such that $x, y \notin N$ and
$\varphi_n(x) = \varphi_n(y)$ for $n = 1, 2, \cdots$, implies $x = y$.

8 See John von Neumann, Einige Sätze über messbare Abbildungen, Annals of
Mathematics, vol. 33, (1932), p. 582. In definition 5, p. 576, all descriptive properties
of the transformation (such for example as $M_1 + M_2 \to M_1' + M_2'$) should be
modified by the phrase “neglecting sets of measure zero.”

9 The outer measure of $E_0$, $m^*(E_0)$, is defined by
$m^*(E_0) = \inf \{m(E) \mid E_0 \subset E \in \mathcal{X}\}$. Similarly we may define the
inner measure, $m_*(E_0) = \sup \{m(E) \mid E_0 \supset E \in \mathcal{X}\}$. If $X$ is
complete then $E_0$ is measurable (i.e. $E_0 \in \mathcal{X}$) if and only if
$m^*(E_0) = m_*(E_0)$, in which case both are equal to $m(E_0)$. In case $X$ is properly
separable it is sufficient to take the supremum and infimum over Borel sets $E$. For
the proof of Lemma 6, see J. L. Doob, Stochastic processes depending on a continuous
parameter, Transactions of the American Mathematical Society, vol. 42, (1937),
pp. 109-110.





Proof. Let $\psi_m(x)$ be the characteristic function of $E_m$; we have

$$\psi_m(x) = \sum_{n=1}^{\infty} a_{nm} \varphi_n(x), \tag{4}$$

in the sense of convergence in the mean (of order two). Consequently, for
each $m$, a subsequence of the partial sums of the series in (4) converges to
$\psi_m(x)$ almost everywhere; for each $m$ we choose a fixed subsequence with this
property and we let $N$ be the union of all the sets of measure zero at which
these subsequences do not converge to $\psi_m(x)$. If $x, y \notin N$ and
$\varphi_n(x) = \varphi_n(y)$ for all $n$, then it follows that $\psi_m(x) = \psi_m(y)$
for all $m$, whence (using the fact that $E_1, E_2, \cdots$ is a separating sequence)
$x = y$.

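Lemma 8. Let $\bar{X}(\bar{\mathcal{X}}, \bar{\mathcal{C}}, \bar{m})$ be the perimeter of
the unit circle in the complex plane, and let $\mu(\bar{E})$ be any measure (i.e. a
countably additive, non-negative set function with $\mu(\bar{X}) = 1$) defined for
$\bar{E} \in \bar{\mathcal{C}}$. If for a single number $\lambda$, with $|\lambda| = 1$
and $(\arg \lambda)/2\pi$ irrational, $\mu$ is invariant under rotation through
$\arg \lambda$, i.e. $\mu(\lambda\bar{E}) = \mu(\bar{E})$ for every
$\bar{E} \in \bar{\mathcal{C}}$, then $\mu(\bar{E}) = \bar{m}(\bar{E})$.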

Proof. Let $\bar{A}_1$ and $\bar{A}_2$ be any two closed intervals (arcs) of the same
length in $\bar{X}$. Since the sequence $\{\lambda^n\}$ of powers of $\lambda$ is
everywhere dense in $\bar{X}$, we may find a sequence $\{n_j\}$ of positive integers,
so that

$$\lim_{j \to \infty} \lambda^{n_j} \bar{A}_1 = \bar{A}_2, \tag{5}$$ 10

and consequently

$$\lim_{j \to \infty} \mu(\lambda^{n_j} \bar{A}_1) = \mu(\bar{A}_2). \tag{6}$$ 11

Since $\mu(\lambda^{n_j} \bar{A}_1) = \mu(\bar{A}_1)$ for every $j$, we have proved that
$\mu(\bar{A}_1) = \mu(\bar{A}_2)$. Thus $\mu(\bar{A})$ is a function of the arc length of
$\bar{A}$, i.e. of $\bar{m}(\bar{A})$. This numerical function is clearly monotone and
additive, hence proportional to $\bar{m}(\bar{A})$. Considering $\bar{A} = \bar{X}$
shows that the factor of proportionality is 1. Thus $\mu(\bar{E})$ and
$\bar{m}(\bar{E})$ agree for arcs, and therefore for all Borel sets.

As an immediate consequence of this lemma we observe that if for any Borel
set $\bar{E}_0$ we have $\bar{E}_0 = \lambda\bar{E}_0$, then $\bar{m}(\bar{E}_0) = 0$ or
else $\bar{m}(\bar{E}_0) = 1$, for otherwise

$$\mu(\bar{E}) = \bar{m}(\bar{E}\bar{E}_0) / \bar{m}(\bar{E}_0)$$

would contradict what we just proved.
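
The density of the powers of $\lambda$, on which the proof of Lemma 8 rests, can be
illustrated numerically. What follows is only a sketch under stated assumptions: the
circle is identified with the interval $[0, 1)$, so that rotation through $\arg \lambda$
becomes addition of $\theta = (\arg \lambda)/2\pi$ modulo 1; the function names, the
choice $\theta = (\sqrt{5} - 1)/2$, and the sample sizes are hypothetical and were
introduced for the illustration only.

```python
import math


def orbit(theta, n):
    """First n points k*theta mod 1 (k = 0, ..., n-1) of the rotation orbit of 0."""
    return [(k * theta) % 1.0 for k in range(n)]


def hits_every_arc(theta, n, arcs):
    """True if the orbit meets each of the `arcs` equal arcs [j/arcs, (j+1)/arcs)."""
    seen = {min(int(p * arcs), arcs - 1) for p in orbit(theta, n)}
    return len(seen) == arcs


def orbit_fraction(theta, n, start, length):
    """Fraction of the first n orbit points in the arc [start, start + length) mod 1."""
    count = sum(1 for p in orbit(theta, n) if (p - start) % 1.0 < length)
    return count / n


# Hypothetical choice of an irrational rotation number (an assumption of this sketch).
THETA = (math.sqrt(5) - 1) / 2

if __name__ == "__main__":
    # An irrational rotation visits every small arc; a rational one does not.
    print(hits_every_arc(THETA, 10_000, 100))
    print(hits_every_arc(0.25, 10_000, 100))
    # The share of orbit points in an arc is close to the arc length.
    print(orbit_fraction(THETA, 10_000, 0.2, 0.3))
```

The last quantity approximates the arc length, which is in keeping with the assertion
of the lemma that the only normalized measure invariant under such a rotation is
$\bar{m}$; the computation is of course no substitute for the proof.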

After these preliminaries we are now ready to introduce the pathological
concept we mentioned at the beginning of this section.

Definition 4. A (not necessarily measurable) subset $E$ of a measure space $X$
is absolutely invariant if for every measure preserving transformation $T$ of $X$ on
itself, the symmetric difference $(E - TE) + (TE - E)$ is measurable and has
measure zero.

Lemma 9. If $E$ is measurable and $m(E) = 0$ or $m(E) = m(X)$ then $E$ is
absolutely invariant. Conversely if $X$ is separable and non-atomic and $E \subset X$
is measurable and absolutely invariant, then $m(E) = 0$ or $m(E) = m(X)$; if $E$ is
not measurable and absolutely invariant then $m_*(E) = 0$, $m^*(E) = m(X)$.

10 See S. Saks, Theory of the integral, Warszawa, 1937, p. 5.

11 See Saks, op. cit., p. 8.

Proof. The first statement is obvious. To prove the remaining statements
we observe that if $T$ is a measure preserving transformation and if $A$ is a set
(almost) invariant under $T$, in the sense that $(A - TA) + (TA - A)$ is measurable
and has measure zero, then any measurable cover, $A^*$, and any measurable
kernel, $A_*$, of $A$ are also (almost) invariant under $T$. 12 For $A \subset A^*$
implies $TA \subset TA^*$; since $TA$ and $A$ are almost equal, and $T$ is measure
preserving, $TA^*$ is a measurable cover of $TA$, and therefore $TA^* + (A - TA)$ is a
measurable cover of $A$. It follows (since any two measurable covers of $A$ are almost
equal) that $TA^*$ is almost equal to $A^*$, as was to be proved. A similar argument
applies to measurable kernels.

It follows from the preceding paragraph that if $E$ is absolutely invariant then
so are $E^*$ and $E_*$. If we knew that a measurable absolutely invariant set must
have measure zero or $m(X)$, we could conclude that for a non-measurable absolutely
invariant $E$, $m_*(E) = 0$ and $m^*(E) = m(X)$. In the case where $X$ is the
perimeter of the unit circle, there are many examples of measure preserving
transformations whose measurable invariant sets all have measure zero or $m(X)$:
in fact the rotations described in Lemma 8 are such. If a set is invariant under
all measure preserving transformations it is a fortiori invariant under these and
hence if it is measurable it will have measure zero or $m(X)$. The general case
is, however, reduced to the case of the circle by Theorem 1.

To show that the concept of absolute invariance is not vacuous we shall now 
show that non-measurable absolutely invariant sets exist. In the existence 
proof we make free use of the continuum hypothesis and well ordering. 

Lemma 10. If $X = X(\mathcal{X}, \mathcal{C}, m)$ is a proper measure space of total
measure one, there exists an absolutely invariant set $E \subset X$ with $m_*(E) = 0$,
$m^*(E) = 1$.

Proof. Since on a separable measure space there are at most $c$ (= the power
of the continuum) set transformations (since a set transformation is completely
determined by its behavior on a countable collection of sets, and the set of all
functions from a set of power $\aleph_0$ to a set of power $c$ has power $c$), it
follows from Lemma 5 that we may find a set of at most $c$ measure preserving
transformations of $X$ on itself with the property that every measure preserving
transformation differs on at most a set of measure zero from one of the given set.
Let this set be well ordered, so that to each ordinal $\alpha < \Omega$ (= the first
uncountable ordinal) there corresponds a measure preserving transformation
$T_\alpha$. We may similarly enumerate the collection of all Borel sets of positive
measure: let these be denoted by $E_\alpha$, $\alpha < \Omega$.

For any $x \in X$ and any $\alpha < \Omega$ we write

$$C_\alpha(x) = \{T_{\beta_1}^{m_1} \cdots T_{\beta_k}^{m_k} x \mid \beta_i \le \alpha,\ k = 1, 2, \cdots;\ m_i = 0, \pm 1, \pm 2, \cdots\}.$$

12 $A^*$ [or $A_*$] is a measurable cover [or kernel] of $A$ if it is measurable, if
$A \subset A^*$ [or $A_* \subset A$], and if $m^*(A) = m(A^*)$ [or $m_*(A) = m(A_*)$].
If $A_1^*$ and $A_2^*$ are measurable covers of $A$ then
$(A_1^* - A_2^*) + (A_2^* - A_1^*)$ has measure zero.





$C_\alpha(x)$ is the smallest set containing $x$ and invariant under $T_\beta$ for all
$\beta \le \alpha$. Further relevant properties of $C_\alpha(x)$ are the following.
$C_\alpha(x)$ is a countable set; for $\alpha \le \beta$,
$C_\alpha(x) \subset C_\beta(x)$; if $y \notin C_\alpha(x)$, then $C_\alpha(y)$ and
$C_\alpha(x)$ are disjoint.

By transfinite induction we now define points $x_\alpha$ and $y_\alpha$. $x_1$ is
chosen in $E_1$; $y_1$ is chosen in $E_1$ but not in $C_1(x_1)$. Since $C_1(x_1)$ is
countable and $E_1$ (being a Borel set of positive measure) is not, the choice of
$y_1$ is possible. If $x_\alpha$ and $y_\alpha$ are defined for all $\alpha < \beta$, we
define $x_\beta$ as follows. Since the set

$$\sum_{\alpha < \beta} \{C_\beta(x_\alpha) + C_\beta(y_\alpha)\}$$

is countable, we may choose $x_\beta \in E_\beta$ so that $x_\beta$ is not in this set.
After this is done we may add $C_\beta(x_\beta)$ to this set and choose $y_\beta$ so that
$y_\beta \in E_\beta$, but $y_\beta$ is not in the enlarged set.

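Concerning the points $x_\alpha$ and $y_\alpha$ we now assert: for any $\alpha$ and
$\beta$, $\alpha \gtreqless \beta$, $C_\alpha(x_\alpha)$ and $C_\beta(y_\beta)$ are
disjoint. If $\alpha \le \beta$, then we know, by definition, that
$y_\beta \notin C_\beta(x_\alpha)$ so that $C_\beta(y_\beta)$ and $C_\beta(x_\alpha)$ are
disjoint; a fortiori $C_\beta(y_\beta)$ and $C_\alpha(x_\alpha)$ are disjoint. If
$\alpha > \beta$, then again $x_\alpha$ is not in $C_\alpha(y_\beta)$ so that
$C_\alpha(x_\alpha)$ and $C_\alpha(y_\beta)$ are disjoint, and therefore so also are
$C_\alpha(x_\alpha)$ and $C_\beta(y_\beta)$.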

We write

$$A = \sum_{\alpha < \Omega} C_\alpha(x_\alpha), \qquad B = \sum_{\beta < \Omega} C_\beta(y_\beta);$$

it follows that $A$ and $B$ are disjoint. Since $A$ contains $x_\alpha$ and $B$ contains
$y_\beta$, both $A$ and $B$ have at least one point in common with every Borel set of
positive measure; consequently $X - A$ and $X - B$ cannot contain any such sets. It
follows that both $A$ and $B$ have outer measure one (since their complements
have inner measure zero), and since each is contained in the complement of the
other, they both have inner measure zero.

It is now easy to see that $A$ is (almost) invariant under every measure preserving
transformation $T$. Given $T$ we may find $\beta < \Omega$, such that $T$ and $T_\beta$
differ on at most a set of measure zero. Also we have

$$T_\beta A = \sum_{\alpha < \Omega} T_\beta C_\alpha(x_\alpha).$$

Since for $\alpha \ge \beta$, $C_\alpha(x_\alpha)$ is invariant under $T_\beta$, $A$ and
$T_\beta A$ can differ at most on the countable set
$\sum_{\alpha < \beta} \{C_\alpha(x_\alpha) + T_\beta C_\alpha(x_\alpha)\}$. Since
$T_\beta A$ and $TA$ differ on at most a set of measure zero, we have proved that $A$
and $TA$ differ on at most a set of measure zero. We may choose either $A$ or $B$ for
the $E$ of Lemma 10.

The following two lemmas establish the connection between absolute invariance
and the property of having sufficiently many measure preserving transformations.

Lemma 11. Let $X(\mathcal{X}, m)$ be a measure space of total measure one with
sufficiently many measure preserving transformations, and let $X' \subset X$ be any
subset of $X$ with $m^*(X') = 1$. If $X'$ is absolutely invariant, then the measure
space $X'(\mathcal{X}', m')$ (defined in Lemma 6) has sufficiently many measure
preserving transformations.





Proof. The correspondence $E \rightleftarrows E' = X'E$ is a set isomorphism between
$\mathfrak{B} = \mathfrak{B}(\mathcal{X})$ and $\mathfrak{B}' = \mathfrak{B}(\mathcal{X}')$.
Through this isomorphism any set mapping of $X'$ on itself (i.e. any set isomorphism
of $\mathfrak{B}'$ on itself) induces a set mapping of $X$ on itself. Since $X$ has, by
hypothesis, sufficiently many measure preserving transformations, it follows that
to any set mapping $T'$ on $X'$ there corresponds a measure preserving
transformation $T$ of $X$ on itself, such that $T$ induces the same set mapping of $X$
as $T'$. Since $X'$ is absolutely invariant, $(X' - TX') + (TX' - X')$ has measure
zero; let $N'$ be the smallest set invariant under $T$ which contains this set of
measure zero. We may redefine $T$ on $N'$ to be the identity; the resulting $T$ leaves
$X'$ strictly invariant and may therefore be considered as a measure preserving
transformation of $X'$ on itself. It is clear that this measure preserving
transformation induces the set isomorphism $T'$ on $X'$ and that, therefore, $X'$ has
sufficiently many measure preserving transformations.

Lemma 12. Let $X(\mathcal{X}, m)$ be a measure space of total measure one which has a
separating sequence of measurable sets, and let $X' \subset X$ be any subset of $X$ with
$m^*(X') = 1$. If the measure space $X'(\mathcal{X}', m')$ (defined in Lemma 6) has
sufficiently many measure preserving transformations then $X'$ is an absolutely
invariant subset of $X$.

Proof. We use the notation introduced in the proof of Lemma 11. Let $T$
be any measure preserving transformation on $X$; through the correspondence
$E \rightleftarrows E' = X'E$, $T$ induces a set mapping $T'$ on $X'$. Since $X'$ has
sufficiently many measure preserving transformations, the set mapping $T'$ of $X'$ is
induced by some measure preserving transformation, say $S$, of $X'$ on itself. We
shall prove that for almost every point $x \in X'$, $Sx = Tx$.

For any set $E \in \mathcal{X}$ we know that $S^{-1}E' = S^{-1}(X'E)$ and $X'T^{-1}E$
differ on at most a set of measure zero (since $S$ and $T$ induce the same set mapping
on $X'$): we denote this set of measure zero by $N_E$, and we write $N$ for the union
of all $N_E$, where we allow $E$ to run through a separating sequence. Let $x$ be
any point in $X' - N$; we assert that $Sx = Tx$. If this were not true, we could
find a set $E$, belonging to the separating sequence used above, such that
$Sx \in E$ and $Tx \notin E$. Since $x \in X'$, $Sx \in X'$, and therefore
$x \in S^{-1}(X'E)$; since $Tx \notin E$, a fortiori $x \notin X' \cdot T^{-1}E$. It
follows that $x \in N_E \subset N$; since this contradicts the choice of $x$, we must
have $Sx = Tx$.

We have proved that $T$ leaves almost every point of $X'$ in $X'$: in other words
$X'$ is almost invariant under $T$. Since $T$ was arbitrary, it follows that $X'$ is
absolutely invariant.

We conclude this section with an isomorphism theorem that makes clear the
structure of measure spaces with sufficiently many measure preserving
transformations.

Theorem 3. A necessary and sufficient condition that a proper measure space 
of total measure one have sufficiently many measure preserving transformations is 
that it be point isomorphic to an absolutely invariant subset of the unit interval. 

Proof. Since the property of possessing sufficiently many measure preserving
transformations is invariant under point isomorphism, and since, by Lemma 11,
an absolutely invariant set has this property, sufficiency is clear.

To prove necessity we first observe that the given measure space,
$X(\mathcal{X}, \mathcal{C}, m)$, is set isomorphic with
$\bar{X}(\bar{\mathcal{X}}, \bar{\mathcal{C}}, \bar{m})$ in virtue of Theorem 1. (It will
be most convenient in this proof to think of $\bar{X}$ as the perimeter of the unit
circle in the complex plane.) Consider on $\bar{X}$ the measure preserving
transformation $\bar{x} \to \lambda\bar{x}$, where $\lambda \in \bar{X}$ is a fixed
number with $(\arg \lambda)/2\pi$ irrational. The set isomorphism between $X$ and
$\bar{X}$ makes correspond to this transformation on $\bar{X}$ a certain measure
preserving transformation $T$ on $X$. A set isomorphism may also be considered as a
mapping of the characteristic functions of $X$ on the characteristic functions of
$\bar{X}$: this mapping may be extended to all $L_2(X)$ and thus generates an
isomorphism between $L_2(X)$ and $L_2(\bar{X})$. Let $\varphi(x)$ be the correspondent
on $X$ of the function $\bar{\varphi}(\bar{x}) = \bar{x}$ on $\bar{X}$; the function
$\varphi(x)$ has the following properties:

(i) $|\varphi(x)| = 1$;

(ii) $\varphi(Tx) = \lambda\varphi(x)$;

(iii) $\{\varphi_n(x)\} = \{(\varphi(x))^n\}$, $n = 0, \pm 1, \pm 2, \cdots$, is a
complete orthonormal set in $L_2(X)$.

(To be precise: since $\varphi(x)$ is determined only up to a set of measure zero,
properties (i) and (ii) need to be true only almost everywhere. It is clear, however,
that by changing $\varphi$ on a set of measure zero we may assume that (i) and (ii)
are always true. We may also assume, and we find it convenient to do so, that
$\varphi(x)$ is a Baire function.)

We apply Lemma 7 to $\{\varphi_n(x)\}$ to obtain a set $N$ of measure zero with the
property described there. By increasing $N$, if necessary, we may assume that
$N$ is invariant under $T$. We now omit the points of $N$ from $X$: we shall show
that the remainder (henceforth to be denoted by $X$ again) is in one to one
measure preserving correspondence with an absolutely invariant subset of $\bar{X}$.

The function $x' = \varphi(x)$ defines a mapping from $X$ to $\bar{X}$; we know that
this mapping is Borel measurable (i.e. that the inverse image of a set in
$\bar{\mathcal{C}}$ lies in $\mathcal{C}$), and we assert furthermore that it is
univalent. For if we had $\varphi(x) = \varphi(y)$, then we should also have
$\varphi_n(x) = \varphi_n(y)$ for all $n$, and this possibility is precisely
what we eliminated when we threw away the set $N$.

The transformation $T$ is carried by the mapping $\varphi$ into some transformation
$T'$ of the range $\varphi(X) = X' \subset \bar{X}$ into itself; since

$$T'x' = \varphi(T\varphi^{-1}(x')) = \lambda x',$$

we see that $X'$ is invariant under the rotation $\bar{x} \to \lambda\bar{x}$.

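For every Borel set $\bar{E} \subset \bar{X}$ (i.e. $\bar{E} \in \bar{\mathcal{C}}$) we
define $\mu(\bar{E}) = m(\varphi^{-1}(\bar{E}))$. Since
$\varphi(T\varphi^{-1}(\bar{E})) = X' \cdot \lambda\bar{E}$, we have
$T\varphi^{-1}(\bar{E}) = \varphi^{-1}(X' \cdot \lambda\bar{E}) = \varphi^{-1}(\lambda\bar{E})$.
Since $T$ is measure preserving it follows that

$$\mu(\bar{E}) = m(\varphi^{-1}(\bar{E})) = m(T\varphi^{-1}(\bar{E})) = m(\varphi^{-1}(\lambda\bar{E})) = \mu(\lambda\bar{E}).$$

Hence, by Lemma 8, $\mu(\bar{E}) = \bar{m}(\bar{E})$.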

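Suppose, finally, that $\bar{E}_1$ and $\bar{E}_2$ are Borel subsets of $\bar{X}$ for
which $X'\bar{E}_1 = X'\bar{E}_2$. Write
$\bar{E} = (\bar{E}_1 - \bar{E}_2) + (\bar{E}_2 - \bar{E}_1)$: it follows that
$X'\bar{E}$ is empty, so that $\varphi^{-1}(\bar{E})$ is empty and
$\mu(\bar{E}) = m(\varphi^{-1}(\bar{E})) = \bar{m}(\bar{E}) = 0$. This implies that
$\bar{m}(\bar{E}_1) = \bar{m}(\bar{E}_2)$; it follows from Lemma 6 that
$\bar{m}^*(X') = 1$.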

To sum up: we have proved that $X$ is point isomorphic with a possibly
non-measurable subset $X'$ of $\bar{X}$, with $\bar{m}^*(X') = 1$; since $X$ has
sufficiently many measure preserving transformations, so does $X'$. Lemma 12 now
applies: $X'$ is absolutely invariant and the theorem is proved.

4. Application of the geometric isomorphism theorem to measure preserving transformations

In this section we shall have occasion to use certain facts about measure
preserving transformations and the Pontrjagin duality theory: we describe briefly
the parts of these theories that we need. Throughout the remainder of our
work we consider only normal spaces of total measure one.

Two measure preserving transformations $T_1$ and $T_2$, defined, say, on $X_1$ and
$X_2$, are (point-) isomorphic 13 if there is a point isomorphism $T$ from $X_1$ to
$X_2$ with the property that $T T_1 T^{-1}$ is almost everywhere equal to $T_2$. With
every measure preserving transformation $T$ we associate a unitary transformation
$U$ defined on $L_2(X)$ by $Uf(x) = f(Tx)$. A measure preserving transformation $T$
has pure point spectrum if $U$ has; in other words if there exists a complete
orthonormal sequence, $\{f_n(x)\}$, of functions in $L_2(X)$ and a sequence
$\Lambda = \{\lambda_n\}$ of complex numbers (of absolute value one) such that
$f_n(Tx) = \lambda_n f_n(x)$ almost everywhere, for $n = 1, 2, \cdots$. $T$ is ergodic
if $f(Tx) = f(x)$ almost everywhere, with $f \in L_2(X)$, is equivalent to
$f(x) = \text{constant}$ almost everywhere. The spectrum, $\Lambda$, of an ergodic
measure preserving transformation with pure point spectrum is a subgroup of the
multiplicative group of complex numbers of absolute value one. The numbers
$\lambda_n \in \Lambda$ are, moreover, a complete set of invariants of $T$, in the sense
that if two measure preserving transformations with pure point spectrum have the
same set $\Lambda$ of eigenvalues with the same multiplicities then they are
isomorphic. 14

Concerning groups we shall need the following. A compact abelian separable
topological group, $X$, is an m-space, in the sense that we may define on it an
invariant metric $d(x, y)$ and (unique) invariant Haar measure $m(E)$ in such a
way that it becomes an m-space. 15 Let $\Lambda'$ be the character group of $X$; i.e.
$\Lambda'$ is the set of all complex valued continuous functions $f(x)$ with
$|f(x)| = 1$ and $f(xy) = f(x)f(y)$. Then $\Lambda'$ is countable, and the functions
$f(x) \in \Lambda'$ form a complete orthonormal set in $L_2(X)$. Conversely let
$\Lambda$ be any countable abelian group, and let $X$ be its character group; i.e. $X$
is the set of all complex valued functions $x(\lambda)$, defined on $\Lambda$, with
$|x(\lambda)| = 1$ and $x(\lambda\mu) = x(\lambda)x(\mu)$. $X$ may be so topologized
that it becomes a compact separable (and, of course, abelian) group. If to every
$\lambda \in \Lambda$ we make correspond the function $f(x)$ on $X$, defined by
$f(x) = x(\lambda)$, then this correspondence is an isomorphism between $\Lambda$ and
the entire character group $\Lambda'$ of $X$. 16

13 Since this is the only kind of isomorphism for measure preserving transformations that
we shall use, we shall in the sequel omit the qualifying ‘point-’.

14 All these statements are proved in (I) for flows: it is easy, however, to make the
translation from the one parametric case to the discrete case.

15 Invariance means that for all points $x$, $y$, and $a$, and all measurable sets $E$, we
have $d(x, y) = d(ax, ay)$ and $m(aE) = m(E)$. We find it convenient to write all groups
multiplicatively, even though they are abelian.

The fact that Haar measure is invariant means that the rotation $x \to ax$,
where $a$ is any fixed element of the group, is a measure preserving transformation.
The point of introducing the seemingly irrelevant compact groups into
the study of measure preserving transformations is that such rotations are
normal forms for a large class of transformations.

Theorem 4. An ergodic measure preserving transformation with pure point
spectrum on a normal space is isomorphic to a rotation on a compact separable
abelian group.

Proof. Let $\Lambda$ be the spectrum of the given measure preserving
transformation; let $X$ be the character group of $\Lambda$, and $\Lambda'$ that of
$X$. If for every $\lambda \in \Lambda$ we define $a(\lambda) \equiv \lambda$, then
$a = a(\lambda)$ is in $X$. For every $x \in X$ we define $Tx = ax$; we assert that $T$
has pure point spectrum and that its spectrum is simple and precisely equal to
$\Lambda$. It has pure point spectrum because the characters $f(x) \in \Lambda'$
form a complete orthonormal system in $L_2(X)$, and every such $f$ is an
eigenfunction of $T$ belonging to the eigenvalue $f(a)$, $f(ax) = f(a)f(x)$. This
shows, moreover, that the spectrum of $T$, including multiplicities, is obtained by
forming the numbers $f(a)$ for all $f \in \Lambda'$. Since to each $f$ there
corresponds (through the isomorphism described above) an element
$\lambda \in \Lambda$ for which $f(x) = x(\lambda)$ for all $x$, we see that we may
equally well form the numbers $a(\lambda)$, i.e. $\lambda$, for all
$\lambda \in \Lambda$. Hence $T$ is ergodic and it follows, from the previously quoted
result of (I), that the given transformation and $T$ are isomorphic.
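
The construction in this proof may be followed through in the simplest case. The
following is only a worked sketch: the choice of $\Lambda$ as the group generated by a
single number $\lambda$, $|\lambda| = 1$, which is not a root of unity, and the symbols
$z$ and $f_k$, are assumptions introduced for the illustration.

```latex
\begin{align*}
\Lambda &= \{\lambda^k : k = 0, \pm 1, \pm 2, \cdots\} \cong \mathbb{Z}, \\
x \in X &\iff x(\lambda^k) = z^k \text{ for some } z \text{ with } |z| = 1
  \quad (\text{so that } X \text{ is the circle group}), \\
a(\lambda^k) &= \lambda^k \iff z = \lambda,
  \qquad Tx = ax \text{ corresponds to } z \to \lambda z, \\
f_k(x) &= x(\lambda^k) = z^k,
  \qquad f_k(ax) = (\lambda z)^k = \lambda^k f_k(x).
\end{align*}
```

Under these assumptions $T$ is the rotation of the unit circle through $\arg \lambda$
considered in Lemma 8, and its eigenvalues $\lambda^k$ exhaust $\Lambda$, each occurring
once.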

Since in this proof we used only the group A of eigenvalues and not the actual 
transformation we have also the following corollary. 

Corollary 1. Every countable group of complex numbers of absolute value one
is the spectrum of an ergodic measure preserving transformation with pure point
spectrum.

Theorem 4 also enables us to characterize the set of all transformations which 
commute with a given ergodic transformation with pure point spectrum. The 
solution of this problem for general measure preserving transformations is 
probably very difficult. 

Corollary 2. If $x \to ax = Tx$ is an ergodic rotation on a compact abelian
group $X$ and if $S$ is any measure preserving transformation on $X$ for which
$ST = TS$ then $S$ is also a rotation.

16 For the proof of all these statements see L. Pontrjagin, Topological groups, Princeton,
1939, Chapter V.





Proof. We have $S(ax) = aS(x)$, so that if we write $b(x) = Sx \cdot x^{-1}$, then

$$b(Tx) = S(ax)(ax)^{-1} = Sx \cdot x^{-1} = b(x).$$

In other words $b(x)$ is invariant under $T$; since $T$ is ergodic
$b(x) = b = \text{constant}$ 17 and $Sx = bx$, as was to be proved.
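
The statement of Corollary 2 can be checked by exhaustion on a small finite group. The
following is only a sketch under stated assumptions: it uses the cyclic group of
residues modulo $n$, written additively (so that the rotation $x \to ax$ becomes
$x \to x + a$), with the counting measure, for which every permutation is measure
preserving and the rotation is ergodic exactly when $a$ and $n$ are relatively prime.
The function names are hypothetical.

```python
from itertools import permutations


def commuting_with_rotation(n, a):
    """All permutations S of {0, ..., n-1} with S(T(x)) = T(S(x)), where T(x) = x + a mod n."""
    T = [(x + a) % n for x in range(n)]
    result = []
    for S in permutations(range(n)):
        if all(S[T[x]] == T[S[x]] for x in range(n)):
            result.append(S)
    return result


def is_rotation(S, n):
    """True if S(x) = x + b mod n for the fixed element b = S(0)."""
    b = S[0]
    return all(S[x] == (x + b) % n for x in range(n))


if __name__ == "__main__":
    # Ergodic case (a = 1 generates the group): only rotations commute with T.
    print(all(is_rotation(S, 6) for S in commuting_with_rotation(6, 1)))
    # Non-ergodic case (a = 2): some commuting permutations are not rotations.
    print(all(is_rotation(S, 6) for S in commuting_with_rotation(6, 2)))
```

The second computation indicates that the hypothesis of ergodicity cannot simply be
dropped, at least in this finite setting.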

We shall call a measure preserving transformation $R$ an involution if $R^2 = I$
(= the identity), and we shall call an involution $R$ a factor of a given
transformation $T$ if $S = RT$ is also an involution (so that $T = SR$).

Corollary 3. If $x \to ax = Tx$ is any rotation on a compact abelian group $X$,
then $T$ may be factored, $T = SR$, $S^2 = R^2 = I$; if $T$ is ergodic every factor $R$
of $T$ is a reflection, $Rx = bx^{-1}$.

Proof. Clearly if $Rx = bx^{-1}$ then $R$ is an involution; also
$Sx = RTx = R(ax) = ba^{-1} \cdot x^{-1}$ is an involution. Conversely if $T$ is ergodic
and if $T = SR$, $S^2 = R^2 = I$, then $TRT = SR \cdot R \cdot SR = R$, so that
$aR(ax) = Rx$. It follows as in the proof of Corollary 2 that $b(x) = x \cdot R(x)$ is
invariant under $T$ (i.e. $b(ax) = ax \cdot R(ax) = x \cdot R(x) = b(x)$), so that
$b(x) = b = \text{constant}$, and $Rx = bx^{-1}$.

Corollary 4. Any ergodic measure preserving transformation $T$ with pure
point spectrum is isomorphic to its own inverse, $T^{-1} = RTR^{-1}$, where $R$ may even
be chosen as an involution.

Proof. From Corollary 3 we know that $T = SR$, $S^2 = R^2 = I$; since
$T^{-1} = R^{-1}S^{-1} = RS$, we have $T^{-1} = R \cdot SR \cdot R = R \cdot T \cdot R^{-1}$.
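
Corollaries 3 and 4 can likewise be checked by exhaustion on a finite cyclic group. The
sketch below makes the same assumptions as before (residues modulo $n$ written
additively, counting measure, hypothetical function names); in additive notation the
reflection $x \to bx^{-1}$ becomes $x \to b - x$.

```python
from itertools import permutations


def compose(P, Q):
    """The permutation x -> P(Q(x)); permutations are given as tuples."""
    return tuple(P[q] for q in Q)


def involution_factors(n, a):
    """All involutions R for which S = R∘T is again an involution, T(x) = x + a mod n."""
    T = tuple((x + a) % n for x in range(n))
    identity = tuple(range(n))
    factors = []
    for R in permutations(range(n)):
        S = compose(R, T)
        if compose(R, R) == identity and compose(S, S) == identity:
            factors.append(R)
    return factors


def reflection(n, b):
    """The reflection x -> b - x mod n (additive form of x -> b x^{-1})."""
    return tuple((b - x) % n for x in range(n))


def conjugates_to_inverse(R, n, a):
    """Check R∘T∘R^{-1} = T^{-1}; R^{-1} = R is used, so R must be an involution."""
    T = tuple((x + a) % n for x in range(n))
    T_inv = tuple((x - a) % n for x in range(n))
    return compose(R, compose(T, R)) == T_inv


if __name__ == "__main__":
    factors = involution_factors(6, 1)
    # For the ergodic rotation x -> x + 1 every factor is a reflection.
    print(set(factors) == {reflection(6, b) for b in range(6)})
    # Each factor conjugates T into its inverse.
    print(all(conjugates_to_inverse(R, 6, 1) for R in factors))
```

Here the factors are found by trying all permutations, so that the agreement with the
set of reflections is a check of the corollary in this special case and not a
consequence of how the candidates were generated.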

There seems to be some reason for the conjecture that the results of Corollaries
3 and 4 are valid for an arbitrary measure preserving transformation.

We have seen that every rotation is a measure preserving transformation with
pure point spectrum; the question arises as to when a rotation is ergodic. The
following theorem asserts that for rotations ergodicity (i.e. metric transitivity)
is equivalent to regional transitivity.$^{18}$

Theorem 5. If $a$ is a fixed element of the compact abelian group $X$, the rotation
$x \to ax$ is ergodic if and only if the sequence $\{a^n\}$ is everywhere dense in $X$.

Proof. If $x \to ax$ is ergodic then the iterates of some point, say $x_0$, are
everywhere dense.$^{19}$ Since the transformation $x \to x \cdot x_0^{-1}$ is a homeomorphism,
it carries the sequence $\{a^n x_0\}$ of iterates of $x_0$ into a dense sequence; but

$$a^n x_0 x_0^{-1} = a^n.$$

Suppose, conversely, that $\{a^n\}$ is everywhere dense. We have already seen
that any rotation has every function $f$ in the character group A' of $X$ for an
eigenfunction, and that the functions of A' are a complete orthonormal set in
$L_2(X)$. Since eigenfunctions belonging to different eigenvalues are orthogonal,
every function invariant under the rotation $x \to ax$ must be a linear combination
of the invariant functions of the set A': if the only invariant function in
A' is $f(x) \equiv 1$, the rotation is ergodic. Suppose then that $f(ax) = f(x)$ for some
$f \in$ A'. It follows (taking $x$ to be the unit element of $X$, $x = 1$) that $f(a^n) =
f(1) = 1$ for all $n$; since $\{a^n\}$ is dense and $f$ is continuous it follows that $f(x) \equiv 1$.

17 The definition of ergodicity says that numerically valued invariant functions are
constant. It is easy to verify that this implies the same result for functions (such as $b(x)$)
whose values are in the group $X$.

18 For a discussion of the various kinds of transitivity see G. A. Hedlund, The dynamics of
geodesic flows, Bulletin of the American Mathematical Society, vol. 45 (1939), p. 243.

19 See Eberhard Hopf, Ergodentheorie, Berlin, 1937, p. 29.

OPERATOR METHODS IN CLASSICAL MECHANICS, II
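Theorem 5 can be illustrated on the circle group, written additively as the reals modulo 1, where the rotation is $x \to x + \theta$ and the sequence $\{a^n\}$ becomes $\{n\theta \bmod 1\}$. The sketch below measures the largest arc left uncovered by the first few iterates; the function name and the choice of the circle group are illustrative assumptions, not part of the paper.

```python
import math


def largest_gap(theta, n_points):
    """Largest arc (in units of full turns) left uncovered by the points
    k*theta mod 1, k = 0, ..., n_points - 1, on the circle R/Z."""
    pts = sorted((k * theta) % 1.0 for k in range(n_points))
    gaps = [b - a for a, b in zip(pts, pts[1:])]
    gaps.append(1.0 - pts[-1] + pts[0])  # arc that wraps around through 0
    return max(gaps)
```

For an irrational angle the largest gap shrinks as more iterates are taken, in line with density of the orbit, whereas for a rational angle such as 1/4 the orbit is finite and the gap stays fixed, so the rotation is not ergodic.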

To introduce the final result of this paper we observe that Theorem 4, and 
the existence of an invariant metric on any compact separable group, imply 
that every ergodic measure preserving transformation with pure point spectrum 
is isomorphic to an isometric transformation on an m-space. Conversely:

Theorem 6. If $T$ is an ergodic measure preserving transformation on an m-space
$X(\mathfrak{X}, m)$ such that to every $\varepsilon > 0$ there corresponds a $\delta = \delta(\varepsilon) > 0$ in such a way
that $d(x, y) < \delta$ implies $d(T^n x, T^n y) < \varepsilon$, $n = 0, \pm 1, \pm 2, \ldots$ (in other words
if the family $\{T^n\}$ of transformations is equicontinuous), then $T$ has pure point
spectrum: in fact it is possible to introduce into $X$ a multiplication so that it becomes
(with the original topology of $X$) a compact separable abelian group and $T$
becomes a rotation.

We comment first of all on the hypothesis. Since an isometric transformation
clearly has the described equicontinuity property, on the face of it our
hypothesis is weaker than isometry. But if our hypothesis is satisfied we may
introduce into $X$ a new metric, $d'(x, y)$, defined by

$$d'(x, y) = \sup \{\min(1, d(T^n x, T^n y)) \mid n = 0, \pm 1, \pm 2, \ldots\};$$

it is easy to verify that $d$ and $d'$ induce the same topology on $X$, and
that $d'(Tx, Ty) = d'(x, y)$. We may (and do) therefore assume that $T$ is isometric
in the first place.
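The two verifications left to the reader can be written out as follows. This is a sketch under the stated equicontinuity hypothesis, with $\delta(\varepsilon)$ as in Theorem 6 and $0 < \varepsilon < 1$ assumed in the second line.

```latex
% Invariance: replacing (x, y) by (Tx, Ty) shifts the index n to n + 1,
% and n -> n + 1 is a permutation of the integers.
\[
  d'(Tx, Ty) = \sup_{n} \min\bigl(1, d(T^{n+1}x, T^{n+1}y)\bigr)
             = \sup_{k} \min\bigl(1, d(T^{k}x, T^{k}y)\bigr) = d'(x, y).
\]
% Same topology: the term n = 0 gives the lower bound, and equicontinuity
% gives the upper bound for 0 < \varepsilon < 1.
\[
  \min\bigl(1, d(x, y)\bigr) \le d'(x, y), \qquad
  d(x, y) < \delta(\varepsilon) \;\Longrightarrow\; d'(x, y) \le \varepsilon .
\]
```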

We shall make the proof of Theorem 6 depend on the following two lemmas 
which have an interest of their own. 

Lemma 13. If on an m-space X there exists an ergodic and isometric measure 
preserving transformation then X is compact. 

Proof. Let $T$ be an ergodic and isometric transformation; since $X$ is complete
we have to show only that it is totally bounded. If it is not, then there
is an $\varepsilon > 0$ and an infinite sequence of points $x_1, x_2, \ldots$ in $X$ such that the
open spheres $S_n$ of radius $\varepsilon$ with center at $x_n$ are pairwise disjoint. Let $x_0$ be
any point of $X$ whose iterates $\{T^k x_0\}$ are everywhere dense in $X$, and choose
for each $n = 1, 2, \ldots$ an integer $k = k(n)$ such that $d(x_n, T^k x_0) < \varepsilon/2$. If we
denote by $S_0$ the open sphere of radius $\varepsilon/2$ with center at $x_0$, then for each $n$,
$T^{k(n)} S_0 \subset S_n$, so that $m(S_n) \ge m(S_0) > 0$. Since a measure space has, by
definition, finite measure, there cannot exist an infinite sequence of pairwise
disjoint sets whose measure is bounded away from zero; it follows that $X$ is
totally bounded and therefore compact.

Lemma 14. Let $X$ be any compact group (not necessarily separable or abelian)
and let $m(E)$ be any finite measure, defined (at least) for all Borel sets of $X$, such
that the measure of an open set is positive and that the measure of any measurable
set is the lower bound of the measure of open sets containing it. Then the set $X_0$
of all $x \in X$ for which $m(xE) = m(E)$ for all measurable sets $E$ is a closed subgroup
of $X$.





Proof. Since $x \in X_0$ and $y \in X_0$ implies

$$m(xy^{-1}E) = m(y^{-1}E) = m(y(y^{-1}E)) = m(E),$$

$X_0$, and consequently its closure $\bar{X}_0$, is a subgroup; we shall prove $\bar{X}_0 \subset X_0$.

Take $x \in \bar{X}_0$, and let $E$ be any closed (and hence compact) subset of $X$. Let
$O$ be any open set, $O \supset xE$, and let $N$ be a neighborhood of 1 (= the unit element
of $X$) such that for $a \in N$, $axE \subset O$. Then $Nx$ is a neighborhood of $x$, so that
the intersection of $Nx$ and $X_0$ is not empty; say $y = ax$, $a \in N$, $y \in X_0$. Then

$$m(E) = m(yE) = m(axE),$$

and since $axE \subset O$, $m(E) \le m(O)$. In other words $xE \subset O$ implies that $m(E) \le
m(O)$: our condition on $m$ implies that $m(E) \le m(xE)$. Applying this result
to the compact set $xE$ and the point $x^{-1} \in \bar{X}_0$ (in place of $E$ and $x$) we obtain
$m(xE) \le m(E)$, so that $m(xE) = m(E)$ for all closed sets $E$. It follows that
$m(xE) = m(E)$ for all measurable sets $E$, as was to be proved.

Proof of Theorem 6. Let $x_0$ be any point in $X$ for which $\{T^n x_0\}$ is everywhere
dense; write $x_n = T^n x_0$ for $n = \pm 1, \pm 2, \ldots$. For $x = x_n$ and $y = x_m$
we define $p(x, y) = x_{n+m}$, and $r(x) = x_{-n}$. If $x' = x_{n'}$, $x'' = x_{n''}$, $y' = x_{m'}$,
$y'' = x_{m''}$, then

$$\begin{aligned}
d(p(x', y'), p(x'', y'')) &= d(x_{n'+m'}, x_{n''+m''}) \\
&\le d(x_{n'+m'}, x_{n'+m''}) + d(x_{n'+m''}, x_{n''+m''}) \\
&= d(x_{m'}, x_{m''}) + d(x_{n'}, x_{n''}) \\
&= d(y', y'') + d(x', x'');
\end{aligned}$$

in other words $p(x, y)$ is uniformly continuous throughout its domain of definition;
similarly since we have

$$d(r(x), r(y)) = d(x_{-n}, x_{-m}) = d(x_{-n+n+m}, x_{-m+n+m}) = d(y, x),$$

$r(x)$ is uniformly continuous throughout its domain. The domain of $p(x, y)$ is
an everywhere dense subset of the product space of $X$ with itself, and the domain
of $r(x)$ is an everywhere dense subset of $X$; consequently they each have a unique
continuous extension, to all the product space and all $X$ respectively.

The rest of the proof is now easy. We define, for every $x$ and $y$ in $X$, $xy =
p(x, y)$ and $x^{-1} = r(x)$; it is clear that with these definitions $X$ becomes an
abelian topological group. We may write, for any $x = x_n$ and an arbitrary $y$,
$p'(x, y) = T^n y$; then $p'(x, y)$ is a continuous extension of our original $p(x, y)$
and therefore (because of the uniqueness of extension) $T^n y = x_n y$. (For $n = 1$,
we obtain, in particular, $Ty = x_1 y$ for all $y$. The originally chosen element $x_0$
is now the unit element of the group.) If $E$ is any measurable set then $T^n E =
x_n E$ has the same measure as $E$, so that measure is preserved by an everywhere
dense set of $x$'s; since, by Lemma 13, $X$ is compact, Lemma 14 implies that for
all $x$ and all measurable sets $E$, $m(xE) = m(E)$. The uniqueness of Haar
measure implies that $m$ is the Haar measure of the group $X$; this completes the
proof that $T$ is a rotation, and hence has pure point spectrum.

Institute for Advanced Study



Annals of Mathematics
Vol. 43, No. 2, April, 1942 


THE BROWNIAN MOVEMENT AND STOCHASTIC EQUATIONS 

By J. L. Doob 
(Received January 14, 1942)

The irregular movements of small particles immersed in a liquid, caused by
the impacts of the molecules of the liquid, were described by Brown in 1828.$^1$
Since 1905 the Brownian movement has been treated statistically, on the basis
of the fundamental work of Einstein and Smoluchowski. Let $x(t)$ be the
$x$-coordinate of a particle at time $t$. Einstein and Smoluchowski treated $x(t)$
as a chance variable. They found the distribution of $x(t) - x(0)$ to be Gaussian,
with mean 0 and variance $\alpha|t|$, where $\alpha$ is a positive constant which can be
calculated from the physical characteristics of the moving particles and the given
liquid. More exactly, such a family of chance variables $\{x(t)\}$ is now described
as the family of chance variables determining a temporally homogeneous differential
stochastic process: the distribution of $x(s + t) - x(s)$ is Gaussian, with
mean 0, variance $\alpha|t|$, and if $t_1 < \cdots < t_n$,

$$x(t_2) - x(t_1), \ldots, x(t_n) - x(t_{n-1})$$

are mutually independent chance variables. Wiener, who was the first to discuss
this stochastic process rigorously, proved in 1923 that the functions $x(t)$
of this stochastic process are continuous, with probability 1.$^2$ This is of course
a desirable result, which makes the stochastic process somewhat more acceptable
as the mathematical idealization of the Brownian movement. It was not expected,
however, that the above distribution of $x(s + t) - x(s)$ would prove
correct for small $t$. Even if the derivation did not break down for small $t$, the
mathematical fact that $x(s + t) - x(s)$ has standard deviation $(\alpha|t|)^{1/2}$, so that
$x(s + t) - x(s)$ is of the order of magnitude of $|t|^{1/2}$, implying that $dx(s)/ds$
cannot be finite, would suggest the desirability of modifications of the Einstein-Smoluchowski
distributions. In fact it is easily seen that (with probability 1)
$x(t)$ is not even of bounded variation, so that the path curves of the Einstein-Smoluchowski
process have infinite length!
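The claim about infinite length can be made plausible with a short computation. For a Gaussian increment with mean 0 and variance $v$, the expected absolute value is $(2v/\pi)^{1/2}$, so the expected length of the polygonal approximation over $n$ equal subintervals of $[0, T]$ grows like $n^{1/2}$. The sketch below evaluates this expectation; the function name is hypothetical, and the computation concerns expected values only, not the almost-sure statement in the text.

```python
import math


def expected_polygonal_variation(alpha, T, n):
    """Expected sum of |x(t_j) - x(t_{j-1})| over n equal subintervals of
    [0, T], when increments are Gaussian with mean 0 and variance alpha*dt."""
    dt = T / n
    mean_abs_increment = math.sqrt(2.0 * alpha * dt / math.pi)  # E|N(0, alpha*dt)|
    return n * mean_abs_increment
```

Quadrupling the number of subintervals doubles the expected variation, so no finite bound survives refinement of the partition.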

A different stochastic process describing the $x(t)$ was in fact derived in 1930
by Ornstein and Uhlenbeck (15),$^3$ and later by S. Bernstein (1), (2) and Krutkow
(11), all using different methods. This new distribution of $x(s + t) - x(s)$ is
Gaussian, with mean 0 and variance $(\alpha/\beta)(e^{-\beta|t|} - 1 + \beta|t|)$, approximately
$\alpha|t|$ for large $t$, but $\alpha\beta t^2/2$ for small $t$. (Here $\beta$ is a second physically
determined constant.)

1 For a historical account of the subject up to 1913, see Haas-Lorentz (6). (The
numbered references will refer to the bibliography at the end of the paper.)

2 Wiener (18, pp. 148-151) has since given a simpler proof. For a discussion of the
exact meaning of such a statement concerning the continuity of paths, cf. Doob (3) and (5),
§2. The result means that $x(t)$ can be treated as representing one of a multiplicity of
continuous functions of $t$, and integrated, etc. Probability here is formally the study of
measures on certain spaces of functions.

3 Cf. also Ornstein and Wijk (16) and Wijk (17). References to work since 1913 are given
in Ornstein and Uhlenbeck (15).

The purpose of the present paper is to apply the methods and results of
modern probability theory to the analysis of the Ornstein-Uhlenbeck distribution,
its properties and its derivation. It will be seen that the use of rigorous
methods actually simplifies some of the formal work, besides clarifying the
hypotheses. A stochastic differential equation will be introduced in a rigorous
way to give a precise meaning to the Langevin differential equation for the
velocity function $dx(s)/ds$. This will avoid the usual embarrassing situation in
which the Langevin equation, involving the second derivative of $x(s)$, is used to
find a solution $x(s)$ not having a second derivative.

1. The velocity distribution 

The displacement function $x(t)$, as discussed by Ornstein and Uhlenbeck, has
a derivative $u(t)$, and all the probability relations needed can be derived from
those of $u(t)$, as will be seen below. The distribution of $u(t)$ can be described
as follows: the conditional distribution of $u(s + t)$ ($t > 0$) for given $u(s) = u_0$
is Gaussian, with mean $u_0 e^{-\beta t}$ and variance $\sigma_0^2(1 - e^{-2\beta t})$. Here $\sigma_0^2$, $\beta$ are
physically determined constants. When $t \to \infty$, this distribution becomes the
Maxwell distribution of velocities, furnishing stationary absolute (unconditioned)
probabilities for the process, if these are desired. Using these absolute
probabilities, which make the distribution easier to describe, the full description
of the $u(t)$ distribution can then be stated as follows: for each $t$, $u(t)$ is a chance
variable with a Gaussian distribution, having mean 0, variance $\sigma_0^2$; the transition
probabilities are as just described; the process is a Markoff process.$^4$ This last
fact means that the Maxwell distribution of $u(t_0)$ for each fixed $t_0$, and the
transition probabilities just described determine the full set of probability relations
of the process. Under these conditions, if $t_1 < t_2$, the pair $u(t_1)$, $u(t_2)$ has
a bivariate Gaussian distribution, with zero means, equal variances $\sigma_0^2$, and
correlation coefficient $e^{-\beta(t_2 - t_1)}$. This stochastic process goes back at least to
Smoluchowski, although it was first derived by Ornstein and Uhlenbeck as the
process describing the velocity of a particle in Brownian motion. Ornstein and
Uhlenbeck were only interested in the transition probabilities. The formal
manipulations made below will show that there are technical advantages in
defining (unconditioned) probabilities for the $u(t)$ also. The above described
process will be called the O. U. process below.
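The transition law just described is consistent with the Markoff property in the following sense: a step of length $t_1$ followed by a step of length $t_2$ has the same conditional mean and variance as a single step of length $t_1 + t_2$. The sketch below checks this numerically. The function names are illustrative assumptions; `sigma0_sq` stands for $\sigma_0^2$.

```python
import math


def ou_transition(u0, t, beta, sigma0_sq):
    """Mean and variance of u(s+t) given u(s) = u0, for t > 0."""
    decay = math.exp(-beta * t)
    return u0 * decay, sigma0_sq * (1.0 - decay * decay)


def compose_two_steps(u0, t1, t2, beta, sigma0_sq):
    """Mean and variance after a step t1 followed by a step t2, obtained by
    averaging the second transition over the outcome of the first."""
    m1, v1 = ou_transition(u0, t1, beta, sigma0_sq)
    decay2 = math.exp(-beta * t2)
    _, v2 = ou_transition(0.0, t2, beta, sigma0_sq)
    # u(t1 + t2) = decay2 * u(t1) + independent Gaussian noise of variance v2
    return decay2 * m1, decay2 * decay2 * v1 + v2
```

For large $t$ the transition mean tends to 0 and the variance to $\sigma_0^2$, which is the stationary (Maxwell) distribution mentioned in the text.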

The following theorem shows that such a process is essentially determined by
three fundamental properties, of which at least the first two have simple physical
significance. (We can exclude Case A of the theorem, since it obviously does
not fit the physical picture.)

4 A process is called a Markoff process if whenever $t_1 < \cdots < t_n$, the conditional distribution
of $u(t_n)$ for given values of $u(t_1), \ldots, u(t_{n-1})$ actually depends only on $u(t_{n-1})$. It is
in this case, and only in this case, that the Smoluchowski equation between the transition
probabilities, and the Fokker-Planck differential equations for the transitional probabilities
are valid.

Theorem 1.1. Let $u(t)$ ($-\infty < t < +\infty$) be a one-parameter family of chance
variables, determining a stochastic process with the following properties.

1. The process is temporally homogeneous.$^5$

2. The process is a Markoff process.

3. If $s$, $t$ are arbitrary distinct numbers, $u(s)$, $u(t)$ have a (non-singular) bivariate
Gaussian distribution.

Define $m$, $\sigma_0^2$ by

$$m = E\{u(t)\}, \qquad \sigma_0^2 = E\{[u(t) - m]^2\}.\,{}^{6} \tag{1.1.1}$$

Then the given process is one of the following two types.

(A) If $t_1 < \cdots < t_n$, $u(t_1), \ldots, u(t_n)$ are mutually independent Gaussian
chance variables, with mean $m$ and variance $\sigma_0^2$.

(B) (O. U. process) There is a constant $\beta > 0$ such that if $t_1 < \cdots < t_n$,
$u(t_1), \ldots, u(t_n)$ have an $n$-variate Gaussian distribution, with common mean $m$
and variance $\sigma_0^2$, and correlation coefficients determined by the equation

$$E\{[u(t) - m][u(s) - m]\} = \sigma_0^2 e^{-\beta|t - s|}.$$

Instead of considering $u(t)$, we can consider $(1/\sigma_0)[u(t) - m]$, which has
mean 0 and variance 1. Then we shall assume in the following that $u(t)$ itself
has these properties: $m = 0$, $\sigma_0^2 = 1$. Let $\rho(t)$ be the correlation function:
$\rho(t) = E\{u(s + t)u(s)\}$, independent of $s$ by Property 1. If $s < t$, the conditional
distribution of $u(t)$ for given $u(s)$ has density

$$\frac{1}{(2\pi)^{1/2}(1 - \rho^2)^{1/2}} \exp\left\{-\frac{1}{2}\,\frac{[u(t) - \rho u(s)]^2}{1 - \rho^2}\right\}, \qquad \rho = \rho(t - s), \tag{1.1.2}$$

(Property 3). If $t_1 < \cdots < t_n$, $u(t_1), \ldots, u(t_n)$ then have an $n$-variate Gaussian
distribution with density

$$\frac{1}{(2\pi)^{n/2}\prod_{j=1}^{n-1}(1 - \rho_j^2)^{1/2}} \exp\left(-\frac{u_1^2}{2} - \frac{1}{2}\sum_{j=1}^{n-1}\frac{(u_{j+1} - \rho_j u_j)^2}{1 - \rho_j^2}\right), \qquad \rho_j = \rho(t_{j+1} - t_j), \quad u_j = u(t_j), \tag{1.1.3}$$

using Property 2. Now if $u_1, \ldots, u_n$ have an $n$-variate Gaussian distribution
with density

$$\frac{1}{(2\pi)^{n/2}\Delta^{1/2}} \exp\left(-\frac{1}{2}\sum_{i,j} a_{ij}u_i u_j\right), \tag{1.1.4}$$

$\Delta = \det(E\{u_i u_j\})$ is the determinant of the matrix of variances and covariances,
and $(a_{ij})$ is the inverse of this matrix. Using these facts we can calculate
$\rho(t_3 - t_1) = E\{u_1 u_3\}$ in (1.1.3) with $n = 3$, and find that $\rho(t_3 - t_1) = \rho_1\rho_2$,
that is

$$\rho(t_3 - t_1) = \rho(t_2 - t_1)\rho(t_3 - t_2). \tag{1.1.5}$$

5 That is, the probability distributions are unaffected by translations of the $t$-axis.

6 The expectation of the chance variable $v$ will be denoted by $E\{v\}$.

Then $\rho(t)$ is an even function; $|\rho(t)| \le 1$ (Schwarz's inequality); and according
to (1.1.5) $\rho(s + t) = \rho(s)\rho(t)$ for all positive $s$, $t$. Under these conditions either
$\rho(t) \equiv 0$ or there is a constant $\beta \ge 0$ such that

$$\rho(t) = e^{-\beta|t|}. \tag{1.1.6}$$

In the present case, $\beta > 0$, by Property 3 (non-singularity of the given bivariate
distributions). Evidently $\rho(t) \equiv 0$ furnishes Case A of the theorem, which
certainly has the three given properties. If $\rho(t)$ is given by (1.1.6) with $\beta > 0$,
we show first that the matrix $(a_{ij})$, the inverse of $(\rho(t_i - t_j))$, actually determines
a Gaussian density distribution (1.1.4). To see this we consider the density
function (1.1.3) with $\rho_j = e^{-\beta(t_{j+1} - t_j)}$. The coefficients of the quadratic form
in the exponent of (1.1.3) are easily evaluated and the matrix of the form is
found to be the inverse of the matrix $(\rho(t_i - t_j))$. Thus (1.1.3) actually is the
required probability density. Moreover the probability densities obtained in
this way (as the $t_j$ vary) are mutually consistent, because integrating out any
variable leaves a quadratic form of the same type, without the integrated
variable, but with the same rule determining the coefficients. The correlation
function (1.1.6) therefore determines a stochastic process. The process obviously
is a Markoff process because of the form of the probability density (1.1.3):
an initial factor involving $u_1$ only, followed by the product of functions of pairs
of adjacent variables. The proof of the theorem is now complete.
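The statement that the matrix of the quadratic form in (1.1.3) is the inverse of the correlation matrix can be checked numerically. The sketch below assumes the normalization $m = 0$, $\sigma_0^2 = 1$ and takes the quadratic form to be $u_1^2 + \sum_j (u_{j+1} - \rho_j u_j)^2/(1 - \rho_j^2)$ with $\rho_j = e^{-\beta(t_{j+1} - t_j)}$; it builds that matrix, multiplies it by $(e^{-\beta|t_i - t_k|})$, and reports the largest deviation from the identity. All function names are hypothetical.

```python
import math


def precision_from_markov_density(times, beta):
    """Matrix Q of the form u_1^2 + sum_j (u_{j+1} - rho_j u_j)^2 / (1 - rho_j^2),
    with rho_j = exp(-beta * (t_{j+1} - t_j)); times must be strictly increasing."""
    n = len(times)
    Q = [[0.0] * n for _ in range(n)]
    Q[0][0] = 1.0  # contribution of the initial factor in u_1 alone
    for j in range(n - 1):
        rho = math.exp(-beta * (times[j + 1] - times[j]))
        c = 1.0 / (1.0 - rho * rho)
        Q[j + 1][j + 1] += c
        Q[j][j] += rho * rho * c
        Q[j][j + 1] -= rho * c
        Q[j + 1][j] -= rho * c
    return Q


def correlation_matrix(times, beta):
    """Matrix with entries exp(-beta * |t_i - t_k|)."""
    return [[math.exp(-beta * abs(s - t)) for t in times] for s in times]


def max_deviation_from_identity(times, beta):
    """Largest entry of |Q R - I|, where R is the correlation matrix."""
    Q = precision_from_markov_density(times, beta)
    R = correlation_matrix(times, beta)
    n = len(times)
    worst = 0.0
    for i in range(n):
        for k in range(n):
            entry = sum(Q[i][j] * R[j][k] for j in range(n))
            worst = max(worst, abs(entry - (1.0 if i == k else 0.0)))
    return worst
```

The matrix `Q` is tridiagonal, which is the algebraic form of the Markoff property: each variable is coupled only to its neighbours in time.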

According to a theorem of Khintchine ((9) p. 608), $\rho(t)$ is the correlation function
of a temporally homogeneous stochastic process if and only if it can be put
in the form

$$\rho(t) = \int_0^\infty \cos \lambda t \, dF(\lambda), \tag{1.1.7}$$

where $F(\lambda)$ is monotone non-decreasing and bounded. In Case B of the theorem,
(1.1.7) is true when $F(\lambda)$ is given by

$$F(\lambda) = \frac{2\beta\sigma_0^2}{\pi}\int_0^\lambda \frac{d\lambda}{\beta^2 + \lambda^2}. \tag{1.1.8}$$


In the stochastic process of Case B, the variance of $u(s + t) - u(s)$ is approximately
$2\sigma_0^2\beta|t|$ for small $t$:

$$E\{[u(s + t) - u(s)]^2\} = 2\sigma_0^2(1 - e^{-\beta|t|}) \sim 2\sigma_0^2\beta|t|. \tag{1.1.9}$$

Thus $u(s + t) - u(s)$ is of the order of magnitude of $|t|^{1/2}$, and $du/dt$ cannot
exist. Physically this means that the particles in question do not have a finite
acceleration (if the given stochastic process represents the Brownian movement
that closely).

Theorem 1.2. If u(t) is the representative function of the stochastic process of 
Theorem 1.1 Case B, u(t) is a continuous function of t, with probability 1. 





Let $v(t)$ be determined by the equation

$$v(t) = t^{1/2}\,u\!\left(\frac{1}{2\beta}\log t\right), \qquad t > 0. \tag{1.2.1}$$

Then $v(t)$ has the property that if $t_1 < \cdots < t_n$, $v(t_1), \ldots, v(t_n)$ have an
$n$-variate Gaussian distribution. We find by direct calculation (taking $m = 0$):

$$\begin{aligned}
E\{v(s + t) - v(s)\} &= 0, \\
E\{[v(s + t) - v(s)]^2\} &= \sigma_0^2 t, \\
E\{[v(s_2) - v(s_1)][v(t_2) - v(t_1)]\} &= 0 \qquad (s_1 < s_2 \le t_1 < t_2).
\end{aligned} \tag{1.2.2}$$

Then $v(t)$ determines a differential process, in fact precisely the original Einstein-Smoluchowski
process. Since Wiener has proved continuity of the path functions
in this case, the theorem follows.
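The direct calculation behind (1.2.2) reduces to the identity $E\{v(s)v(t)\} = \sigma_0^2 \min(s, t)$, from which all three relations follow. A minimal numerical sketch of that identity, assuming $m = 0$ and the O. U. covariance $\sigma_0^2 e^{-\beta|t - s|}$ (function names are illustrative):

```python
import math


def ou_covariance(s, t, beta, sigma0_sq):
    """E{u(s) u(t)} for the O. U. process with mean 0."""
    return sigma0_sq * math.exp(-beta * abs(t - s))


def v_covariance(s, t, beta, sigma0_sq):
    """E{v(s) v(t)} for v(t) = sqrt(t) * u(log(t) / (2*beta)), with s, t > 0."""
    tau_s = math.log(s) / (2.0 * beta)
    tau_t = math.log(t) / (2.0 * beta)
    return math.sqrt(s * t) * ou_covariance(tau_s, tau_t, beta, sigma0_sq)
```

The result does not depend on $\beta$: the time change absorbs the damping constant entirely, leaving the covariance of the Einstein-Smoluchowski process.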

The transition from $u(t)$ to $v(t)$ just used reduces every property of the
Ornstein-Uhlenbeck stochastic process to a corresponding property of the
Einstein-Smoluchowski process, and vice versa. Many properties of the individual
functions of the latter process, that is, properties possessed by almost all
the individual functions, in other words possessed “with probability 1,” have
been proved in recent years, besides the continuity property we have just used.
The following theorem gives the counterparts of two of these for the O. U.
process.

Theorem 1.3. If $u(t)$ is the representative function of the O. U. process of
Theorem 1.1 Case B,

$$\limsup_{t \to 0} \frac{u(t) - u(0)}{(4\sigma_0^2\beta t \log\log(1/t))^{1/2}} = 1, \qquad \limsup_{t \to \infty} \frac{u(t)}{(2\sigma_0^2 \log t)^{1/2}} = 1, \tag{1.3.1}$$

with probability 1.

Let $v(t)$ be defined by (1.2.1). Then Khintchine ((10) pp. 68-75) has proved

$$\limsup_{t \to 0} \frac{v(1 + t) - v(1)}{(2\sigma_0^2 t \log\log(1/t))^{1/2}} = 1, \qquad \limsup_{t \to \infty} \frac{v(t) - v(0)}{(2\sigma_0^2 t \log\log t)^{1/2}} = 1, \tag{1.3.2}$$

and (1.3.2) becomes (1.3.1) when $v(t)$ is expressed in terms of $u(t)$.


2. The distribution of displacements

It does not seem to have been realized by earlier writers that the distribution
of displacements in the O. U. process can be obtained directly from that of the
velocities. In fact, we have seen that as $t$ varies, $u(t)$ can be considered as one of a
multiplicity of continuous functions of $t$. Integration of $u(t)$ is therefore admissible,
and will give the displacement function. If $x(t)$ is the $x$-coordinate
of a particle at time $t$,

$$x(t) - x(0) = \int_0^t u(s)\,ds \tag{2.1}$$

with probability 1 (that is, neglecting the discontinuous $u(t)$ functions which
have total probability 0). The main advantage of the rigorous approach to
stochastic processes depending on a continuous parameter is precisely that the
$u(t)$ of the process, as $t$ varies, can be regarded as an individual function or
rather, as one of many functions with whatever regularity properties the given
probability distributions imply. Theorem 1.3 limits the actual upper bounds
of the velocity functions $u(t)$. The following result takes advantage of the
oscillations in sign.

Theorem 2.1. If $u(t)$ is the representative function of the O. U. process of
Theorem 1.1 Case B, with $m = 0$,

$$\lim_{t \to \infty} \frac{1}{t}\int_0^t u(s)\,ds = \lim_{t \to \infty} \frac{x(t) - x(0)}{t} = 0, \tag{2.1.1}$$

with probability 1.

This theorem is simply the ergodic theorem applied to the $u(t)$ process to give
the strong law of large numbers (cf. Doob (4) p. 294). From (2.2.3) below, it
is quite obvious that the expectation of the square of the left side of (2.1.1)
goes to 0 as $t \to \infty$, so that the left side goes to 0 in the mean. The strength of
(2.1.1) is that it is a statement about the individual path functions,
or physically, a statement about the path of a single particle. The same was
true in Theorems 1.2 and 1.3.

In order to find the distribution of $x(t) - x(0)$ we proceed as follows. Riemann
integrability of $u(t)$ implies that (with probability 1)

$$x(t) - x(0) = \lim_{n \to \infty}\sum_{j=1}^{n} u(tj/n)\,t/n. \tag{2.2.1}$$

Now the $n$-variate distribution of the variables summed is Gaussian. Then the
sum is Gaussian, so the distribution of $x(t) - x(0)$ is also Gaussian, if it can be
shown that the variance of $x(t) - x(0)$ is positive. The distribution of
$x(t) - x(0)$ is thus completely determined by its first two moments, which we
proceed to calculate. We shall suppose that $E\{u(t)\} = 0$, $E\{u(t)^2\} = \sigma_0^2$.
Then we find

$$E\{x(t) - x(0)\} = \int_0^t E\{u(s)\}\,ds = 0,\,{}^{7} \tag{2.2.2}$$

and, if $t > 0$,

$$E\{[x(t) - x(0)]^2\} = \int_0^t\!\!\int_0^t E\{u(s)u(s')\}\,ds\,ds' = \sigma_0^2\int_0^t\!\!\int_0^t e^{-\beta|s - s'|}\,ds\,ds' = \frac{2\sigma_0^2}{\beta^2}\,(e^{-\beta t} - 1 + \beta t). \tag{2.2.3}$$

7 By Fubini's integration theorem, we can find the expectations under the integral sign,
before integrating with respect to $s$.
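The closed form for the variance of the displacement can be compared with a direct numerical evaluation of the double integral of $\sigma_0^2 e^{-\beta|s - s'|}$ over the square $[0, t] \times [0, t]$. The sketch below uses a plain midpoint rule; the function names and the grid size are illustrative assumptions, and `sigma0_sq` stands for $\sigma_0^2$.

```python
import math


def displacement_variance(t, beta, sigma0_sq):
    """Closed form (2 sigma0^2 / beta^2)(exp(-beta t) - 1 + beta t), for t > 0."""
    return 2.0 * sigma0_sq / beta**2 * (math.exp(-beta * t) - 1.0 + beta * t)


def displacement_variance_numeric(t, beta, sigma0_sq, n=400):
    """Midpoint-rule approximation of the double integral of
    sigma0_sq * exp(-beta*|s - s'|) over the square [0, t] x [0, t]."""
    h = t / n
    mids = [(i + 0.5) * h for i in range(n)]
    total = 0.0
    for s in mids:
        for sp in mids:
            total += math.exp(-beta * abs(s - sp))
    return sigma0_sq * total * h * h
```

For small $t$ the closed form behaves like $\sigma_0^2 t^2$, and for large $t$ like $(2\sigma_0^2/\beta)\,t$, which matches the small-time and large-time behaviour of the displacement variance quoted in the introduction.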


The same sort of argument shows that if $t_1, \ldots, t_n$ are any distinct numbers,
the chance variables

$$\{x(t_j) - x(0),\ u(t_j),\ j = 1, \ldots, n\}$$

have a $2n$-variate Gaussian distribution, which can then be evaluated explicitly
by finding the first and second moments. For example, the following equations
determine the bivariate distribution of $x(t) - x(0)$, $u(t)$, ($t > 0$):

$$\begin{aligned}
E\{[x(t) - x(0)]u(t)\} &= \int_0^t E\{u(t)u(s)\}\,ds = \frac{\sigma_0^2}{\beta}\,(1 - e^{-\beta t}), \\
E\{x(t) - x(0)\} &= 0, \\
E\{[x(t) - x(0)]^2\} &= \frac{2\sigma_0^2}{\beta^2}\,(e^{-\beta t} - 1 + \beta t), \qquad E\{u(t)\} = 0, \qquad E\{u(t)^2\} = \sigma_0^2.
\end{aligned} \tag{2.2.4}$$

Thus the bivariate density of $x(t) - x(0)$, $u(t)$ is Gaussian, with common
mean 0, and variances $(2\sigma_0^2/\beta^2)(e^{-\beta t} - 1 + \beta t)$, $\sigma_0^2$, respectively, and correlation
coefficient

$$\frac{1 - e^{-\beta t}}{2^{1/2}(e^{-\beta t} - 1 + \beta t)^{1/2}}. \tag{2.2.5}$$

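It is to be expected that if $s_1 < s_2 \le t_1 < t_2$, $x(s_2) - x(s_1)$ and $x(t_2) - x(t_1)$
become independent as $t_1 \to \infty$. In fact, these two normally distributed variables
have correlation coefficient

$$\frac{(e^{\beta s_2} - e^{\beta s_1})(e^{-\beta t_1} - e^{-\beta t_2})}{2\,(e^{-\beta(s_2 - s_1)} - 1 + \beta(s_2 - s_1))^{1/2}\,(e^{-\beta(t_2 - t_1)} - 1 + \beta(t_2 - t_1))^{1/2}}, \tag{2.2.6}$$

which goes to 0 when $t_1$ and $t_2$ become infinite.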

If in this discussion only the conditional distribution functions are wanted,
for $u(0) = u_0$, for example, two procedures are possible. Setting $u(0) \equiv u_0$
instead of using the initial distribution we have used above, and carrying out the
same type of calculations as above, would give the desired conditional probabilities.
Or the conditional distributions could be calculated from the distributions
just derived, since the conditional distributions of a multivariate Gaussian
distribution are easily found. Theorems 1.2, 1.3 and 2.1 hold no matter what
initial distribution is assigned to $u(0)$.

Finally, there is one more fact which we shall need in the next section. Define
$B(t)$ by

$$B(t) = \beta[x(t) - x(0)] + u(t) - u(0). \tag{2.2.7}$$

Then $B(t)$ has for each $t$ a Gaussian distribution, with mean 0. Evidently the
distribution of $B(s + t) - B(s)$ is independent of $s$. It is Gaussian, with
mean 0, and the variance is easily calculated to be $2\sigma_0^2\beta|t|$. Moreover, if
$s_1 < s_2 \le t_1 < t_2$,

$$E\{[B(t_2) - B(t_1)][B(s_2) - B(s_1)]\} = 0. \tag{2.2.8}$$

Thus the $B(t)$-process is again the Einstein-Smoluchowski process.
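The variance of $B(t)$ can be worked out from the second moments already found. The following sketch assumes $m = 0$ and $t > 0$, and uses the stationary covariance $E\{u(s)u(s')\} = \sigma_0^2 e^{-\beta|s - s'|}$ together with the moments of $x(t) - x(0)$ and of $u(t) - u(0)$ obtained above.

```latex
% Assumes m = 0 and t > 0; the cross term between displacement and
% velocity increment vanishes, so only two variances contribute.
\begin{align*}
E\{[x(t)-x(0)]\,u(0)\} &= \int_0^t \sigma_0^2 e^{-\beta s}\,ds
   = \frac{\sigma_0^2}{\beta}\,(1 - e^{-\beta t})
   = E\{[x(t)-x(0)]\,u(t)\}, \\
E\{[x(t)-x(0)]\,[u(t)-u(0)]\} &= 0, \\
E\{B(t)^2\} &= \beta^2 E\{[x(t)-x(0)]^2\} + E\{[u(t)-u(0)]^2\} \\
  &= 2\sigma_0^2\,(e^{-\beta t} - 1 + \beta t) + 2\sigma_0^2\,(1 - e^{-\beta t})
   = 2\sigma_0^2\beta t .
\end{align*}
```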

3. Derivation of the velocity distribution using the Langevin equation 

Ornstein and Uhlenbeck base their investigation on the Langevin equation

$$\frac{du(t)}{dt} = -\beta u(t) + A(t), \tag{3.1}$$

which is simply Newton's law of motion applied to a particle, after dividing
through by the mass. The first term on the right is due to the frictional resistance
or its analogue, which is supposed proportional to the velocity. The
second term represents the random forces (molecular impacts). Probability
hypotheses are imposed on the $A(t)$, including relations between $A(t)$ and $u(t)$,
to determine the $u(t)$ distribution. Unfortunately this $u(t)$ distribution (Case B
of Theorem 1.1), as we have seen, has the property that the velocity function
has no time derivative. Then the solution can hardly satisfy (3.1).

Bernstein ((2) p. 361) replaces (3.1) by a finite difference equation: 

$$\Delta\left(\frac{\Delta\xi_n}{\Delta t}\right) = -\beta\,\Delta\xi_n + \alpha_n, \qquad n = 1, 2, \ldots. \tag{3.2}$$

Here $\xi_1, \xi_2, \ldots$ is a sequence of chance variables, $\Delta\xi_n = \xi_{n+1} - \xi_n$ etc., and
$\alpha_1, \alpha_2, \ldots$ is a given sequence of mutually independent chance variables. If
we think of $\xi_j$ as the analogue of $x(j\Delta t)$, the correspondence between (3.2) and
(3.1) is clear. The equations of (3.2) determine definite distributions for the
$\xi_j$ in terms of those of the $\alpha_j$. Bernstein shows that as $\Delta t \to 0$ the distribution
of $\Delta\xi_n/\Delta t$ ($\sim \Delta x/\Delta t$) becomes the $u(t)$ distribution we have been discussing, if
suitable hypotheses are made on the $\alpha_j$. This approach is essentially different
from that of Ornstein and Uhlenbeck in that Bernstein, as he states explicitly
((1) pp. 5, 6), is not writing a difference equation in the displacement functions
$x(t)$ themselves: (3.2) determines distributions only, and these are approximated
by the limiting distributions described in Theorem 1.1 Case B.

In our treatment, we shall replace the Langevin equation by a formalized
differential equation for the velocity function $u(t)$. This equation is to be
exact, not merely asymptotically true. The equation will be perfectly proper
mathematically, so that solution by ordinary methods will provide all the information
relevant to the desired distributions, and solution of more general problems,
involving external forces, will require no special methods.

The problem is to find a proper stochastic analogue of the Langevin equation,
remembering that we do not expect $u'(t)$ to exist. We write the equation in
the following form:

$$du(t) = -\beta u(t)\,dt + dB(t), \tag{3.3}$$

and try to give these differentials a suitable interpretation. We shall suppose
that the $B(t)$-process is a differential process: that is, if $t_1 < \cdots < t_n$, we
suppose that

$$B(t_2) - B(t_1), \ldots, B(t_n) - B(t_{n-1})$$

are mutually independent chance variables. We also suppose temporal homogeneity,
that is that the distribution of $B(s + t) - B(s)$ is independent of $s$.
The physical meaning of these hypotheses is clear, and they will be justified
further below. Equation (3.3) can be interpreted roughly in terms of small
changes in momentum. An important particular case is that in which the
second moments of the $B(t)$-process are finite:

$$\sigma^2(t) = E\{[B(s + t) - B(s)]^2\} < \infty. \tag{3.4}$$

The first moment $E\{B(s + t) - B(s)\}$ then exists. If this first moment
vanishes, $\sigma^2(t)$ satisfies the functional equation

$$\sigma^2(s + t) = \sigma^2(s) + \sigma^2(t).$$

Then $\sigma^2(t)$ must be proportional to $t$: $\sigma^2(t) = t\sigma^2$. If $f(t)$ is continuous,

$$\int_a^b f(t)\,dB(t) \tag{3.5}$$


has been defined under these hypotheses (Wiener (18), pp. 151-157, Doob (3),
pp. 131-134), even though the functions $B(t)$ are known not to be of bounded
variation. The definition makes all the formal processes correct. For example,
if $f'(t)$ exists and is continuous,

$$\int_a^b f(t)\,dB(t) = f(t)[B(t) - B(0)]\,\Big|_a^b - \int_a^b [B(t) - B(0)]f'(t)\,dt\;{}^{8} \tag{3.6}$$

with probability 1. The usual Riemann-Stieltjes sums converge to (3.5) in the
mean. Moreover

$$\begin{aligned}
E\left\{\int_a^b f(t)\,dB(t)\right\} &= 0, \\
E\left\{\left[\int_a^b f(t)\,dB(t)\right]\left[\int_a^b g(t)\,dB(t)\right]\right\} &= \sigma^2\int_a^b f(t)g(t)\,dt.
\end{aligned} \tag{3.7}$$

Now it can be shown even without the hypothesis of the finiteness of the second
moment in (3.4) that the formal integral in (3.5) can be defined, and will satisfy
(3.6). The form of the characteristic function of $B(s + t) - B(s)$ has been
derived by Lévy ((14) Chapter VII) and using this it is easy to prove that the
usual Riemann-Stieltjes sums for the integral (3.5) converge in probability.
The integral is defined as the limit, and (3.6) is readily verified. On the other
hand, (3.7) cannot be expected to hold, since if $f(t) \equiv 1$ the integral becomes
$B(b) - B(a)$, and we have not supposed that the expectation of this difference
is finite. The special case in which the second moment is finite is the only
important one for the purposes of this section, but less restrictive conditions will
be needed in §5. We shall justify later the assumption that the $B(t)$ process
is a differential process.

8 We never write $B(t)$ alone in an equation, since strictly speaking only differences like
$B(t) - B(0)$ are defined. It is unnecessary to define $B(0)$ itself, although for convenience
it can be taken identically 0, without affecting any of the equations to be used. Differential
processes have been discussed in detail by Lévy ((12), (13), (14) Chapter VII) and Doob
((3) §3).

We shall interpret an equation in differentials like (3.3) to mean the truth
(with probability 1, that is for almost all functions $u(t)$) of

$$\int_a^b f(t)\,du(t) = -\beta\int_a^b f(t)u(t)\,dt + \int_a^b f(t)\,dB(t) \tag{3.8}$$

for all $a$, $b$, whenever $f$ is a continuous function. Here the first two integrals
are to be defined as the limits (in probability) of the usual Riemann or Riemann-Stieltjes
sums. Equation (2.2.7) implies

$$\int_a^b f(t)\,du(t) = -\beta\int_a^b f(t)\,dx(t) + \int_a^b f(t)\,dB(t) = -\beta\int_a^b f(t)u(t)\,dt + \int_a^b f(t)\,dB(t). \tag{3.9}$$

Thus (3.3) holds for the $u(t)$ of the O. U. distribution if the $B(t)$ is defined by
(2.2.7). Moreover (2.2.7) with $B(t)$ replaced by $B(t) - B(0)$ is an immediate
consequence of (3.3). In this case, $B(t)$ has the property that the differences
$B(s + t) - B(s)$ have finite second moments, and even Gaussian distributions,
but we are not making either assumption in solving (3.3).

If (3.3) is true, then (with probability 1) 

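(3.10)  $\int_0^t e^{\beta\tau}\,du(\tau) = -\beta \int_0^t e^{\beta\tau}u(\tau)\,d\tau + \int_0^t e^{\beta\tau}\,dB(\tau),$

which implies, since integration by parts is applicable,

(3.11)  $u(t) = u(0)e^{-\beta t} + e^{-\beta t}\int_0^t e^{\beta\tau}\,dB(\tau)$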

for all t, with probability 1. Conversely suppose that u(t) is defined by (3.11). 
Since B(t) is known to be continuous in t except for non-oscillatory discontinuities
(jumps) (Lévy (12) pp. 359-364, (13); Doob (3), pp. 134-138), the
same must be true of the right side of (3.11), and therefore of u(t). Then u(t) 
is Riemann integrable with probability 1. Moreover 

(3.12)  $\int_a^b f(t)e^{-\beta t}\,d_t\!\int_0^t e^{\beta\tau}\,dB(\tau) = \int_a^b f(t)\,dB(t),$

so that from (3.11)

(3.13)  $\int_a^b f(t)e^{-\beta t}\,d[e^{\beta t}u(t) - u(0)] = \int_a^b f(t)\,dB(t),$

proving incidentally that the left side exists. The left side can be simplified to

(3.14)  $\beta \int_a^b f(t)u(t)\,dt + \int_a^b f(t)\,du(t),$

and putting this into (3.13) we find that (3.8) is satisfied. Then (3.11) furnishes 
the complete solution of (3.3) under the stated conditions. We stress again 
that although (3.11) implies strong connections between the u(t) and B(t) processes,
we have made no such hypothesis in the derivation not implicit in (3.3).
Lévy ((14) pp. 166-167) has shown that the only differential processes whose
path functions B(t) - B(0) do not have jumps have the property that the
distribution of B(t) - B(0) is Gaussian. Then it is only in this case, which will
lead to the O. U. process, that u(t) will not have jumps.
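
As a rough numerical illustration of the jump case, the sketch below takes B(t) to be a pure step function (a hypothetical path with a few jumps, chosen only for illustration), evaluates u(t) from (3.11) as a finite sum over the jumps, and checks the integrated form of (3.3), that is (3.8) with f = 1 on [0, t]. The function names, the jump list and the midpoint quadrature are assumptions of this sketch and are not part of the original argument.

```python
import math

def u_from_jumps(t, u0, beta, jumps):
    """Evaluate (3.11) when B is a step function.

    `jumps` is a list of (time, size) pairs (hypothetical data); the
    integral in (3.11) reduces to a finite sum over jumps up to time t.
    """
    total = u0 * math.exp(-beta * t)
    for tau, size in jumps:
        if tau <= t:
            total += size * math.exp(-beta * (t - tau))
    return total

def b_increment(t, jumps):
    """B(t) - B(0) for the same step function."""
    return sum(size for tau, size in jumps if tau <= t)

def integral_of_u(t, u0, beta, jumps, steps=20000):
    """Midpoint-rule approximation of the Riemann integral of u over [0, t]."""
    h = t / steps
    return h * sum(u_from_jumps((k + 0.5) * h, u0, beta, jumps)
                   for k in range(steps))

def residual(t, u0, beta, jumps):
    """u(t) - u(0) + beta * int_0^t u ds - (B(t) - B(0)); should be near 0."""
    lhs = u_from_jumps(t, u0, beta, jumps) - u0
    rhs = -beta * integral_of_u(t, u0, beta, jumps) + b_increment(t, jumps)
    return lhs - rhs
```

The residual vanishes up to quadrature error, and u(t) inherits a jump of the same size wherever B(t) jumps, in line with the remark that u(t) is free of jumps only in the Gaussian case.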

The term βu(t) in the Langevin equation is supposed to account for the total
frictional effect, including the Doppler friction, caused by the fact that more
impacts decelerate than accelerate the motion of a moving particle. The term
A(t) in (3.1) or dB(t) in (3.3) represents the “purely random” impulses, that is,
the residual effect after the frictional effect has been subtracted out. One idea
running through any treatment of the Langevin equation is that this term or,
sometimes, x(t) itself, is independent of the given velocity at any time. This
hypothesis goes back to Langevin, and has caused much controversy. We shall
make the hypothesis only to the following extent. The chance variable u(0)
will be given various initial distributions, but will always be made independent
of the B(t)-process for t ≥ 0. This means that if 0 ≤ t_1 < ... < t_n the chance
variable u(0) is supposed independent of the set of chance variables

{B(t_{j+1}) - B(t_j), j = 1, ..., n - 1}.

We shall describe the above hypothesis in the following physical terms: the
initial velocity u(0) is independent of later residual random impacts. It would
be a serious drawback to the whole treatment if when u(0) is so chosen u(t_0)
for each t_0 > 0 were not independent of the B(t)-process for t ≥ t_0, that is if
u(t_0) were not independent of later residual random impacts for all t_0. We
can prove, however, the following statement, which incidentally justifies our
hypothesis that the B(t)-process is a differential process. Let the B(t)-process
be a differential process, and define u(t) by (3.11). If the chance variable u(0) is
independent of the B(t)-process for t ≥ 0, then u(t_0) will be independent of the
B(t)-process for t ≥ t_0, for all t_0 > 0. Conversely suppose only that the B(t)-process
is regular enough that the integral (3.5) can be defined as the limit in probability
of the usual sums, and that (3.6) is true. Then if u(t) is defined by (3.11), and
if choosing u(0) independent of the B(t)-process for t ≥ 0 implies that u(t_0) will be
independent of the B(t)-process for t ≥ t_0, for all t_0 > 0, then the B(t)-process is a
differential process.





Proof. Let the B(t)-process be a differential process, define u(t) by (3.11)
and let u(0) be independent of the B(t)-process for t ≥ 0. Then from (3.11)
with t = t_0, u(t_0) involves only u(0) and the B(t)-process for t ≤ t_0. Then
u(t_0) is independent of the B(t)-process for t ≥ t_0 because the B(t)-process is a
differential one, with differences involving t-values beyond t_0 independent of
those involving t-values before t_0. Conversely suppose that choosing u(0) independent
of the B(t)-process for t ≥ 0 implies that u(t_0) will be independent of
the B(t)-process for t ≥ t_0, for all t_0 > 0. Then if u(0) is so chosen,

$u(0) + \int_0^{t_0} e^{\beta\tau}\,dB(\tau)$

and therefore

$\int_0^{t_0} e^{\beta\tau}\,dB(\tau)$

are independent of the B(t)-process for t ≥ t_0. This fact implies that the preceding
integral determines a differential process, that is, if t_1 < ... < t_n, the
integrals

$\int_{t_j}^{t_{j+1}} e^{\beta\tau}\,dB(\tau), \quad j = 1, \dots, n - 1$

are mutually independent. Then (applying this fact to subintervals of the
intervals (t_j, t_{j+1}))

$\int_{t_j}^{t_{j+1}} e^{-\beta t}\,d_t\!\int_{t_j}^{t} e^{\beta\tau}\,dB(\tau), \quad j = 1, \dots, n - 1$

are mutually independent, and these repeated integrals are simply

$B(t_{j+1}) - B(t_j), \quad j = 1, \dots, n - 1.$

The latter differences are therefore mutually independent, as was to be proved.
We shall need the following lemma.

Lemma 3. Suppose that a < 1, and let x_0, x_1, ... be mutually independent
chance variables with a common distribution function. If there is a chance variable
y with a Gaussian distribution such that the distribution function of $\sum_{j=0}^{n-1} a^{n-j} x_j$
approaches that of y as n → ∞, then the x_j have Gaussian distributions.

Many of the hypotheses of the lemma are unnecessary, but its statement is
general enough for our purposes, and the proof will apply to a situation to be
discussed in §5, where the distribution of y will not be Gaussian. The hypotheses
imply that the distribution of $\sum_{j=m}^{n} a^j x_j$ approaches that of a^{m-1} y as n → ∞.
If φ(t) is the characteristic function of x_1 and ψ(t) that of y, writing $\sum_{j=1}^{n} a^j x_j$
in the form $a x_1 + \sum_{j=2}^{n} a^j x_j$ shows that

$\psi(t) = \varphi(at)\,\psi(at).$

Solving for φ we find that it is the characteristic function of a Gaussian distribution,
as was to be proved.
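
The last step of the proof can be written out explicitly. The following is a sketch under stated assumptions: the functional relation is taken to be ψ(t) = φ(at)ψ(at), which is what the decomposition of the sum into a x_1 plus an independent remainder gives, y is taken Gaussian with mean 0, the symbol s^2 for its variance is notation introduced here, and 0 < a < 1.

```latex
% Assumptions: \psi(t) = \varphi(at)\,\psi(at), 0 < a < 1,
% y Gaussian with mean 0 and variance s^2 (s^2 is notation introduced here).
\psi(t) = e^{-s^2 t^2/2},
\qquad
\varphi(at) = \frac{\psi(t)}{\psi(at)} = e^{-s^2 (1 - a^2)\, t^2/2},
\qquad\text{so}\qquad
\varphi(t) = \exp\!\Big(-\frac{s^2 (1 - a^2)}{2 a^2}\, t^2\Big).
% Consistency check on the variances:
% \sum_{j \ge 1} a^{2j} \cdot \frac{s^2 (1 - a^2)}{a^2} = s^2 .
```

Thus each x_j is Gaussian with mean 0 and variance s^2(1 - a^2)/a^2, and the geometric series of variances indeed sums back to s^2.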





In the physical picture under discussion, further conditions on the solution of 
(3.3) are known. In fact the Brownian movement is simply a visible example 
of molecular or near molecular movement. The general principles of such move¬ 
ments are therefore applicable, and the principle of equipartition of energy leads 
to the Maxwell distribution of velocities. Let k be the Boltzmann constant, 
and T the absolute temperature. We can formulate the significance of the 
Maxwell distribution (as much as we shall need it) as follows. 

M_1. Tendency towards the Maxwell distribution. Whatever the initial distribution
of u(0), the transition probabilities have the property that when t → ∞
the distribution function of u(t) converges to the Gaussian distribution function
with mean 0 and variance kT/m. (Here m is the mass of the moving particle.)

M_2. Stability of the Maxwell distribution. If u(0) is independent of later
residual random impacts, and if it has the Gaussian distribution described in
M_1, u(t) will have this same distribution for every positive t.

These two statements are closely related, but neither apparently can be deduced
from the other without further assumptions. Since these principles act
the part of a deus ex machina in a discussion of the Langevin equation, we shall
use them as little as possible. It will usually be sufficient to use a weakened
form of M_1:

M_1'. There is an initial distribution of u(0), such that the transition probabilities
have the property that when t → ∞ the distribution function of u(t)
converges to the Gaussian distribution function with mean 0 and variance kT/m.
It is understood here as before that u(0) is to be independent of later residual
random impacts.

Conditions M_1' and M_2 restrict the possibilities for the B(t)-process. In fact
suppose that condition M_1' is satisfied. Then (3.11) shows that

$e^{-\beta t}\int_0^t e^{\beta\tau}\,dB(\tau)$

is nearly Gaussian for large t, with mean 0 and variance kT/m. We write this
integral as a sum, replacing t by nt:

(3.15)  $e^{-\beta nt}\int_0^{nt} e^{\beta\tau}\,dB(\tau) = \sum_{j=0}^{n-1} e^{-\beta(n-j)t}\, x_j,$

where

(3.16)  $x_j = \int_{jt}^{(j+1)t} e^{\beta(\tau - jt)}\,dB(\tau).$

Since the B(t)-process is a differential process, and is temporally homogeneous,
the x_j are mutually independent, with identical distributions. According to
the lemma, the right side of (3.15) cannot become Gaussian for large t unless
the distribution of x_1 is Gaussian. Thus, since t is arbitrary in the above
discussion,

$\int_s^{s+t} e^{\beta\tau}\,dB(\tau)$

has a Gaussian distribution for all s, t. Since the chance variables

(3.17)  $\int_{jt/n}^{(j+1)t/n} e^{\beta\tau}\,dB(\tau), \quad j = 0, \dots, n - 1$

are mutually independent and Gaussian, the chance variable

(3.18)  $\sum_{j=0}^{n-1} e^{-\beta jt/n} \int_{jt/n}^{(j+1)t/n} e^{\beta\tau}\,dB(\tau)$

also has a Gaussian distribution. When n becomes infinite, (3.18) becomes
B(t) - B(0), with probability 1. The latter difference thus has a Gaussian
distribution, with mean 0. The B(t)-process therefore has finite second moments
σ²(t) = tσ² as defined in (3.4). According to (3.7) the last term in (3.11),
which we now know has a Gaussian distribution, has mean 0 and variance
σ²(1 - e^{-2βt})/2β. Then u(t) - e^{-βt}u(0) has this same distribution. The variance
becomes σ²/2β when t → ∞, and therefore, according to M_1', σ² = 2βkT/m.
Thus condition M_1' completely determines the B(t)-process. We show next that
condition M_2 determines this same B(t)-process. In fact suppose condition M_2
is true, and assign to u(0) the distribution of that condition. Then u(0) is independent
of the integral in (3.11), and in (3.11), u(t) (which has a Gaussian
distribution, according to condition M_2) is expressed as the sum of two independent
chance variables, of which the first is Gaussian. The characteristic
function of the second is the quotient of the characteristic functions of two
Gaussian distributions, and is therefore the characteristic function of a Gaussian
distribution. Thus the expression

(3.19)  $e^{-\beta t}\int_0^t e^{\beta\tau}\,dB(\tau)$


has a Gaussian distribution for all t, and this implies, as above, that B(t) - B(0)
has a Gaussian distribution, with variance σ²t. The variances on the right side
of (3.11) add up to that on the left, giving an equation for σ²:

(3.20)  $\frac{kT}{m} = e^{-2\beta t}\,\frac{kT}{m} + \frac{1 - e^{-2\beta t}}{2\beta}\,\sigma^2.$

Then σ² = 2βkT/m as above.
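
The variance bookkeeping behind (3.20) can be checked numerically. The sketch below assumes only that u(0) is independent of the B(t)-process and that B has variance σ² per unit time, so that the variance of u(t) in (3.11) is e^{-2βt} Var u(0) + σ²(1 - e^{-2βt})/2β. The function and parameter names are hypothetical and chosen for this illustration.

```python
import math

def variance_u(t, var0, beta, sigma2):
    """Variance of u(t) in (3.11) when u(0) is independent of the B-process.

    var0 is the variance of u(0); sigma2 is the variance of B per unit time.
    """
    decay = math.exp(-2.0 * beta * t)
    return decay * var0 + sigma2 * (1.0 - decay) / (2.0 * beta)

def sigma2_from_maxwell(beta, kT_over_m):
    """Solve (3.20) for sigma^2, given the Maxwell variance kT/m."""
    return 2.0 * beta * kT_over_m
```

With σ² = 2βkT/m the Maxwell variance kT/m is a fixed point (the stability condition M_2), and any other initial variance relaxes to kT/m as t grows (the tendency condition M_1), which is the content of the two derivations above at the level of second moments.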

We can now finally derive the O. U. velocity process as the solution of the
Langevin equation. Suppose the B(t)-process is the one derived in the preceding
paragraphs, and choose the chance variable u(0) to be independent of the
B(t)-process for t ≥ 0. Then u(0) is independent of the integral in (3.11), and
this means that the conditional distribution of u(t) for u(0) = u_0 is Gaussian,
with mean u_0 e^{-βt} and variance kT(1 - e^{-2βt})/m. Moreover, (3.11) implies

(3.21)  $u(s + t) = u(s)e^{-\beta t} + e^{-\beta(s+t)}\int_s^{s+t} e^{\beta\tau}\,dB(\tau).$


As we have seen, u(s) is independent of the B(t)-process as far as it appears
in (3.21) and therefore is independent of the integral. Thus the transition
probabilities from s to s + t are the same as those from 0 to t, which are precisely
those of the O. U. process. Incidentally it follows that the full condition
M_1 is satisfied. Finally, if u(0) is not only supposed independent of the B(t)-process
for t ≥ 0, but also is supposed to have a Gaussian distribution with
mean 0 and variance kT/m, the same will be true of u(t) (as can be calculated
from (3.11)) and condition M_2 is thus satisfied. We can summarize all our
results as follows.

Theorem 3. Let the B(t)-process be a temporally homogeneous differential
process. Then (3.11) furnishes the solution of (3.3). The following conditions on
the solution are equivalent.

(i) The solution satisfies condition M_1.

(ii) The solution satisfies condition M_1'.

(iii) The solution satisfies condition M_2.

(iv) B(t) - B(0) has a Gaussian distribution, with mean 0 and variance
σ²t = 2βkTt/m.

If the above conditions are satisfied, u(t) - e^{-βt}u(0) will have a Gaussian distribution
with mean 0 and variance kT(1 - e^{-2βt})/m; if u(0) is independent of the
B(t)-process for t ≥ 0, u(s) is independent of the B(t)-process for t ≥ s for all
s > 0, and the transition probabilities of the u(t)-process are those of the O. U.
velocity process. If in addition u(0) has the Gaussian distribution with mean 0
and variance kT/m, the u(t)-process becomes the O. U. process, with m = 0,
σ_0² = kT/m.

The Langevin equation gives a physical interpretation to every property of
the O. U. process. It is interesting to verify that as h → 0 the correlation
coefficient of the pair B(s + h) - B(s), u(t) (any s, t) goes to 0. In this sense
then, dB(s), the effect of the residual random impacts at time s, is independent
of the velocity at any particular time t. Since in (3.11) u(t) is written in terms
of the B(t)-process, u(t) is of course not independent of this process.

We have written u(t) in terms of the B(t)-process. It is easy to write x(t)
in terms of the B(t)-process by combining (2.1) with (3.11):

(3.22)  $x(t) = x(0) + \frac{1 - e^{-\beta t}}{\beta}\,u(0) + \frac{1}{\beta}\int_0^t \big[1 - e^{-\beta(t-\tau)}\big]\,dB(\tau).$

Instead of finding the distributions of the displacement and velocity processes, 
and their correlations, as at the beginning of the paper, we could easily derive 
the desired results using (3.11) and (3.22). The various expectations can be 
calculated using (3.7). 
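
The step from (3.11) to (3.22) amounts to one interchange of the order of integration. The derivation below is a sketch; it assumes that (2.1), which is not reproduced in this part of the paper, reads x(t) = x(0) + ∫_0^t u(s) ds.

```latex
% Assumption: (2.1) is x(t) = x(0) + \int_0^t u(s)\,ds.
\int_0^t u(s)\,ds
  = u(0)\int_0^t e^{-\beta s}\,ds
    + \int_0^t e^{-\beta s}\int_0^s e^{\beta\tau}\,dB(\tau)\,ds
  = \frac{1 - e^{-\beta t}}{\beta}\,u(0)
    + \int_0^t e^{\beta\tau}\Big(\int_\tau^t e^{-\beta s}\,ds\Big)\,dB(\tau)
  = \frac{1 - e^{-\beta t}}{\beta}\,u(0)
    + \frac{1}{\beta}\int_0^t \big[1 - e^{-\beta(t-\tau)}\big]\,dB(\tau).
```

Adding x(0) to both sides gives the displacement in terms of u(0) and the B(t)-process alone.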

In physical applications, the correlation function E{u(s)u(s + t)} is sometimes
wanted as a time average. Now the transformation S_h taking B(t) - B(0) into
B(t + h) - B(h) preserves the B(t) probability relations (temporal homogeneity),
and the family of transformations {S_h} is well known to be metrically
transitive.9 Then applying the ergodic theorem to the function u(0)u(h), considered
as a function of the B(t), we find that


9 Cf. for example Doob, (3) p. 125.





(3.23)  $\lim_{t\to\infty} \frac{1}{t}\int_0^t u(s)u(s + h)\,ds = E\{u(0)u(h)\} = \frac{kT}{m}\,e^{-\beta|h|},$

with probability 1, that is for almost all functions u(t). The ergodic theorem
was applied to the B(t)-process in essentially this way by Wiener ((18) p. 169)
who has been interested in the harmonic analysis of functions like the u(t) discussed
here. The work of this paper verifies in this particular case the importance
Wiener gave to the functions of the B(t)-process of the type (3.5).
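
A small simulation can illustrate the time-average statement. The sketch below is an assumption-laden illustration and not part of the original argument: it samples the stationary O. U. velocity process on a grid using the exact Gaussian transition law (mean u e^{-βΔ}, variance (kT/m)(1 - e^{-2βΔ})), then compares the time average of u(s)u(s + h) along one long path with (kT/m)e^{-βh}. The function names, step size, path length and seed are all choices made here.

```python
import math
import random

def simulate_ou(n_steps, dt, beta, kT_over_m, seed=0):
    """Sample a stationary O. U. velocity path on a grid of spacing dt.

    Uses the exact one-step transition: Gaussian with mean a*u and
    variance (kT/m)(1 - a^2), where a = exp(-beta*dt).
    """
    rng = random.Random(seed)
    a = math.exp(-beta * dt)
    noise_sd = math.sqrt(kT_over_m * (1.0 - a * a))
    u = rng.gauss(0.0, math.sqrt(kT_over_m))  # Maxwell initial distribution
    path = [u]
    for _ in range(n_steps):
        u = a * u + noise_sd * rng.gauss(0.0, 1.0)
        path.append(u)
    return path

def time_average_correlation(path, lag_steps):
    """Time average of u(s) * u(s + h) along one path, with h = lag_steps * dt."""
    n = len(path) - lag_steps
    return sum(path[k] * path[k + lag_steps] for k in range(n)) / n
```

For a long path the time average at lag h should be close to (kT/m)e^{-βh}; the agreement is only statistical, so any comparison needs a tolerance.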

There is no difficulty in extending the above results to bound particles. For 
example, the Langevin equation of the harmonically bound particle is 

(3.24)  $\frac{du}{dt} = -\beta u - \omega^2 x + A(t),$

which in our treatment becomes

(3.25)  $du = -\beta u\,dt - \omega^2 x\,dt + dB.$

The usual methods of solving the differential equation (3.24) are still applicable
to (3.25) and again the distribution of u turns out to be Gaussian.10 The
distribution of displacements is then obtained as above.

4. The B(t)-impact process

When the B(t)-process and the initial conditions on u(0) are given, the solution
u(t) is determined by (3.11). Conversely if the solution u(t) is known,
B(t) is determined by the equation

(4.1)  $B(t) - B(0) = \beta\int_0^t u(s)\,ds + u(t) - u(0)$

which is derived immediately from (3.3). The O. U. velocity distribution for
the u(t)-process can therefore be given only by the B(t)-process described in §3.
We shall investigate the possibility that a different choice of the B(t)-process
might have led to a different velocity process compatible with the known
physical conditions like M_1 and M_2. If we suppose that u(0) can be chosen so
that the velocity at each moment is independent of subsequent residual random
impacts, then we have seen that the B(t)-process must be differential, and is
then uniquely determined by conditions M_1' or M_2. Any velocity process other
than the O. U. process satisfying the Langevin equation and M_1' or M_2 would
therefore imply dependence between velocity and later residual impacts. This
is really another way of saying that the frictional resistance cannot be considered
as proportional to the velocity. Before going further we put a condition
going back to Maxwell in its modern setting. We formulate a hypothesis
M_3 as follows.

M_3. In two or more dimensions (using any orthogonal axes) the velocity
components are mutually independent.

10 Cf. Ornstein and van Wijk (16) and van Wijk (17). The B(t, A) used in these papers corresponds
formally to our dB(t). The difference is that it is possible to give a precise description
of the B(t)-distribution.





In conjunction with the following lemma, due to Kac ((8) p. 278), hypothesis
M_3 implies that all quantities linear in the displacement or velocity functions
have Gaussian distributions.

Lemma.11 Let (x_1, y_1), ..., (x_n, y_n) be 2n chance variables with the property
that the sets of chance variables

{x_j cos θ + y_j sin θ, j = 1, ..., n}   {-x_j sin θ + y_j cos θ, j = 1, ..., n}

are mutually independent for each value of θ. Then (x_1, ..., x_n) have an n-variate
Gaussian distribution or a singular Gaussian distribution.

We can combine the Maxwell hypotheses to obtain another justification of 
the O. U. velocity process. 

Theorem 4. Let the B(t)-process be any process such that the distribution of
B(t_2) - B(t_1) or of any quantity depending on such differences is unaffected by
translations of the t-axis, and that the integral (3.5) can be defined as the limit in
probability of the usual sums, with (3.6) valid. Then if u(t) is defined by (3.11),
and if conditions M_2 and M_3 are satisfied, the B(t)-process must be precisely that
finally obtained in §3, leading to the O. U. velocity process.

Suppose that condition M_2 is satisfied, and let u(0) be fixed as in that condition.
Just as in §3, (3.11) then implies that the integral

$B^*(t) = \int_0^t e^{\beta\tau}\,dB(\tau)$

has a Gaussian distribution with mean 0 and variance (kT/m)(e^{2βt} - 1). If
condition M_3 is also satisfied, B*(t_2) - B*(t_1), and more generally any finite set
of such differences, has a one or more dimensional Gaussian distribution. Using
the fact that the distribution of e^{-βs}[B*(s + t) - B*(s)] is the same as that of
B*(t), in evaluating the expectations in the following equation

(4.2)  $E\{B^*(s + t)^2\} = E\{[B^*(s) + [B^*(s + t) - B^*(s)]]^2\},$

we find that B*(s) = B*(s) - B*(0) and B*(s + t) - B*(s) are uncorrelated.
These two variables are therefore independent. Going further, similar calculations
show that any differences B*(t_2) - B*(t_1), B*(s_2) - B*(s_1) with 0 ≤ s_1 <
s_2 ≤ t_1 < t_2 are independent. Using the fact (derived from condition M_3) that
any finite set of differences has a multivariate Gaussian distribution, the B*(t)-process
is thus a differential process. This means, by a method we have used
above, that the B(t)-process is a differential process, leading to the O. U. velocity
distribution, because condition M_2 is satisfied.

It is easily seen from counterexamples that Theorem 4 is no longer correct if
condition M_1 is supposed instead of condition M_2.

5. Velocity processes not subject to Maxwell’s laws

In all the above work the role of the Maxwell velocity distribution has been 
fundamental. In certain studies, however, other distributions play a somewhat 


11 The result is stated slightly incorrectly by Kac.





analogous role.12 It is interesting to note that the Langevin equation can be
solved to give a distribution whose transition probabilities are asymptotically
any of the symmetric stable distributions classified by Lévy ((14) §30, §56, §57).
Such a distribution has characteristic function

$e^{-\frac{1}{2}\sigma_0^2 |t|^\gamma}$

where σ_0² is a positive parameter and 0 < γ ≤ 2. The Gaussian distribution is
obtained when γ = 2. The parameter σ_0² plays the role of the variance, although
the second moment is never finite when γ < 2. The velocity process we shall
derive will be called the O. U. (γ) process. It is the O. U. process when γ = 2.
The O. U. (γ) process can be described as follows.

1. The process is temporally homogeneous, that is translations of the t-axis
do not affect the probability distributions.

2. The process is a Markoff process.

3. For each fixed t, u(t) has a symmetric stable distribution with parameter
value σ_0², exponent γ. The conditional distribution of u(s + t) for u(s) = u_0
is the stable distribution symmetric about u_0 e^{-βt}, with parameter value
σ_0²(1 - e^{-γβt}) and exponent γ.

We can obtain this process as a solution of the Langevin equation by choosing
the B(t)-process properly. In fact, let the B(t)-process be the temporally
homogeneous differential process in which B(s + t) - B(s) has a symmetric
stable distribution with exponent γ and parameter value σ²t. Let u(t) be the
corresponding solution of the Langevin equation, given by (3.11). If y is the
sum of two independent chance variables with stable symmetric distributions,
having parameter values σ_1², σ_2², and with the same exponent γ, then y also has a
symmetric stable distribution, with the same exponent, γ, and with parameter
value σ_1² + σ_2². From this fact it is simple to check that the integral (3.5) in
the present case has a symmetric stable distribution with exponent γ and
parameter value

$\sigma^2 \int_a^b |f(t)|^\gamma\,dt.$

If u(0) is given a symmetric stable distribution independent of the B(t)-process
for t ≥ 0, with parameter value σ²/γβ, the distribution of u(t) can be calculated,
using characteristic functions, and is found to be symmetric and stable, with
exponent γ and parameter value σ²/γβ. The u(t) thus defined determines an
O. U. (γ) process, with the above three properties, setting σ_0² = σ²/γβ.
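
The calculation of the parameter value of u(t) can be sketched as follows. Two assumptions are made here and should be read as such: multiplying a symmetric stable variable of exponent γ by a constant c multiplies its parameter value by |c|^γ, and the integral (3.5) has parameter value σ² ∫_a^b |f(t)|^γ dt. The notation par( ) for the parameter value is introduced only for this sketch.

```latex
% Assumptions: par(c\,y) = |c|^{\gamma}\,par(y);
% par\big(\int_a^b f\,dB\big) = \sigma^2 \int_a^b |f(t)|^{\gamma}\,dt;
% u(0) independent of the B(t)-process, with par(u(0)) = \sigma^2/(\gamma\beta).
par\big(u(t)\big)
  = e^{-\gamma\beta t}\,\frac{\sigma^2}{\gamma\beta}
    + \sigma^2 \int_0^t e^{-\gamma\beta (t - \tau)}\,d\tau
  = e^{-\gamma\beta t}\,\frac{\sigma^2}{\gamma\beta}
    + \frac{\sigma^2}{\gamma\beta}\big(1 - e^{-\gamma\beta t}\big)
  = \frac{\sigma^2}{\gamma\beta}.
```

The second term alone, with σ_0² = σ²/γβ, is the parameter value σ_0²(1 - e^{-γβt}) of the transition distribution in property 3, and for γ = 2 the computation reduces to the variance equation (3.20).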

We shall not spend any time on the details of the analysis of the O. U. (γ)
process, since the work runs parallel to that for the case γ = 2, already discussed.
There are, however, a few essential differences. If v(t) is determined by the
equation

(1.2.1)  $v(t) = t^{1/\gamma}\,u\!\left(\frac{1}{\gamma\beta}\log t\right), \quad t > 0,$


12 Cf. Holtzmark (7).





the v(t)-process can be analyzed using (3.11). The v(t)-process has the same
distribution as the B(t)-process just described. The continuity properties of the
velocity process can now be derived from those of the v(t)-process, which are
known. When γ < 2, the velocity function u(t) is no longer a continuous function
of t with probability 1, but is certain to have discontinuities. These discontinuities
are however non-oscillatory (jumps).13 We omit the details of the
analogue of Theorem 1.3. Theorem 2.1 is still true if γ ≥ 1. The considerations
of §3 have their obvious counterparts here. Lemma 3 played an essential
role, but its statement and proof are correct if the variable y of the lemma is
supposed to have a symmetric stable distribution and if the conclusion is that
the x_j have a symmetric stable distribution with the same exponent as y.

University of Illinois 
and 

Institute for Advanced Study 

Bibliography 

1. S. Bernstein, Comptes Rendus de l’Académie des Sciences de l’URSS, N.S. (1934), pp. 1-9.

2. S. Bernstein, Comptes Rendus de l’Académie des Sciences de l’URSS, N.S. (1934), pp. 361-364.

3. J. L. Doob, Transactions of the American Mathematical Society 42 (1937), pp. 107-140.

4. J. L. Doob, Duke Mathematical Journal 6 (1940), pp. 290-306.

5. J. L. Doob, Transactions of the American Mathematical Society 47 (1940), pp. 455-486.

6. G. L. de Haas-Lorentz, Die Brownsche Bewegung und einige verwandte Erscheinungen, Braunschweig (1913).

7. J. Holtzmark, Annalen der Physik 58 (1919), pp. 577-630.

8. M. Kac, American Journal of Mathematics 61 (1939), pp. 726-728.

9. A. Khintchine, Mathematische Annalen 109 (1934), pp. 604-615.

10. A. Khintchine, Ergebnisse der Mathematik 4 No. 3.

11. G. Krutkow, Physikalische Zeitschrift der Sowjet-Union 5 (1934), pp. 287-300.

12. P. Lévy, Pisa Annali Series 2 vol. 3 (1934), pp. 337-366.

13. P. Lévy, Pisa Annali Series 2 vol. 4 (1935), pp. 217-218.

14. P. Lévy, Théorie de l’addition des variables aléatoires, Paris 1937.

15. L. S. Ornstein and G. E. Uhlenbeck, Physical Review 36 (1930), pp. 823-841.

16. L. S. Ornstein and W. R. van Wijk, Physica 1 (1934), pp. 235-254, errata p. 966.

17. W. R. van Wijk, Physica 3 (1936), pp. 1111-1119.

18. N. Wiener and R. E. A. C. Paley, Fourier Transforms in the Complex Domain, American Mathematical Society Colloquium Publications Vol. XIX.

13 For further details, cf. Lévy (14) Chapter VII.



Annals of Mathematics
Vol. 43, No. 2, April, 1942


A NEW HOMOLOGY THEORY 

By W. Mayer 
(Received January 20, 1942) 

In the classical homology theory one considers a sequence C^0, C^1, ... of additive
groups and homomorphisms C^{i+1} → C^i such that the induced homomorphisms
C^{i+2} → C^i are trivial, i.e. C^{i+2} is mapped into the zero of C^i. In the
present paper we propose to develop a new homology theory which also uses a
sequence C^0, C^1, ... of additive groups and homomorphisms C^{i+1} → C^i. In this
new theory, however, a fixed prime number p is chosen, and the induced homomorphisms
C^{i+p} → C^i are the ones which are trivial. These groups for p = 3
were already considered at length, but by a different method, in the following
papers: Mayer-Campbell, Generalized Homology Groups, Proc. Nat. Acad. Sci.,
U. S. A., 26, 655-656 (1940), and Generalized Homology Groups, to be published
shortly in Revista de Matemáticas y Física Teórica.

1. Simplicial systems 

We first define the new homology theory for a simplicial system Σ, i.e. a collection
{σ} of (non-oriented) simplexes such that the faces of any simplex σ ⊂ Σ also
belong to Σ. In addition to the simplexes of Σ, henceforth called simple
simplexes, we shall consider simplexes with repeated vertices, or generalized
simplexes

(1.1)  $(P_1^{\alpha_1} P_2^{\alpha_2} \cdots P_r^{\alpha_r})$

where P_i ≠ P_k and the integers α_i ≥ 1 indicate the multiplicity of the corresponding
vertex.

The simplex (1.1) shall belong to Σ if and only if the simple simplex
(P_1 P_2 ... P_r) belongs to Σ. The dimension of the generalized simplex (1.1) is
defined to be α_1 + α_2 + ... + α_r - 1.

Hereafter we use the symbol Σ to denote the extended system of all the simple
and generalized simplexes thus obtained.

We now introduce, for a fixed prime number p (≠ 1) the group C^n of n-chains,
K^n, mod p. These n-chains K^n are made up of simplexes ρ_i^n of Σ of dimension
n which may or may not be simple:

$K^n = \sum_i a_i \rho_i^n.$

A boundary operator F is first defined for simplexes of dimension > 0 by

(1.2)  $F(P_1^{\alpha_1} \cdots P_r^{\alpha_r}) = \sum_{i=1}^{r} \alpha_i\,(P_1^{\alpha_1} \cdots P_{i-1}^{\alpha_{i-1}} P_i^{\alpha_i - 1} P_{i+1}^{\alpha_{i+1}} \cdots P_r^{\alpha_r}),$




where vertices with exponent zero are crossed out, and the coefficients are reduced
mod p. Just as in the classical theory, the formula

(1.3)  $F\Big(\sum_i a_i \rho_i^n\Big) = \sum_i a_i F(\rho_i^n)$

defines a homomorphism C^n → C^{n-1} for n > 0.

If the n-dimensional simplex (1.1) is written in the form

(1.4)  $(P_1 P_2 \cdots P_{n+1})$

where the vertices P_1, P_2, ..., P_{n+1} need not be distinct, the formula (1.2) may
be written

(1.5)  $F(P_1 \cdots P_{n+1}) = \sum_{j=1}^{n+1} (P_1 \cdots P_{n+1})_j, \quad n \ge 1,$

where (P_1 ... P_{n+1})_j is the face of (P_1 ... P_{n+1}) opposite the vertex P_j. Denote
by (P_1 ... P_{n+1})_{j_1 ... j_r} the face opposite the face (P_{j_1} ... P_{j_r}) and let
F^2( ) = F(F( )), ..., F^i( ) = F(F^{i-1}( )). The formula (1.5) enables us to calculate
rapidly the boundary of a boundary etc. Thus

$F^2(P_1 \cdots P_{n+1}) = 2 \sum_{(j_1 j_2)} (P_1 \cdots P_{n+1})_{j_1 j_2}, \quad n \ge 2,$

and in general,

(1.6)  $F^i(P_1 \cdots P_{n+1}) = i! \sum_{(j_1 \cdots j_i)} (P_1 \cdots P_{n+1})_{j_1 \cdots j_i}, \quad n \ge i,$

where the summation (j_1 ... j_i) runs over all $\binom{n+1}{i}$ combinations of 1, 2, ...,
n + 1. If n ≥ p then

(1.7)  $F^p(P_1 \cdots P_{n+1}) = 0^{n-p},$

where 0^r denotes the zero of C^r. If i < p then in general F^i(P_1 ... P_{n+1}) ≠ 0^{n-i}.
Thus, for n ≥ i the homomorphism F^i maps C^n into the zero of C^{n-i} if i ≥ p.
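
The vanishing of the p-th iterated boundary is easy to check by machine on small examples. The sketch below is an illustration under its own conventions, which are assumptions of this example: a generalized simplex is stored as a sorted tuple of vertex labels with repeats, a chain is a dictionary from simplexes to integer coefficients, and F is applied in the form (1.5), deleting each position once, with coefficients reduced mod p.

```python
from collections import Counter

def boundary(chain, p):
    """Apply the operator F of (1.5) to a chain, coefficients mod p.

    A chain maps a simplex (sorted tuple of vertex labels, repeats
    allowed) to an integer coefficient.
    """
    result = Counter()
    for simplex, coeff in chain.items():
        if len(simplex) < 2:
            raise ValueError("F is defined only for simplexes of dimension > 0")
        for j in range(len(simplex)):
            face = simplex[:j] + simplex[j + 1:]  # face opposite position j
            result[face] += coeff
    return {s: c % p for s, c in result.items() if c % p != 0}

def iterated_boundary(chain, p, times):
    """Apply F the given number of times."""
    for _ in range(times):
        chain = boundary(chain, p)
    return chain
```

For p = 3 and the simplex (a a b c), three applications of F give the zero chain, while two applications of F to (a b c) give 2a + 2b + 2c, which is not zero mod 3, in agreement with (1.6) and (1.7).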


2. q-cycles of dimension n

If 1 ≤ q < min (p, n + 1), an n-chain K^n will be called an n-dimensional
q-cycle (briefly a (q, n)-cycle) whenever

(2.1)  $F^q(K^n) = 0^{n-q}.$


By (1.3) the (q, n)-cycles form a subgroup Z_q^n of C^n.
For any (n + p - q)-chain

(2.2)  $F^q(F^{p-q}(K^{n+p-q})) = F^p(K^{n+p-q}) = 0^{n-q}.$

Hence the (p - q)th boundary of an (n + p - q)-chain is always a (q, n)-cycle.
These (p - q)th boundaries form a subgroup B_q^n of Z_q^n. The difference group

(2.3)  $H_q^n = Z_q^n - B_q^n$





will be called the q-th n-dimensional homology group of Σ (briefly the (q, n)-homology
group). It is defined for q ≤ n since (2.1) has meaning only in this
case. If q > n, so that (2.1) is not applicable, we define

(2.3)  $Z_q^n = C^n, \quad q > n.$

Thus every n-dimensional chain is a (q, n)-cycle when q > n. The group B_q^n
of (p - q)th boundaries is defined for every n and every q < p and is a subgroup
of Z_q^n; hence the group H_q^n is defined by (2.3) for every n = 0, 1, 2, ... and
every q such that 1 ≤ q < p.

3. Regular and degenerate chains 

We shall call a simplex σ degenerate if one or more of its vertices have a multiplicity
greater than p - 1; otherwise it is said to be regular. A chain will be
called regular if its simplexes are all regular and degenerate if its simplexes are all
degenerate. Thus every chain K^n can be represented uniquely as the sum of a
regular chain K^n_{(r)} and a degenerate chain K^n_{(d)}:

(3.1)  $K^n = K^n_{(r)} + K^n_{(d)}.$

Thereby we consider the zero chain as the only chain both regular and degenerate.
From (1.2) and (1.3) it follows that the boundary of a regular chain
is regular and the boundary of a degenerate chain is degenerate. Hence (for
n > 0) (3.1) implies that

(3.2)  $F(K^n) = F(K^n_{(r)}) + F(K^n_{(d)}),$

where

(3.3)  $[F(K^n)]_{(r)} = F(K^n_{(r)}), \quad [F(K^n)]_{(d)} = F(K^n_{(d)}).$

Thus the simplicial system Σ is the “direct sum” of its two sub-systems Σ_{(r)}
(of regular chains) and Σ_{(d)} (of degenerate chains):

$\Sigma = \Sigma_{(r)} + \Sigma_{(d)}.$
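
The compatibility of the boundary operator with the regular and degenerate parts can be illustrated on a small chain. The sketch below repeats a minimal boundary routine so that it is self-contained; the tuple and dictionary representation and the function names are conventions assumed for this example only.

```python
from collections import Counter

def boundary(chain, p):
    """F applied to a chain {simplex tuple: coefficient}, reduced mod p."""
    result = Counter()
    for simplex, coeff in chain.items():
        for j in range(len(simplex)):
            result[simplex[:j] + simplex[j + 1:]] += coeff
    return {s: c % p for s, c in result.items() if c % p != 0}

def is_degenerate(simplex, p):
    """True if some vertex has multiplicity greater than p - 1."""
    return max(Counter(simplex).values()) > p - 1

def split(chain, p):
    """Return the pair (regular part, degenerate part) of a chain, as in (3.1)."""
    regular = {s: c for s, c in chain.items() if not is_degenerate(s, p)}
    degenerate = {s: c for s, c in chain.items() if is_degenerate(s, p)}
    return regular, degenerate
```

Taking p = 3 and a 3-chain built from (a a a b), (a a b c) and (b b b b), one can check that splitting and then taking boundaries gives the same two chains as taking the boundary first and then splitting, which is the content of (3.2) and (3.3) on this example.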

(3.4) Theorem. The (q, n)-homology groups of Σ are isomorphic with the corresponding
groups of Σ_{(r)}.

Let K^n_{(r)} be a (q, n)-cycle of Σ_{(r)}; hence also a (q, n)-cycle of Σ. Let {K^n_{(r)}}_{(r)}
and {K^n_{(r)}} denote its respective homology classes in Σ_{(r)} and Σ. The mapping τ:

(3.5)  $\{K^n_{(r)}\}_{(r)} \to \{K^n_{(r)}\}$

defines a homomorphism τ of the (q, n)-homology group of Σ_{(r)} into the (q, n)-homology
group of Σ. The nucleus of τ is the zero class of the (q, n)-homology
group of Σ_{(r)}. In fact if {K^n_{(r)}}_{(r)} belongs to the nucleus, then there is a chain
K^{n+p-q} of Σ such that

(3.6)  $F^{p-q}(K^{n+p-q}) = K^n_{(r)}.$





If K^{n+p-q}_{(r)} denotes the regular part of K^{n+p-q}, it follows then from (3.1), (3.2)
and (3.3) that

(3.7)  $F^{p-q}(K^{n+p-q}_{(r)}) = K^n_{(r)}.$

Thus τ is univalent (= an isomorphism with a subgroup). To prove (3.4) we
merely need to show that τ is a mapping “onto,” i.e. that every class of the
(q, n)-homology group of Σ contains a regular cycle. Let {K^n} be such a class.

We may suppose that K^n is not regular, so that some vertex P appears in K^n
with a multiplicity > p - 1. The chain K^n may then be written

(3.8)  $K^n = K_1^n + K_2^n,$

where

(3.9)  $K_1^n = R^n + P R^{n-1} + \cdots + P^{p-1} R^{n-p+1}, \quad K_2^n = P^p S^{n-p},$

and the chains R^i do not have the vertex P. Since no (n - q)-simplex can
belong to the q-th boundaries of both K_1^n and K_2^n it follows from F^q(K^n) = 0^{n-q}
that

(3.10)  $F^q(K_1^n) = F^q(K_2^n) = 0^{n-q}.$

The (q, n)-cycle K_2^n whose dimension n is greater than p - 1 lies in the closure
of the star St P of P. Hence (Appendix I) K_2^n belongs to the zero class of the
(q, n)-homology group of Σ and hence, by (3.8), K_1^n belongs to the homology
class of K^n. Thus the removal of those simplexes of a cycle K^n which contain
a vertex P with a multiplicity greater than p - 1 does not alter the homology
class of K^n. Obviously K^n_{(r)} is obtained from K^n by repetition of this process.
Hence the regular (q, n)-cycle K^n_{(r)} belongs to the homology class of K^n. This
completes the proof of (3.4).

4. Invariance under Subdivision 

We shall consider here the effect of a certain elementary subdivision of the simplicial system $\Sigma$ with respect to the new homology groups. Let $(ab)$ be a 1-simplex of the not-extended system $\Sigma$ and $\gamma$ the “midpoint” of $(ab)$. We form a new simplicial system, $\Sigma(ab)$, which has all the simple simplexes of $\Sigma$ except those which contain both of the vertices $a$ and $b$. A simplex $A(ab)$, where $A$ is a simplex free from $a$ and $b$ and may be vacuous, is replaced by the simplexes

(4.1) $A(a\gamma)$, $A(b\gamma)$, $A\gamma$,

where the first two have the same dimension as $A(ab)$ and the last has a dimension lower by one. The simple simplexes of $\Sigma$ which do not contain the face $(ab)$, the simplexes (4.1), and their generalized simplexes constitute the simplicial system $\Sigma(ab)$.

In the construction of $\Sigma(ab)$ from $\Sigma$ we limited ourselves to simple simplexes because every simplicial system is determined by its simple simplexes.





(4.2) Theorem: The simplicial systems $\Sigma$ and $\Sigma(ab)$ have isomorphic $(q, n)$-homology groups.

First we construct a subsystem $\Sigma^*$ of $\Sigma(ab)$ which is isomorphic to $\Sigma$. The $n$-chains ${}^*C^n$ of $\Sigma^*$ are generated by $n$-chains of $\Sigma(ab)$ of the form

(4.3) $A(a^\nu\gamma^\mu - \gamma^{\nu+\mu} + \gamma^\nu b^\mu)$,

where $\nu, \mu = 0, 1, 2, \cdots$ are non-negative integers and $A$ are simplexes of $\Sigma(ab)$ free from the vertices $a$, $b$ and $\gamma$ and may be vacuous. It is easy to verify that $F({}^*C^{n+1}) \subset {}^*C^n$, so that $\Sigma^*$ is indeed a subsystem (not with a simplicial basis) of $\Sigma(ab)$. Furthermore, by the 1:1 correspondence

(4.4) $A(a^\nu\gamma^\mu - \gamma^{\nu+\mu} + \gamma^\nu b^\mu) \sim A(a^\nu b^\mu)$

between the bases of $\Sigma^*$ and $\Sigma$ we establish isomorphisms between the groups of the $n$-chains ${}^*C^n$ and $C^n(\Sigma)$ for $n = 0, 1, \cdots$, which for $n = 1, 2, \cdots$ commute with the boundary operator $F$, thus showing the isomorphism of the systems $\Sigma$ and $\Sigma^*$. We prove this last statement for the basic chains (4.4).

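It is trivial when $\nu = 0$ or $\mu = 0$. We therefore assume that both $\nu$ and $\mu$ are $\ge 1$. Then (from the rule of Appendix I for taking boundaries of product chains)

(4.5)
$$\begin{aligned}
F[A(a^\nu b^\mu)] &= (a^\nu b^\mu)F(A) + A[\nu a^{\nu-1}b^\mu + \mu a^\nu b^{\mu-1}] \\
&\sim (a^\nu\gamma^\mu - \gamma^{\nu+\mu} + \gamma^\nu b^\mu)F(A) + A[\nu(a^{\nu-1}\gamma^\mu - \gamma^{\nu+\mu-1} + \gamma^{\nu-1}b^\mu) \\
&\qquad + \mu(a^\nu\gamma^{\mu-1} - \gamma^{\nu+\mu-1} + \gamma^\nu b^{\mu-1})] = F[A(a^\nu\gamma^\mu - \gamma^{\nu+\mu} + \gamma^\nu b^\mu)].
\end{aligned}$$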


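Hence $\Sigma$ and $\Sigma^*$ are isomorphic.

Now we define a homomorphism of $C^n(\Sigma(ab))$ into ${}^*C^n$ by the mapping

(4.6)
$$\begin{cases}
A a^\nu\gamma^\mu \to A a^{\nu+\mu}, \\
A \gamma^\nu b^\mu \to A(a^\nu\gamma^\mu - \gamma^{\nu+\mu} + \gamma^\nu b^\mu),
\end{cases}$$

of the basic chains (simplexes) of $C^n(\Sigma(ab))$ into basic chains of ${}^*C^n$. As before, $A$ denotes simplexes free from the vertices $a$, $b$ and $\gamma$, and $\nu$ and $\mu$ are $\ge$ zero. This homomorphism also commutes with the boundary operator $F$. In fact

(4.7)
$$\begin{aligned}
F[A a^\nu\gamma^\mu] &= a^\nu\gamma^\mu F(A) + A[\nu a^{\nu-1}\gamma^\mu + \mu a^\nu\gamma^{\mu-1}] \\
&\to a^{\nu+\mu}F(A) + A[(\nu+\mu)a^{\nu+\mu-1}] = F(A a^{\nu+\mu}),
\end{aligned}$$

and

(4.8)
$$\begin{aligned}
F[A \gamma^\nu b^\mu] &= \gamma^\nu b^\mu F(A) + A[\nu\gamma^{\nu-1}b^\mu + \mu\gamma^\nu b^{\mu-1}] \\
&\to (a^\nu\gamma^\mu - \gamma^{\nu+\mu} + \gamma^\nu b^\mu)F(A) + A[\nu(a^{\nu-1}\gamma^\mu - \gamma^{\nu+\mu-1} + \gamma^{\nu-1}b^\mu) \\
&\qquad + \mu(a^\nu\gamma^{\mu-1} - \gamma^{\nu+\mu-1} + \gamma^\nu b^{\mu-1})] = F[A(a^\nu\gamma^\mu - \gamma^{\nu+\mu} + \gamma^\nu b^\mu)].
\end{aligned}$$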


The homomorphism (4.6) therefore maps $(q, n)$-cycles into $(q, n)$-cycles and $(p - q)$th boundaries into $(p - q)$th boundaries. In particular it determines a homomorphism of the $(q, n)$-homology groups which we now show to be an isomorphism.
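The commutation rules (4.7) and (4.8) can be checked mechanically on small cases. The following Python sketch is an illustration, not part of the original argument. It assumes that $A$ is vacuous, that a generalized simplex $a^i\gamma^j b^k$ is stored as the exponent triple `(i, j, k)`, and that $F$ acts by lowering one multiplicity at a time, weighted by that multiplicity, as in the product rule of Appendix I. The modulus `P = 5` and all function names are hypothetical choices.

```python
P = 5  # assumed prime modulus p (any prime would do for this check)


def add_term(chain, monomial, coeff):
    """Add coeff * monomial to a chain stored as {monomial: coefficient mod P}."""
    value = (chain.get(monomial, 0) + coeff) % P
    if value:
        chain[monomial] = value
    else:
        chain.pop(monomial, None)


def boundary(chain):
    """Boundary F: drop one copy of each vertex in turn, weighted by its multiplicity."""
    out = {}
    for (i, j, k), c in chain.items():  # exponents of a, gamma, b
        if i:
            add_term(out, (i - 1, j, k), c * i)
        if j:
            add_term(out, (i, j - 1, k), c * j)
        if k:
            add_term(out, (i, j, k - 1), c * k)
    return out


def mapping_4_6(chain):
    """The mapping (4.6) on basic chains, with A vacuous."""
    out = {}
    for (i, j, k), c in chain.items():
        if k == 0:
            # a^i gamma^j -> a^(i+j)
            add_term(out, (i + j, 0, 0), c)
        elif i == 0:
            # gamma^j b^k -> a^j gamma^k - gamma^(j+k) + gamma^j b^k
            add_term(out, (j, k, 0), c)
            add_term(out, (0, j + k, 0), -c)
            add_term(out, (0, j, k), c)
        else:
            raise ValueError("a and b never share a simplex of Sigma(ab)")
    return out


def commutes(nu, mu):
    """Check that (4.6) commutes with F on a^nu gamma^mu and on gamma^nu b^mu."""
    for simplex in ({(nu, mu, 0): 1}, {(0, nu, mu): 1}):
        if mapping_4_6(boundary(simplex)) != boundary(mapping_4_6(simplex)):
            return False
    return True
```

Calling `commutes(nu, mu)` for a range of small exponents exercises both cases of (4.6); under the same assumptions, applying `boundary` `P` times to any chain yields the empty chain.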

The homomorphism (4.6) leaves the chains of ${}^*C^n$ unaltered. We show this for the basic chains (4.3) of ${}^*C^n$. In fact we have here

$A(a^\nu\gamma^\mu - \gamma^{\nu+\mu} + \gamma^\nu b^\mu) \to A[a^{\nu+\mu} - a^{\nu+\mu} + a^\nu\gamma^\mu - \gamma^{\nu+\mu} + \gamma^\nu b^\mu]$.

Now let $K^n$ be a $(q, n)$-cycle of ${}^*C^n$ and let $\{K^n\}^*$ and $\{K^n\}$ denote its homology classes in ${}^*H^n_q$ and $H^n_q(\Sigma(ab))$ respectively. Under the homomorphism of the homology groups defined by (4.6) the image of $\{K^n\}$ is $\{K^n\}^*$. Thus the image of this homomorphism is the $(q, n)$-homology group ${}^*H^n_q$ of $\Sigma^*$ itself.

Suppose now that $K^n$ is a $(q, n)$-cycle of $C^n(\Sigma(ab))$ which is mapped by the homomorphism (4.6) into the zero class of ${}^*H^n_q$, i.e. the image cycle ${}^*K^n$ of $K^n$ is a $(p - q)$th boundary of a chain of ${}^*C^{n+p-q}$. By (4.6) the $(q, n)$-cycle $K^n - {}^*K^n$ is a linear combination of basis chains of the form

(4.9) $A(a^\nu\gamma^\mu - a^{\nu+\mu})$, $A(a^\nu\gamma^\mu - \gamma^{\nu+\mu})$.

The sum of the coefficients of $K^n - {}^*K^n$ is therefore zero.

Furthermore, since the chains (4.9) lie in the closure of St $a$ of $\Sigma(ab)$, it follows [Appendix I] that the cycle $K^n - {}^*K^n$ lies in the zero class of its homology group, i.e. is a $(p - q)$th boundary. Since ${}^*K^n$ is a $(p - q)$th boundary, it follows that $K^n$ itself is a $(p - q)$th boundary of a chain of $C^{n+p-q}(\Sigma(ab))$.

Thus the homomorphism of $H^n_q(\Sigma(ab))$ onto ${}^*H^n_q$ is univalent and therefore an isomorphism. Hence, referring to the isomorphism (4.4), the simplicial systems $\Sigma$ and $\Sigma(ab)$ have isomorphic $(q, n)$-homology groups. This isomorphism, which is a combination of the isomorphisms (4.6) and (4.4), is obviously generated by the simplicial mapping of $\Sigma(ab)$ into $\Sigma$ in which $\gamma$ is mapped into $a$ and the other vertices remain unaltered. Collecting the results, we have:

The simplicial mapping of $\Sigma(ab)$ into $\Sigma$, in which $\gamma$ is mapped into $a$ (or $b$) and the other vertices remain unaltered, generates an isomorphism of the $(q, n)$-homology groups.

5. The new homology groups for topological spaces 

The passage from finite simplicial systems to arbitrary topological spaces is similar to the procedure utilized by Čech for ordinary homology groups. We shall suppose the reader familiar with that method, and for details refer him to Čech's initial paper: Théorie générale de l'homologie dans un espace quelconque, Fundam. Math. 19 (1932), 149-183, or to the full exposition of the theory in the forthcoming book by Lefschetz, Algebraic Topology, in the Colloquium Series, Chap. VII, §1. This book will be referred to in the sequel as “L,” and its general terminology will be utilized in the present section. In particular, here also a topological space designates a space which satisfies all but the separation axioms for Hausdorff spaces (L, Ch. I, No. 6).

The general argument in the Čech theory runs as follows: Let $\{U_\lambda\}$ be the finite open coverings of a topological space, $\Phi_\lambda$ the nerve of $U_\lambda$, and if $U_\lambda$ refines $U_\mu$ let $\pi^\lambda_\mu$ be a projection by inclusion $\Phi_\lambda \to \Phi_\mu$ (see L, Ch. VII, 1.3). The projections $\pi^\lambda_\mu$ for given $\lambda$, $\mu$ induce a unique simultaneous homomorphism $\pi^\lambda_{\mu *}$ of the homology groups $H^n_\lambda$ of $\Phi_\lambda$ into the corresponding groups of $\Phi_\mu$, thus giving rise to inverse systems $S^n = \{H^n_\lambda; \pi^\lambda_{\mu *}\}$, and the corresponding groups of the space are defined as $H^n = \lim S^n$.

The same argument may be repeated for the new homology groups. The only step which is new is the explicit proof that $\pi^\lambda_{\mu *}$ is unique. As in (L, Ch. VII, 1.4) any two projections $\pi^\lambda_\mu$, $\pi'^{\lambda}_\mu$ are shown to be “prismatically related” (in the sense of L, Ch. IV, 16.2). The explicit deduction of the uniqueness of $\pi^\lambda_{\mu *}$ from this property is given in Appendix II. As a consequence we shall have here also the inverse system $S^n_q = \{H^n_q(\Phi_\lambda); \pi^\lambda_{\mu *}\}$ and define for the space $H^n_q = \lim S^n_q$.

6. Application to finite polyhedra 

Let $|\Sigma|$ be a finite simplicial polyhedron, where $\Sigma$ is a finite Euclidean complex in the sense of (L, Ch. III, 6.9). We have thus the topological groups $H^n_q(|\Sigma|)$ of the space $|\Sigma|$ in the sense just defined, and also the combinatorial groups $H^n_q(\Sigma)$ of the simplicial system $\Sigma$. The proof of the isomorphism of the two is carried out as in (L, Ch. VIII, 10). The only modification made is the following: Instead of taking the successive barycentric subdivisions we choose successive subdivisions $\Sigma, \Sigma^1, \Sigma^2, \cdots, \Sigma^n, \cdots$ where $\Sigma^{n+1}$ is deduced from $\Sigma^n$ by introducing the “midpoint” $\gamma$ of one of the largest of its one-simplexes $(ab)$.

In the notation of §4 we have $\Sigma^{n+1} = \Sigma^n(ab)$. We have seen in §4 that for the simplicial projection

$\pi^{n+1}_n : \Sigma^{n+1} \to \Sigma^n$,

which leaves unaltered all vertices of $\Sigma^{n+1}$ but $\gamma$, which vertex is mapped into one of the vertices $a$ or $b$, isomorphisms between the new homology groups result. To complete the parallel with the treatment loc. cit. it suffices to observe that the maximal diameter of the simplexes of $\Sigma^n$ converges to zero for $n \to \infty$.

This follows from the fact that in constructing $\Sigma^{n+1}$ from $\Sigma^n$ we choose the midpoint of one of the largest of the one-simplexes of $\Sigma^n$.
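The subdivision rule can be tried on a small geometric example. The sketch below is an illustration only, with hypothetical function names. It assumes a complex is given by its top-dimensional simplexes, each a tuple of vertex coordinates, so that the lower-dimensional face $A\gamma$ of (4.1) is implicit. Each step bisects one longest 1-simplex and replaces every simplex $A(ab)$ by $A(a\gamma)$ and $A(b\gamma)$.

```python
import math
from itertools import combinations


def edges(simplexes):
    """All 1-simplexes of the complex, each as a sorted pair of vertices."""
    return {tuple(sorted(pair)) for s in simplexes for pair in combinations(s, 2)}


def mesh_size(simplexes):
    """Length of the largest 1-simplex, i.e. the maximal diameter of the simplexes."""
    return max(math.dist(a, b) for a, b in edges(simplexes))


def subdivide_longest_edge(simplexes):
    """One elementary subdivision Sigma -> Sigma(ab), with (ab) a longest 1-simplex."""
    a, b = max(edges(simplexes), key=lambda e: math.dist(*e))
    gamma = tuple((x + y) / 2 for x, y in zip(a, b))  # the "midpoint" of (ab)
    refined = []
    for s in simplexes:
        if a in s and b in s:
            rest = tuple(v for v in s if v != a and v != b)  # the simplex A
            refined.append(rest + (a, gamma))  # A(a gamma)
            refined.append(rest + (b, gamma))  # A(b gamma)
        else:
            refined.append(s)
    return refined
```

Starting from the single triangle with vertices $(0,0)$, $(1,0)$, $(0,1)$ and calling `subdivide_longest_edge` repeatedly, `mesh_size` never increases. In this example it drops from $\sqrt{2}$ to $\sqrt{1/2}$ after three steps and to $1/2$ after six.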

We have then the analogue of (L, Ch. VIII, 10.1): 

(6.1) Theorem. The groups $H^n_q(\Sigma)$ of the simplicial system $\Sigma$ are isomorphic with the corresponding groups of the polyhedron $|\Sigma|$. Therefore the $H^n_q(\Sigma)$ are topological invariants of $|\Sigma|$. That is to say, if two polyhedra $|\Sigma|$, $|\Sigma_1|$ are homeomorphic, the corresponding (combinatorial) groups $H^n_q$ are the same.

7. Appendix I 

Let $K^n$ be an $n$-dimensional chain of the simplicial system $\Sigma$, $P$ a vertex of $\Sigma$ and St $P$ the star of $P$ in $\Sigma$.

(7.1) Theorem: If $n \neq q - 1$ then every $(q, n)$-cycle $K^n$ which lies in the closure of St $P$ is a $(p - q)$th boundary; if $n = q - 1$ then a $(q, n)$-cycle $K^n$ which lies in the closure of St $P$ is a $(p - q)$th boundary if and only if the sum of its coefficients is zero (mod $p$).





Before proving this theorem we define the product-chain of the chains

$K_1^\nu = \sum_\lambda a_{1\lambda}\, p_{1\lambda}^\nu$, $K_2^\mu = \sum_\tau a_{2\tau}\, p_{2\tau}^\mu$

by

(7.2) $K_1^\nu K_2^\mu = \sum_{\lambda,\tau} a_{1\lambda} a_{2\tau}\, p_{1\lambda}^\nu p_{2\tau}^\mu$,

where $p_{1\lambda}^\nu p_{2\tau}^\mu$ is the $(\nu + \mu + 1)$-dimensional simplex containing the vertices of both simplexes $p_{1\lambda}^\nu$ and $p_{2\tau}^\mu$, if and only if the product chain (7.2) belongs to $\Sigma$. If $\mu$ and $\nu$ are positive integers, then $F(K_1^\nu)$ and $F(K_2^\mu)$ are defined and

(7.3) $F(K_1^\nu K_2^\mu) = K_2^\mu F(K_1^\nu) + K_1^\nu F(K_2^\mu)$.

We can extend this rule to cover the cases of $\nu \le 0$ or $\mu \le 0$ (i.e. chains of zero or “negative dimensions”)¹ by defining

(7.4) $F(P^0) = 1$, $P^0$ a zero-dimensional simplex,

and

(7.5) $F(1) = 0$, $F(0) = 0$.

Remark: If $q > n$ then $F^q(K^n) = 0$ no longer characterizes the $(q, n)$-cycles (cf. §2), because then every $n$-dimensional chain is a $q$-cycle whereas $F^q(K^n) = 0$ is not always true if $q = n + 1$.

We now return to the proof of (7.1). If $K^n$ is an $n$-dimensional chain which lies in the closure of St $P$, we define for it the (linear) operators $D_\nu$, $\nu = 1, 2, \cdots, p$, by the formula

(7.6) $D_\nu(K^n) = (-1)^{\nu-1}(\nu - 1)!\, P^{p-\nu} K^n$, $\nu = 1, \cdots, p$.

In this formula $0!$ and $P^0$ are to be replaced, as usual, by 1. By (7.3) and (7.6) we have, for $\nu = 1, 2, \cdots, p - 1$,

(7.7) $F D_\nu(K^n) = (-1)^\nu \nu!\, P^{p-\nu-1} K^n + (-1)^{\nu-1}(\nu - 1)!\, P^{p-\nu} F(K^n)$.

Hence

(7.8) $(F D_\nu - D_\nu F) K^n = D_{\nu+1} K^n$, $\nu = 1, \cdots, p - 1$.

Thus, in terms of operators

(7.9) $F D_\nu - D_\nu F = D_{\nu+1}$, $\nu = 1, \cdots, p - 1$.

¹ In this section and the next we shall make use for formal reasons of chains of negative dimensions.

To the chain-groups $C^n$, $n = 0, 1, 2, \cdots$ of §2 we add the chain-groups $C^{-n}$, $n = 1, 2, \cdots$ of chains of negative dimensions. $C^{-1}$ (by definition) is the group of the residue classes modulo $p$ and the groups $C^{-n}$, $n = 2, 3, \cdots$ consist of the zero-element only.

(7.4) and (7.5) define the homomorphisms $C^0 \to C^{-1}$, $C^{-1} \to C^{-2}$. The relation (7.3) then is valid for any $\nu$ and $\mu = 0, \pm 1, \pm 2, \cdots$.



By eliminating $D_2, D_3, \cdots, D_\nu$ from the first $\nu$ of the formulae (7.9), we find

(7.10) $D_{\nu+1} = \sum_{j=0}^{\nu} (-1)^j \binom{\nu}{j} F^{\nu-j} D_1 F^j$.

Thus, for $\nu = 1, 2, \cdots, p - 1$ we have

(7.11) $\left\{\sum_{j=0}^{\nu} (-1)^j \binom{\nu}{j} F^{\nu-j} D_1 F^j\right\} K^n = (-1)^\nu \nu!\, P^{p-\nu-1} K^n$.

In particular, when $\nu = p - 1$,

(7.12) $\left\{\sum_{j=0}^{p-1} (-1)^j \binom{p-1}{j} F^{p-j-1} D_1 F^j\right\} K^n = (p - 1)!\, K^n$.

But $p$ is a prime number so that $(p - 1)! \equiv -1$ and $\binom{p-1}{j} \equiv (-1)^j$ (mod $p$), $j = 0, 1, \cdots, p - 1$. Hence, for every chain $K^n$ in the closure of St $P$,

(7.13) $F^{p-1} D_1 K^n + F^{p-2} D_1 F K^n + \cdots + F D_1 F^{p-2} K^n + D_1 F^{p-1} K^n = -K^n$.
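The two congruences used here, and the resulting identity (7.13), can be spot-checked numerically. The sketch below is an illustration under stated assumptions, not the author's computation. Chains are stored as dictionaries from exponent tuples (vertex multiplicities, with $P$ as the first vertex) to coefficients mod $p$; $F$ lowers one multiplicity at a time, weighted by that multiplicity, consistent with (7.3) to (7.5); and $D_1$ multiplies by $P^{p-1}$ as in (7.6). All function names are hypothetical.

```python
from math import comb, factorial


def congruences_hold(p):
    """(p-1)! = -1 and C(p-1, j) = (-1)^j modulo the prime p."""
    return factorial(p - 1) % p == p - 1 and all(
        (comb(p - 1, j) - (-1) ** j) % p == 0 for j in range(p)
    )


def add_term(chain, monomial, coeff, p):
    """Add coeff * monomial to a chain stored as {monomial: coefficient mod p}."""
    value = (chain.get(monomial, 0) + coeff) % p
    if value:
        chain[monomial] = value
    else:
        chain.pop(monomial, None)


def boundary(chain, p):
    """Boundary F: drop one copy of each vertex in turn, weighted by its multiplicity."""
    out = {}
    for monomial, c in chain.items():
        for idx, mult in enumerate(monomial):
            if mult:
                lowered = monomial[:idx] + (mult - 1,) + monomial[idx + 1:]
                add_term(out, lowered, c * mult, p)
    return out


def d1(chain, p):
    """D_1 K = P^(p-1) K, with P stored as the first vertex."""
    out = {}
    for monomial, c in chain.items():
        add_term(out, (monomial[0] + p - 1,) + monomial[1:], c, p)
    return out


def left_side_7_13(chain, p):
    """Sum over j of F^(p-1-j) D_1 F^j applied to the chain."""
    total = {}
    for j in range(p):
        term = chain
        for _ in range(j):
            term = boundary(term, p)
        term = d1(term, p)
        for _ in range(p - 1 - j):
            term = boundary(term, p)
        for monomial, c in term.items():
            add_term(total, monomial, c, p)
    return total


def negated(chain, p):
    """The chain -K, with coefficients reduced mod p."""
    out = {}
    for monomial, c in chain.items():
        add_term(out, monomial, -c, p)
    return out
```

For a small prime `p` and any chain `K` in this representation, `left_side_7_13(K, p)` should equal `negated(K, p)`, which is the content of (7.13).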


Now let $K^n$ be a $(q, n)$-cycle in the closure of St $P$. If $q \le n$ or $q > n + 1$ then $F^q(K^n) = 0$ by (2.1), (7.4) and (7.5). Hence, by (7.13), if $q \neq n + 1$

(7.14) $-K^n = F^{p-1} D_1 K^n + \cdots + F^{p-q} D_1 F^{q-1} K^n = F^{p-q}\left(F^{q-1} D_1 K^n + \cdots + D_1 F^{q-1} K^n\right)$.

This proves the first assertion of (7.1).

If $q = n + 1$ then by (7.4), $F^q(K^n) = F^{n+1}(K^n)$ is the sum of the coefficients of $F^n(K^n)$. Suppose that

(7.15) $K^n = \sum_\alpha a_\alpha (P_{\alpha_1} \cdots P_{\alpha_{n+1}})$.

Then, by (1.6),

(7.16) $F^n(K^n) = n! \sum_\alpha a_\alpha \sum_{(i_1 \cdots i_n)} (P_{\alpha_1} \cdots P_{\alpha_{n+1}})_{i_1 \cdots i_n}$,

so that the sum of the coefficients of $F^n(K^n)$ is $(n + 1)! \sum_\alpha a_\alpha$. But $n + 1 = q < p$, hence $F^{n+1}(K^n)$ is zero if and only if $\sum_\alpha a_\alpha \equiv 0$ (mod $p$). If this is the case, then (7.14) holds and $K^n$ is a $(p - q)$th boundary. On the other hand if $K^n$ is a $[p - (n + 1)]$th boundary

(7.17) $K^n = F^{p-n-1}(K^{p-1})$

and if

(7.18) $K^{p-1} = \sum_\alpha a_\alpha (P_{\alpha_1} \cdots P_{\alpha_p})$

then, by (1.6),

(7.19) $K^n = (p - n - 1)! \sum_\alpha a_\alpha \sum_{(i_1 \cdots i_{p-n-1})} (P_{\alpha_1} \cdots P_{\alpha_p})_{i_1 \cdots i_{p-n-1}}$

has as the sum of coefficients

(7.20) $(p - n - 1)! \binom{p}{n+1} \sum_\alpha a_\alpha = \frac{p!}{(n+1)!} \sum_\alpha a_\alpha$

and this sum is zero modulo $p$. This completes the proof of (7.1).

8. Appendix II 

Let $\bar\Sigma$ and $\Sigma$ be simplicial systems and let $f$ and $g$ be simplicial mappings of $\bar\Sigma$ into $\Sigma$. Let $\bar A_\nu$, $\nu = 1, 2, \cdots, N$, denote the vertices of $\bar\Sigma$ and let $A_\nu = f(\bar A_\nu)$ and $A'_\nu = g(\bar A_\nu)$.

(8.1) Theorem: If, for every simplex $(\bar A_1 \cdots \bar A_n)$ of $\bar\Sigma$, the simplex $(A_1 \cdots A_n A'_1 \cdots A'_n)$ belongs to $\Sigma$, then $f$ and $g$ determine the same homomorphism of $H^n_q(\bar\Sigma)$ into $H^n_q(\Sigma)$.

In proving this theorem we first order the vertices of $\bar\Sigma$ so that the vertices of every simple simplex receive a certain definite ordering. If the simplex $(\bar A_1 \cdots \bar A_n)$ is not simple, then $i < k$ implies either that $\bar A_i = \bar A_k$ or that $\bar A_i$ precedes $\bar A_k$ in the given ordering of the vertices of $\bar\Sigma$.

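We now define $p$ linear operators $D_0, D_1, \cdots, D_{p-1}$, each one of which maps chains of $\bar\Sigma$ into chains of $\Sigma$. It is sufficient to define these operators for the simplexes of $\bar\Sigma$ and this is done by the formulae

(8.2) $D_r(\bar A_1 \cdots \bar A_n) = (-1)^r r! \left[\sum_{i=1}^{n} (A_1 \cdots A_{i-1} A_i A_i'^{\,p-r-1} A'_{i+1} \cdots A'_n) - \sum_{i=1}^{n} (A_1 \cdots A_{i-1} A_i'^{\,p-r} A'_{i+1} \cdots A'_n)\right]$.

Since we have to apply these operators also to boundaries of chains, we have to extend the above definition to chains $K$ of “negative dimensions,” in which case $D_r(K)$ shall be zero.

If $r < p - 1$ we have

(8.3)
$$\begin{aligned}
F D_r(\bar A_1 \cdots \bar A_n) &= (-1)^{r+1}(r + 1)! \left[\sum_{i=1}^{n} (A_1 \cdots A_i A_i'^{\,p-r-2} \cdots A'_n) - \sum_{i=1}^{n} (A_1 \cdots A_{i-1} A_i'^{\,p-r-1} \cdots A'_n)\right] \\
&\quad + (-1)^r r! \left[\sum_{j=1}^{n} \sum_{i \neq j} (A_1 \cdots A_j A_j'^{\,p-r-1} \cdots A'_n)_i - \sum_{j=1}^{n} \sum_{i \neq j} (A_1 \cdots A_{j-1} A_j'^{\,p-r} \cdots A'_n)_i\right].
\end{aligned}$$

Furthermore

(8.4) $\sum_{i=1}^{n} D_r(\bar A_1 \cdots \bar A_n)_i = D_r \sum_{i=1}^{n} (\bar A_1 \cdots \bar A_n)_i = D_r F(\bar A_1 \cdots \bar A_n)$.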





Hence from (8.2), (8.4)

(8.5) $D_r F(\bar A_1 \cdots \bar A_n) = (-1)^r r! \left[\sum_{j=1}^{n} \sum_{i \neq j} (A_1 \cdots A_j A_j'^{\,p-r-1} \cdots A'_n)_i - \sum_{j=1}^{n} \sum_{i \neq j} (A_1 \cdots A_{j-1} A_j'^{\,p-r} \cdots A'_n)_i\right]$.

This combined with (8.3) gives

(8.6) $(F D_r - D_r F)(\bar A_1 \cdots \bar A_n) = D_{r+1}(\bar A_1 \cdots \bar A_n)$, $r < p - 1$,

which formula also holds for chains of dimension $\le 0$. Hence, just as in Appendix I,

(8.7) $D_r = \sum_{j=0}^{r} (-1)^j \binom{r}{j} F^{r-j} D_0 F^j$, $r < p$.

In particular, when $r = p - 1$ (since $(p - 1)! \equiv -1$, $\binom{p-1}{j} \equiv (-1)^j$ (mod $p$)) we arrive at


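(8.8) $\left(\sum_{j=0}^{p-1} F^{p-j-1} D_0 F^j\right)(\bar A_1 \cdots \bar A_n) = -\sum_{i=1}^{n} (A_1 \cdots A_i A'_{i+1} \cdots A'_n) + \sum_{i=1}^{n} (A_1 \cdots A_{i-1} A'_i \cdots A'_n) = (A'_1 \cdots A'_n) - (A_1 \cdots A_n)$.

Hence for any $(n - 1)$-dimensional chain $\bar K^{n-1}$ of $\bar\Sigma$ we have

(8.9) $\left(\sum_{j=0}^{p-1} F^{p-j-1} D_0 F^j\right) \bar K^{n-1} = K'^{\,n-1} - K^{n-1}$,

where $K^{n-1} = f(\bar K^{n-1})$ and $K'^{\,n-1} = g(\bar K^{n-1})$. Now let $\bar K^{n-1}$ be a $q$-cycle; then

(8.10) $D_0 F^q \bar K^{n-1}, \cdots, D_0 F^{p-1} \bar K^{n-1}$

are all zero since the chains $F^q \bar K^{n-1}, \cdots, F^{p-1} \bar K^{n-1}$ either are zero or are of “negative dimension.”

Hence

(8.11) $K'^{\,n-1} - K^{n-1} = F^{p-q}\left(\sum_{j=0}^{q-1} F^{q-j-1} D_0 F^j \bar K^{n-1}\right)$;

thus $K'^{\,n-1} - K^{n-1}$ is a $(p - q)$-boundary.

Hence $K'^{\,n-1}$ and $K^{n-1}$ determine the same element of $H^{n-1}_q(\Sigma)$. This completes the proof of (8.1).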

Remark: The above operators $D_r$ generalize in an obvious manner the so-called “homotopy-operator” $D$ of (L, Ch. IV, No. 14).


Institute for Advanced Study



Annals of Mathematics
Vol. 43, No. 2, April, 1942


ON THE DIFFERENTIAL EQUATIONS OF THE SIMPLEST 
BOUNDARY-LAYER PROBLEMS 

By Hermann Weyl 
(Received February 10, 1942) 


1. The central boundary-value problem and its hydrodynamic interpretation 

In the theory of viscous fluids the following non-linear boundary-value problem for a function $w(z)$ of a real variable $z \ge 0$ involving two constants $k > 0$ and $\lambda \ge 0$ plays an important part:

$(A_\lambda)$ $w''' + 2ww'' + 2\lambda(k^2 - w'^2) = 0$ for $z \ge 0$; $w(0) = w'(0) = 0$, $w'(z) \to k$ for $z \to \infty$.

We consider $\lambda$ as a given constant, but $k$ as a variable parameter. A mathematically satisfactory proof of its solvability has never been given, although various numerical devices, including V. Bush's differential analyzer, have been set at work on it. We shall here give a complete solution of the problem,¹ first for the two special values $\lambda = 0$ and $\lambda = \frac12$ by a process of alternating approximations, rapidly converging and thus well suited for numerical computations (§§2, 4, 5), and then approach the general case (§§6, 7) by the method of fixed points of transformations in a functional space, which is considerably less amenable to calculation. In between (§3) the first method will be applied to certain boundary-value problems closely related to $(A_0)$.

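There are available two hydrodynamic interpretations of $(A_\lambda)$. Consider first the steady flow of an incompressible viscous fluid of constant density $\rho$ and kinematic viscosity $\epsilon^2$ filling the half $z > 0$ of an $m$-dimensional Euclidean space with the Cartesian coordinates $x_1, \cdots, x_m$ and the cylindrical coordinates

$r = (x_1^2 + \cdots + x_{m-1}^2)^{1/2}$, $z = x_m$.

If cylindrical symmetry prevails and hence the radial ($r$) and vertical ($z$) components $u$, $v$ of velocity as well as the pressure $\rho p$ depend on $r$, $z$ only, then the following differential equations obtain for $z > 0$:

(1)
$$\frac{\partial u}{\partial r} + \frac{m-2}{r}\,u + \frac{\partial v}{\partial z} = 0;$$
$$u\frac{\partial u}{\partial r} + v\frac{\partial u}{\partial z} + \frac{\partial p}{\partial r} = \epsilon^2\left(\Delta u - \frac{m-2}{r^2}\,u\right),$$
$$u\frac{\partial v}{\partial r} + v\frac{\partial v}{\partial z} + \frac{\partial p}{\partial z} = \epsilon^2 \Delta v,$$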


¹ See the author's preliminary notes in Proc. Nat. Acad. Sci. 27, 1941, pp. 579-583, and 28, 1942, pp. 100-102.




where the Laplace operator $\Delta$ is defined by

$\Delta\varphi = \frac{\partial^2\varphi}{\partial r^2} + \frac{m-2}{r}\frac{\partial\varphi}{\partial r} + \frac{\partial^2\varphi}{\partial z^2}$.

These equations are to be combined with the boundary conditions

$u \to 0$, $v \to 0$ for $z \to 0$.

For an ideal fluid, $\epsilon = 0$, we have this simple solution:

$u_0(r, z) = \frac{2k}{m-1}\,r$, $v_0(r, z) = -2kz$,

$p_0 = \text{const.} - \frac12(u_0^2 + v_0^2) = \text{const.} - 2k^2\left[\frac{r^2}{(m-1)^2} + z^2\right]$,

arising from the harmonic velocity potential

$\varphi_0 = k\left(\frac{r^2}{m-1} - z^2\right)$

and involving an arbitrary positive constant $k$. As is necessary, the vertical (though not the radial) component velocity vanishes along the boundary $z = 0$. The Navier-Stokes equations (1) for the viscous fluid possess a solution of the form

$u = \frac{2}{m-1}\,rF'(z)$, $v = -2F(z)$, $p = \text{const.} - 2k^2\left[\left(\frac{r}{m-1}\right)^2 + L(z)\right]$

which approaches the solution $u_0$, $v_0$, $p_0$ for $z \to \infty$. The first equation (1) is identically satisfied, the second and third yield

$\epsilon^2 F''' + 2FF'' + \frac{2}{m-1}(k^2 - F'^2) = 0$

and

(2) $k^2 L' = 2FF' + \epsilon^2 F''$

respectively. Setting $F(\epsilon z) = \epsilon \cdot w(z)$ we obtain the equations $(A_\lambda)$ with $\lambda = 1/(m - 1)$ from which the viscosity constant has disappeared, so that $w$ is independent of $\epsilon$. Equation (2) in integrated form gives

$k^2 L(\epsilon z) = \epsilon^2\{w'(z) + w^2(z)\}$.

Our solution describes approximately the flow of a viscous fluid around an obstacle with a blunt nose in the neighborhood of the forward stagnation point. The cases of physical interest, $m = 2$ and 3, i.e. $\lambda = 1$ and $\frac12$, have been treated by Hiemenz and Homann respectively.²


² Hiemenz, Dingler's Polytech. Jour. 326, 1911, pp. 321-326. F. Homann, Zeitschr. angew. Math. Mech. 16, 1936, p. 163.





Let the subscript $\epsilon$ in $u_\epsilon$, $v_\epsilon$, $p_\epsilon$ indicate dependence of our flow on the viscosity constant $\epsilon$. Certainly $u_\epsilon$, $v_\epsilon$, $p_\epsilon$ tend to $u_0$, $v_0$, $p_0$ with $\epsilon \to 0$ in the region $z > 0$, but the convergence cannot be uniform at the boundary because the viscous fluid adheres, the ideal glides along the wall. Hence we have the phenomenon of a boundary layer of thickness $\epsilon$ in which the velocity rises from 0 at the surface to the external value

$u = u_0(r, 0) = \frac{2k}{m-1}\,r$, $v = v_0(r, 0) = 0$.

Indeed

(3) $u_\epsilon(r, \epsilon z)$, $\frac{1}{\epsilon}\,v_\epsilon(r, \epsilon z)$, $p_\epsilon(r, \epsilon z)$

tend with $\epsilon \to 0$ to the values

$U(r, z) = \frac{2}{m-1}\,rw'(z)$, $V(r, z) = -2w(z)$, $p(r) = p_0(r, 0) = \text{const.} - 2k^2\left(\frac{r}{m-1}\right)^2$.

[As a matter of fact, the first two quantities (3) are independent of $\epsilon$, the last differs from $p(r)$ by the term $2\epsilon^2\{w'(z) + w^2(z)\}$ of order $\epsilon^2$.] According to L. Prandtl, similar circumstances with regard to convergence for $\epsilon \to 0$ are to be expected along the surface of any obstacle immersed in a fluid of viscosity $\epsilon$.

We propose to formulate the two-dimensional boundary layer problem in terms of conformal coordinates $\xi_1$, $\xi_2$ which arise from the Cartesian coordinates $x_1$, $x_2$ by a conformal transformation. Let $u_1$, $u_2$ be the covariant components of velocity with respect to these coordinates $\xi_1$, $\xi_2$ and

$ds^2 = dx_1^2 + dx_2^2 = e\,(d\xi_1^2 + d\xi_2^2)$

the square of the line element. The Navier-Stokes equations assume the form 



We suppose that $u_1$, $u_2$, $p$ with $\epsilon \to 0$ converge to the flow of an ideal fluid arising from a harmonic potential $\varphi$:


dtp/d£i, po = const. 


JLW - const. 





Along with $\varphi$ any multiple $k\varphi$ with a positive constant factor $k$ is equally serviceable, which means that the total strength of the stream may be arbitrarily fixed. Choose $\xi_1$, $\xi_2$ so that a multiple $k\zeta$ of $\zeta = \xi_1 + i\xi_2$ is the complex potential of the limiting flow and let the stream line $\xi_2 = 0$ be the one which forms the boundary. We have good reasons to believe, and this belief is the basis of the boundary-layer theory, that $u_1$, $u_2/\epsilon$ and $p$, when expressed in terms of the arguments $\xi = \xi_1$, $\eta = \xi_2/\epsilon$, tend to limiting functions $U(\xi, \eta)$, $V(\xi, \eta)$, $P(\xi, \eta)$ which satisfy the equations arising from (4), (5) by the same passage to the limit. The second equation (5) then shows that $\partial P/\partial\eta = 0$, and $P(\xi, \eta)$ is therefore independent of $\eta$ and has the value

$P(\xi) = p_0(\xi, 0) = \text{const.} - \frac{k^2}{2e(\xi)}$, $e(\xi) = e(\xi, 0)$,

throughout the boundary layer. Thereafter the two other equations give

(B) $\frac{\partial U}{\partial\xi} + \frac{\partial V}{\partial\eta} = 0$, $U\frac{\partial U}{\partial\xi} + V\frac{\partial U}{\partial\eta} + h(\xi)(k^2 - U^2) = \frac{\partial^2 U}{\partial\eta^2}$,

where

$h(\xi) = \frac12\,\frac{d \log e(\xi)}{d\xi}$.

One has to add the boundary conditions

$(\bar B)$ $U \to 0$, $V \to 0$ for $\eta \to 0$ and $U \to k$ for $\eta \to \infty$.

A full justification of the basic hypothesis of boundary-layer theory will hardly be possible without changing its differential form as given by these equations into a suitable integral form and without proving the existence of a unique solution of the problem $(B, \bar B)$.³

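Because of the first equation (B), the flow $(U, V)$ derives from a stream function $\psi$,

$U = \partial\psi/\partial\eta$, $V = -\partial\psi/\partial\xi$,

satisfying the formidable differential equation

$(B_\psi)$ $\frac{\partial\psi}{\partial\eta}\frac{\partial^2\psi}{\partial\xi\,\partial\eta} - \frac{\partial\psi}{\partial\xi}\frac{\partial^2\psi}{\partial\eta^2} + h(\xi)\left[k^2 - \left(\frac{\partial\psi}{\partial\eta}\right)^2\right] = \frac{\partial^3\psi}{\partial\eta^3}$

and the boundary conditions

$(\bar B_\psi)$ $\psi = 0$, $\frac{\partial\psi}{\partial\eta} = 0$ for $\eta = 0$; $\frac{\partial\psi}{\partial\eta} \to k$ for $\eta \to \infty$.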


³ Experience shows that in general the assumptions of the theory are fulfilled only along a certain frontal part of the surface of the solid. For the whole theory see S. Goldstein, Modern Developments in Fluid Dynamics, vol. I, Oxford, 1938.





Suppose now the obstacle is an angle of $\pi\lambda$ ($0 \le \lambda < 2$) with the origin as vertex and the positive real axis as median. The exterior of the angle is mapped conformally upon the slit $(\xi_1 + i\xi_2)$-plane, the slit extending along the positive real axis, by the analytic function

$x_1 + ix_2 = \text{const.}\,(\xi_1 + i\xi_2)^{1 - \frac12\lambda}$,

and thus one readily finds

(6) $h(\xi) = -\frac{\lambda}{2\xi}$.

The domain for the differential equation $(B_\psi)$ is the quadrant $\xi > 0$, $\eta > 0$. If, more generally, the solid parts the stream symmetrically with a prow of angle $\pi\lambda$ at the origin, then the formula (6) will hold at least approximately in the neighborhood of the forward stagnation point. The problem $(B_\psi, \bar B_\psi)$ with this value of $h(\xi)$ is carried into itself by the transformation

(7) $\xi \to \gamma^2\xi$, $\eta \to \gamma\eta$, $\psi \to \gamma\psi$

($\gamma$ a positive constant). Hence the solution must be of the form

(8) $\psi(\xi, \eta) = 2\sqrt{\xi} \cdot w\!\left(\frac{\eta}{2\sqrt{\xi}}\right)$

and for the function $w(z)$ one obtains exactly the conditions $(A_\lambda)$. The case $\lambda = 0$ where the obstacle consists of the half line $y = 0$, $x \ge 0$ in the $x, y$-plane and the fluid flows by with constant positive velocity $k$ was the first boundary-layer problem to be numerically integrated (Blasius 1907). Arbitrary values of $\lambda$ have been treated by V. M. Falkner, S. W. Skan and D. R. Hartree.⁴ Of the two hydrodynamic interpretations for $(A_\lambda)$ which we have described, the second is applicable to all values of $\lambda$ (at least within the range $0 \le \lambda < 2$), the first to the reciprocal integers $\lambda = 1/(m - 1)$ only. Both coincide for $\lambda = 1$. We notice in particular that the two-dimensional boundary-layer problem of the rectangular prow is mathematically equivalent to the three-dimensional flow against a straight wall ($\lambda = \frac12$).

2. Solution of Blasius’s Problem 

Turning to the solution of our problems, we start with the case $\lambda = 0$, which occupies a singular position inasmuch as the parameter $k$ is absent from its differential equation

(9) $w''' + 2ww'' = 0$.


⁴ H. Blasius, Zeitschr. Math. Phys. 56, 1908, p. 1; V. M. Falkner and S. W. Skan, Phil. Mag. 12, 1931, p. 865; (British) Aero. Res. Comm. R. & M. 1314; D. R. Hartree, Proc. Camb. Phil. Soc. 33, 1937, pp. 223-239.





For any constant $\kappa$ the expression $\kappa \cdot w(\kappa z)$ is a solution of this equation if $w(z)$ is. Following an argument first advanced by Töpfer⁵ let $w = f(z)$ be the solution determined by the initial values

$f(0) = f'(0) = 0$, $f''(0) = 1$.

Once we are sure that $f$ extends over the whole interval $0 \le z < \infty$ and $f'$ tends to a positive limit $\beta$ with $z \to \infty$,

$\beta = \int_0^\infty f''(z)\,dz > 0$,

we may adjust the constant $\kappa$ so as to let the derivative of $w = \kappa f(\kappa z)$ approach $k$ at infinity:

(10) $\kappa^2\beta = k$, $\kappa = (k/\beta)^{1/2}$.

Therefore

(11) $w''(0) = \kappa^3 = \alpha k^{3/2}$, $\alpha = \beta^{-3/2}$.

The value $w''(0)$ is the essential factor in the formula for the skin friction along the immersed plate. Hence skin friction is proportional to the 3/2 power of velocity.

Treat $f$ and $f''$ in the equation

$\frac{df''}{dz} + 2ff'' = 0$

as two separate functions. Because of the initial condition $f''(0) = 1$ one then obtains

$f''(z) = \exp\left(-2\int_0^z f(\zeta)\,d\zeta\right)$.

Introduce $f'' = g$ as the unknown function and, using the initial values $f(0) = f'(0) = 0$, tie up $f$, or rather its integral, with $g$ by two successive partial integrations:

$2\int_0^z f(\zeta)\,d\zeta = \int_0^z (z - \zeta)^2 \cdot f''(\zeta)\,d\zeta$.

The differential equation plus the initial conditions are thus equivalent to the integral equation

(12) $g = \Phi\{g\}$

with the operator

$\Phi\{g\} = \exp\left(-\int_0^z (z - \zeta)^2 g(\zeta)\,d\zeta\right)$.

⁵ Zeitschr. Math. Phys. 60, 1912, pp. 397-398.





Notice the following properties of this operator:

(i) $\Phi\{g\} \ge 0$, (ii) $\Phi\{g\} \ge \Phi\{g^*\}$ if $g \le g^*$.

We are led to define a sequence of successive “approximations” $g_n$, starting with $g_0(z) = 0$, by

$g_{n+1} = \Phi\{g_n\}$ ($n = 0, 1, 2, \cdots$).

The trivial relations $g_1 \ge g_0 = 0$, $g_2 \ge g_0 = 0$, implied in (i), give rise by (ii) to two rows of inequalities, namely

(13) $g_0 \le g_1$, $g_1 \ge g_2$, $g_2 \le g_3$, $g_3 \ge g_4$, $\cdots$

and

(14) $g_0 \le g_2$, $g_1 \ge g_3$, $g_2 \le g_4$, $g_3 \ge g_5$, $\cdots$.

The latter may be rearranged as follows:

$g_0 \le g_2 \le g_4 \le \cdots$ and $g_1 \ge g_3 \ge g_5 \ge \cdots$.

In view of (13) these relations prove that the descending sequence of the odd $g_n$ lies above the ascending sequence of the even $g_n$. Does this “alternating pincer movement” close in on a uniquely determined limit function $g(z)$?

To answer this question, introduce the abbreviation

(15) $G(z) = \int_0^z (z - \zeta)^2 g(\zeta)\,d\zeta$

and let

$0 \le g(z) \le g^*(z)$, $\Delta g = g^* - g$, $\Delta\Phi\{g\} = \Phi\{g^*\} - \Phi\{g\}$.

Since

$0 \le e^{-u} - e^{-v} \le v - u$ if $0 \le u \le v$

we get

$0 \le -\Delta\Phi\{g\} \le \Delta G$.

The increment $\Delta G$ arises from $2 \cdot \Delta g$ by thrice integrating from 0 to $z$. These remarks suffice to establish the inequality

(16) $|g_{n+1}(z) - g_n(z)| \le (2z^3)^n/(3n)!$.

Indeed, because $g_0 = 0$, $g_1 = 1$, it holds for $n = 0$, and since threefold integration changes

$z^{3n}/(3n)!$ into $z^{3n+3}/(3n + 3)!$

the inequality carries over from $n$ to $n + 1$, i.e. from $g_{n+1} - g_n$ to $\Phi\{g_{n+1}\} - \Phi\{g_n\}$. Thus convergence (of the type of the exponential series) is assured by the relation (16), and we obtain a solution

$g(z) = \lim_{n \to \infty} g_n(z)$

of (12) which is larger than the even and smaller than the odd $g_n$.
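The alternating approximations lend themselves to direct computation. The following Python sketch is an illustration under stated assumptions, not the author's procedure: the integral in $\Phi$ is replaced by the trapezoid rule on a uniform grid, the interval is truncated at a hypothetical `z_max = 8`, and the function names, step size and number of iterations are arbitrary choices.

```python
import math


def phi(g, h):
    """Discrete Phi{g}: exp(-int_0^z (z - s)^2 g(s) ds) on the grid z_i = i*h."""
    m0 = m1 = m2 = 0.0  # running trapezoid sums of g, s*g, s^2*g over [0, z_i]
    out = [1.0]  # the integral vanishes at z = 0
    for i in range(1, len(g)):
        s0, s1 = (i - 1) * h, i * h
        m0 += 0.5 * h * (g[i - 1] + g[i])
        m1 += 0.5 * h * (s0 * g[i - 1] + s1 * g[i])
        m2 += 0.5 * h * (s0 * s0 * g[i - 1] + s1 * s1 * g[i])
        z = s1
        # (z - s)^2 = z^2 - 2 z s + s^2, integrated term by term
        out.append(math.exp(-(z * z * m0 - 2.0 * z * m1 + m2)))
    return out


def alternating_iterates(z_max=8.0, h=0.02, steps=14):
    """Return [g_0, g_1, ..., g_steps] with g_0 = 0 and g_{n+1} = Phi{g_n}."""
    n = int(round(z_max / h)) + 1
    iterates = [[0.0] * n]
    for _ in range(steps):
        iterates.append(phi(iterates[-1], h))
    return iterates


def skin_friction_coefficient(g, h):
    """alpha = beta^(-3/2), with beta the trapezoid approximation of int g dz."""
    beta = h * (sum(g) - 0.5 * (g[0] + g[-1]))
    return beta ** -1.5
```

With these defaults the even iterates increase and the odd ones decrease pointwise, and `skin_friction_coefficient(alternating_iterates()[-1], 0.02)` comes out close to the value $\alpha = 0.664$ quoted in the text, up to discretization error.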





Uniqueness is established by the remark that any solution $g(z)$ of (12) satisfies the inequalities

$g \ge g_0$, $g \le g_1$, $g \ge g_2$, $g \le g_3$, $\cdots$

derived from the trivial one $g \ge g_0 = 0$ by iterated application of the operator $\Phi$. Thus $g$ is necessarily caught between the tongs of the even and the odd $g_n$.

Our next concern is the asymptotic behavior of $g(z)$ for $z \to \infty$. Choose any $z_0 > 0$ and set

$\int_0^{z_0} g_2(\zeta)\,d\zeta = c\ (> 0)$.

As $G_2$ arises by two-fold integration from $2\int_0^z g_2(\zeta)\,d\zeta$ we get $G_2(z) \ge c(z - z_0)^2$ and thus

$g(z) \le g_3(z) \le e^{-c(z - z_0)^2}$ for $z \ge z_0$.

Consequently

$\int_0^\infty g(z)\,dz = \beta > 0$, $\int_0^\infty zg(z)\,dz = \beta' > 0$

converge, the asymptotic behavior of

$f(z) = \int_0^z (z - \zeta)g(\zeta)\,d\zeta$

is indicated by

(17) $f(z) \sim \int_0^\infty (z - \zeta)g(\zeta)\,d\zeta = \beta z - \beta'$

and that of $w(z)$ by

(18) $w(z) \sim kz - \frac{\beta'}{\beta^{1/2}}\,k^{1/2}$.

As $g(z) \ge g_2(z) = e^{-z^3/3}$ implies

$\beta \ge \int_0^\infty e^{-z^3/3}\,dz = 3^{-2/3}\,\Gamma(\tfrac13)$,

we find the numerical coefficient $\alpha$ in (11) to be $< 0.684$. According to the most reliable computations⁶ $\alpha = 0.664$. Hence the very first approximate value for $\alpha$ which can be derived from our method misses the mark by not more than 3 per cent.
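The numerical value of this first bound is easily reproduced. The short Python sketch below only evaluates the closed form $3^{-2/3}\,\Gamma(\tfrac13)$ for the lower bound of $\beta$ and the corresponding upper bound $\beta^{-3/2}$ for $\alpha$; the variable names are hypothetical.

```python
import math

# Lower bound for beta: the integral of exp(-z^3/3) over [0, infinity).
beta_lower = 3.0 ** (-2.0 / 3.0) * math.gamma(1.0 / 3.0)

# Since alpha = beta^(-3/2) decreases with beta, this is an upper bound for alpha.
alpha_upper = beta_lower ** -1.5

# Relative excess over the computed value 0.664 quoted in the text.
relative_excess = alpha_upper / 0.664 - 1.0
```

Evaluated in floating point, `alpha_upper` is approximately 0.684, in agreement with the figure in the text.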

Given any positive constant c < B we have seen that, for sufficiently large z, 

«?»(z) £) G t (z) £ cz\ 


⁶ See Töpfer, l.c.⁵, and S. Goldstein, Proc. Camb. Phil. Soc. 26, 1930, pp. 19-20.





By making use of this fact, one can sharpen the upper bound in (16) to

$e^{-cz^2} \cdot (2z^3)^n/(3n)!$.

The maximum value of this function is assumed for $z = (3n/2c)^{1/2}$, and we thus ascertain a constant upper bound for $|g_{n+1}(z) - g_n(z)|$ in the entire interval $z \ge 0$ which is essentially of the order

$\frac{1}{\sqrt{6\pi n}}\,(\mu n)^{-3n/2}$, $\mu \sim 1.8$.

Such sharper estimates are valuable guides for numerical computation.

Knowing a priori that the positive functions $f''$, $f'$, $f$ have the upper bounds $1$, $z$, $\frac12 z^2$ respectively, one could have established the existence (and uniqueness) of the solution $f$ over the entire interval $0 \le z < \infty$ within the frame of the classic theory of differential equations. But as those bounds (and other related estimates) are most easily derived from the integral equation (12), I have preferred to carry the construction through on its basis. For the general case $(A_\lambda)$, $\lambda \neq 0$, I see no other alternative.

J. von Neumann pointed out to me that the differential equation (9) of order 3 must be reducible to one of first order (followed by two quadratures) because it permits the group of transformations

$z \to z + z_0$, $w(z) \to \kappa \cdot w(\kappa z)$

involving two arbitrary constants $z_0$ and $\kappa$, and that thus the problem comes within reach of Poincaré's discussion of first-order differential equations.
Setting 


w = e 


9 


dw 

dz 


= e -2 ’■$($), 



2d = t 


von Neumann obtains the equation 

dt _ t(t -f- d -f* 2) 

dd - t) 

with the initial condition t —* «o for # —> «. After determining t{d) from this 
equation, one finds by quadratures s and then z as functions of d from 


ds = 


dd 



, dd 
6 'd(2d - t) * 


3. Generalization. Power series. Goldstein's wake problem. 

In a trivial manner our method carries over to the equation

(19) $f^{(\nu+1)} + 2ff^{(\nu)} = 0$ ($z \ge 0$)

with the initial conditions

(20) $f = f' = \cdots = f^{(\nu-1)} = 0$, $f^{(\nu)} = 1$ for $z = 0$.



Here $\nu$ may be any positive integer. Setting $f^{(\nu)} = g$ we get the integral equation

$g(z) = \exp\left(-\frac{2}{\nu!}\int_0^z (z - \zeta)^\nu g(\zeta)\,d\zeta\right)$,

and after defining the alternating sequence $g_n(z)$ accordingly, we find instead of (16)

$|g_{n+1}(z) - g_n(z)| \le 2^n z^{n^*}/n^*!$ $[n^* = (\nu + 1)n]$.

We may even generalize the initial conditions (20) to

(21) $f(0) = c_0, \cdots, f^{(\nu-1)}(0) = c_{\nu-1}$, $f^{(\nu)}(0) = 1$

with arbitrary constants $c_\mu$. Then our integral equation reads

$g(z) = \exp\left(-2Q(z) - \frac{2}{\nu!}\int_0^z (z - \zeta)^\nu g(\zeta)\,d\zeta\right)$,

where $Q(z)$ is the polynomial

(22) $Q(z) = \frac{c_0}{1!}\,z + \frac{c_1}{2!}\,z^2 + \cdots + \frac{c_{\nu-1}}{\nu!}\,z^\nu$,

and convergence follows from the inequality

$|g_{n+1}(z) - g_n(z)| \le A^{n+1} \cdot 2^n z^{n^*}/n^*!\ [\le A(2Aa^{\nu+1})^n/n^*!]$

holding in any interval $0 \le z \le a$ in which $e^{-2Q(z)} \le A$. The solution $g(z)$ satisfies the inequality

(23) $0 \le g(z) \le g_1(z) = e^{-2Q(z)}$.

Let us for a moment return to the simple initial conditions (20). From the
lowest case $\nu = 1$ where the solution is an elementary function, namely
$f(z) = \tanh z$, we learn that we must not expect the Taylor expansion of the solution
$f(z)$ around the origin to converge beyond a certain finite limit, which for $\nu = 1$
is reached at the point $z = \pi/2$. I find no indication in the literature that this
had been realized in Blasius's case $\nu = 2$. For any $\nu$ the coefficients $c_n$ of the
power series

$f(z) = \sum_{n=0}^{\infty} (-1)^n c_n z^{n^* + \nu} \qquad [n^* = (\nu + 1)n]$

are determined by the recursive equations $c_0 = 1/\nu!$,

$n^*(n^* + 1) \cdots (n^* + \nu)\, c_n = 2 \sum (k^* + 1) \cdots (k^* + \nu)\, c_i c_k \qquad (i + k = n - 1).$
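
The recursion lends itself to exact rational arithmetic. The following is a
minimal sketch, assuming the recursion reads as stated in the docstring; the
function names `weyl_coefficients` and `radius_estimate` are illustrative and
not taken from the paper. For $\nu = 1$ the coefficients should reproduce the
Taylor coefficients of $\tanh z$, and the ratio of consecutive coefficients
gives a rough estimate of $R^{\nu+1}$.

```python
from fractions import Fraction
from math import factorial


def rising(start, length):
    """Product start * (start + 1) * ... with `length` factors."""
    out = 1
    for j in range(length):
        out *= start + j
    return out


def weyl_coefficients(nu, count):
    """Return c_0 .. c_{count-1} from the recursion
        n*(n*+1)...(n*+nu) c_n = 2 * sum_{i+k=n-1} (k*+1)...(k*+nu) c_i c_k,
    with n* = (nu+1) n, k* = (nu+1) k and c_0 = 1/nu!."""
    c = [Fraction(1, factorial(nu))]
    for n in range(1, count):
        n_star = (nu + 1) * n
        total = Fraction(0)
        for k in range(n):
            i = n - 1 - k
            k_star = (nu + 1) * k
            total += rising(k_star + 1, nu) * c[i] * c[k]
        c.append(2 * total / rising(n_star, nu + 1))
    return c


def radius_estimate(nu, count=40):
    """Ratio test: c_{n-1}/c_n approximates R^(nu+1) for large n."""
    c = weyl_coefficients(nu, count)
    return float(c[-2] / c[-1]) ** (1.0 / (nu + 1))


# nu = 1: coefficients of tanh z = z - z^3/3 + 2 z^5/15 - 17 z^7/315 + ...
print(weyl_coefficients(1, 4))
print(radius_estimate(1), radius_estimate(2))
```

The ratio test is only a heuristic for the radius; it is used here because the
signs alternate regularly and the coefficients are available exactly.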

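Following the same straightforward procedure as in my first note in the
Proceedings, we obtain

$\dfrac{1}{\nu!} \left(\dfrac{2 \cdot \nu!}{(2\nu + 1)!}\right)^n \le c_n \le \dfrac{1}{\nu!} \left(\dfrac{2}{(\nu + 1) \cdot (\nu + 1)!}\right)^n$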




and thus for the radius $R$ of convergence the bounds

$\tfrac{1}{2}(\nu + 1) \cdot (\nu + 1)! \le R^{\nu+1} \le \tfrac{1}{2}(\nu + 1) \cdots (2\nu + 1).$

The essential difference between the flow of a viscous fluid before and behind
an obstacle is clearly exhibited in our problem $(A_0)$ by the fact that no solution
$w$ exists if $w'$ is required to assume a negative value $k$ for $z \to \infty$.

However, S. Goldstein 7 has treated the wake behind a flat plate under the
plausible hypothesis that the flow up to the abscissa $x = l$ is but little modified
if the plate, $y = 0$, $x \ge 0$, ends at this point. We shift the origin of the
coordinates to the end of the plate and at the same time enlarge the standard length
at the ratio $l^{1/4} : 1$, i.e. in our stream function

$\psi(x, y) = 2\sqrt{x} \cdot w(y/2\sqrt{x})$

we make the substitution

$x = l + l^{1/4} \cdot \xi, \qquad y = l^{1/4} \cdot \eta.$

We then obtain for $\xi = 0$:

$\psi = \varphi(\eta) + \cdots, \qquad \varphi(\eta) = \tfrac{1}{4} a k^{3/2} \cdot \eta^2.$

The remainder indicated by the dots tends to zero with $l \to \infty$ and shall be
neglected as is permissible for plates of great length. The stream function
$\psi(\xi, \eta)$ of the "wake layer" behind the plate, $\xi > 0$, satisfies the same differential
equation as before

$\dfrac{\partial^3 \psi}{\partial \eta^3} + \dfrac{\partial \psi}{\partial \xi} \dfrac{\partial^2 \psi}{\partial \eta^2} - \dfrac{\partial \psi}{\partial \eta} \dfrac{\partial^2 \psi}{\partial \xi\, \partial \eta} = 0,$

while symmetry requires $\psi(\xi, -\eta) = -\psi(\xi, \eta)$. Hence under limitation to the
half plane $\eta \ge 0$ the conditions at the fictitious boundary $\eta = 0$ become
$\psi = \partial^2 \psi/\partial \eta^2 = 0$. We wish to construct that solution which for fixed $\eta$ and $\xi \to 0$
(or for fixed $\xi$ and $\eta \to \infty$) ties up with our function $\varphi(\eta)$. The problem,
including this boundary condition, permits the substitution

$\xi \to \gamma^3 \cdot \xi, \qquad \eta \to \gamma \cdot \eta, \qquad \psi \to \gamma^2 \cdot \psi$

with an arbitrary constant $\gamma$ and must thus be of the form

$\psi(\xi, \eta) = 3\xi^{2/3} \cdot w(\eta/\xi^{1/3}).$

For $w(z)$ one readily obtains the differential equation

(24) $w''' + 2ww'' - w'^2 = 0.$

The boundary conditions are: $w = w'' = 0$ at $z = 0$, and

(25) $w''(z) \to \tfrac{1}{6} a k^{3/2}$ for $z \to \infty$.


7 Proc. Camb. Phil. Soc. 26, 1930, pp. 18-30. 





If $w(z)$ is a solution of (24), so is the function $\kappa \cdot w(\kappa z)$ involving an arbitrary
constant $\kappa$. Let $w = f(z)$ be the solution of (24) with the initial values $f = 0$,
$f' = 1$, $f'' = 0$ for $z = 0$. Then $f'''(0) = 1$ while differentiation changes (24)
into

(26) $w'''' + 2ww''' = 0.$

Thus we find ourselves confronted with the case $\nu = 3$; $c_0 = 0$, $c_1 = 1$, $c_2 = 0$
of the general problem (19) + (21) discussed above, and since (23) now reads

$0 \le f'''(z) = g(z) \le e^{-z^2},$

$f''(z)$ tends with $z \to \infty$ to a positive limit

$0 < \int_0^\infty g(z)\, dz < \tfrac{1}{2}\sqrt{\pi}.$

Consequently we may adjust the constant $\kappa$ in $w(z) = \kappa \cdot f(\kappa z)$ so as to give
$w''(\infty)$ the desired value (25). 7a

4. Solution of Homann’s problem 

Of a more difficult type is the problem $(A_\lambda)$ for $\lambda \neq 0$, as it involves the
parameter $k$ in the boundary conditions as well as in the differential equation
itself. Here we are dealing with a real boundary-value problem, which is not
reducible, as $(A_0)$ is, to an initial-value problem. Only convergence of the type
of a geometric series, if any, can be expected for the process of successive
approximations. By differentiating the differential equation we eliminate the
parameter $k$:

(27) $w'''' + 2ww''' + 2(1 - 2\lambda) w'w'' = 0.$

This equation is again invariant under the transformation $w(z) \to \kappa \cdot w(\kappa z)$. For
$\lambda = \tfrac{1}{2}$, the case with which we shall be concerned in the next two sections, we
fall back upon the familiar type (26), although the boundary conditions make
our problem considerably more intricate than before. Let $f$ denote that
solution of (26) for which

$f(0) = f'(0) = 0, \qquad f''(0) = 1, \qquad f'''(0) = -\beta^2.$

It will satisfy the third-order equation

$f''' + 2ff'' + (\beta^2 - f'^2) = 0,$

and we expect that, for a certain positive $\beta$, the derivative $f''$ (and $f'''$) will
strongly approach 0 with $z \to \infty$. Thus the equation itself forces $f'$ (which is
positive throughout the interval) to approach $\beta$, and $w = \kappa \cdot f(\kappa z)$ will solve our
problem if $\kappa$ is determined by (10).

7a A remark by K. Friedrichs to the effect that the assumptions $\nu = 3$, $2Q(z) = -z^2$
lead to a wake with back flow caused me to drop the restriction $Q(z) \ge 0$ for $z \ge 0$ which
the original MS contained. April 4, 1942.


As before we obtain first

$f'''(z) = -\beta^2 \cdot \exp\left(-2 \int_0^z f(\zeta)\, d\zeta\right) = -\beta^2 \cdot \exp\left(-\int_0^z (z - \zeta)^2 f''(\zeta)\, d\zeta\right) = -\beta^2 \cdot e^{-G(z)}$

and then for $f'' = g$ the equation

$g(z) = 1 - \beta^2 \cdot \int_0^z e^{-G(\zeta)}\, d\zeta.$

The constant $\beta^2$ is determined by the condition $g(\infty) = 0$, thus

$\beta^2 = 1 \Big/ \int_0^\infty e^{-G(\zeta)}\, d\zeta.$

Adhering to the notation (15) we are led to introduce an operator $\Psi$ which
produces from any given $g(z)$ the function

(28) $\Psi\{g\} = \int_z^\infty e^{-G(\zeta)}\, d\zeta \Big/ \int_0^\infty e^{-G(\zeta)}\, d\zeta$

(provided the integral $\int_0^\infty$ converges). Evidently

$0 < \Psi\{g\} \le 1,$

and as we shall presently prove,

(29) $\Psi\{g\} \ge \Psi\{g^*\}$ if $g \le g^*$.

The operator $\Psi$ is applicable to the function $g(z) = 1$ but not to $g(z) = 0$. 8
In order to solve the functional equation

(30) $g = \Psi\{g\}$

we therefore construct a sequence of functions $g_n(z)$ by the recursive equation

$g_{n+1} = \Psi\{g_n\} \qquad (n = 1, 2, \cdots)$

starting with $g_1(z) = 1$, in the hope that the sequence will converge to a solution
$g$ of (30). Alternating pincer movement of the $g_n$ is a consequence of (29) and
the trivial inequalities $g_2 \le g_1$, $g_3 \le g_1$.
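
The pincer movement can be observed numerically. The following is a minimal
sketch, not the author's procedure: it assumes that $G$ is fixed by $G''' = 2g$,
$G(0) = G'(0) = G''(0) = 0$, replaces the infinite interval by $0 \le z \le z_{max}$,
and uses the trapezoidal rule on a uniform grid. The names `psi`, `iterate`,
`z_max`, `h` and `steps` are illustrative choices.

```python
import math


def cumulative_trapezoid(values, h):
    """Running trapezoidal integral of sampled values, starting at 0."""
    out = [0.0]
    for i in range(1, len(values)):
        out.append(out[-1] + 0.5 * h * (values[i - 1] + values[i]))
    return out


def psi(g, h):
    """One application of the operator (28) on a truncated grid.

    Returns the samples of Psi{g} and the value 1 / int exp(-G), which
    plays the part of beta^2 for the input g. The cut-off at the last grid
    point is an assumption; exp(-G) is negligible there in the cases tried.
    """
    G2 = cumulative_trapezoid([2.0 * v for v in g], h)  # G''
    G1 = cumulative_trapezoid(G2, h)                    # G'
    G0 = cumulative_trapezoid(G1, h)                    # G
    weight = [math.exp(-v) for v in G0]
    H1 = cumulative_trapezoid(weight, h)                # integral from 0 to z
    H = H1[-1]                                          # integral over the whole grid
    return [(H - v) / H for v in H1], 1.0 / H


def iterate(z_max=8.0, h=0.01, steps=40):
    """Return [g_1, g_2, ...], the last beta^2 and the grid width."""
    n = int(round(z_max / h)) + 1
    g = [1.0] * n                                       # g_1 = 1
    history = [g]
    beta2 = None
    for _ in range(steps):
        g, beta2 = psi(g, h)
        history.append(g)
    return history, beta2, h


history, beta2, h = iterate()
last_change = max(abs(u - v) for u, v in zip(history[-1], history[-2]))
print("beta^2 ~", beta2, " last change:", last_change)
```

On the grid the ordering $g_2 \le g_4 \le \cdots \le g_3 \le g_1$ holds exactly, because
the trapezoidal weights are positive and the proof of (29) carries over to sums.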

To prove (29), set as before $g^* - g = \Delta g$. The third derivative of $\Delta G = G^* - G$
is $2 \cdot \Delta g$, and $\Delta G$ and its first two derivatives vanish for $z = 0$. Hence


8 Application of the operator $\Psi$ becomes unrestricted if one replaces the definition (28) by



Then one may start with $g_0(z) = 0$ and find $g_1(z) = 1$. Cf. §6.





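$\Delta g \ge 0$ implies $(\Delta G)''$ to be an increasing function of $z$, and as it vanishes for
$z = 0$ it must be positive throughout. Repeating this argument two more
times, we find that $\Delta G(z)$ is an increasing positive function for $z > 0$. Set

$\int_0^z e^{-G(\zeta)}\, d\zeta = H_1, \qquad \int_z^\infty e^{-G(\zeta)}\, d\zeta = H_2,$

so that

$\Psi\{g\} = H_2/(H_1 + H_2).$

We then have

$H_1^* = \int_0^z e^{-G^*(\zeta)}\, d\zeta = \int_0^z e^{-G(\zeta)} \cdot e^{-\Delta G(\zeta)}\, d\zeta \ge e^{-\Delta G(z)} \cdot H_1,$

$H_2^* = \int_z^\infty e^{-G^*(\zeta)}\, d\zeta = \int_z^\infty e^{-G(\zeta)} \cdot e^{-\Delta G(\zeta)}\, d\zeta \le e^{-\Delta G(z)} \cdot H_2,$

or the ratios $\vartheta_1 = H_1^*/H_1$, $\vartheta_2 = H_2^*/H_2$ satisfy the inequalities ($\vartheta_1 \le 1$, $\vartheta_2 \le 1$),
$\vartheta_2 \le \vartheta_1$. Consequently

$H_2^*/(H_1^* + H_2^*) \le H_2/(H_1 + H_2)$ or $\Psi\{g^*\} \le \Psi\{g\}$.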


Again choose a $z_0 > 0$ and set

$\int_0^{z_0} g_2(\zeta)\, d\zeta = c > 0$

so that $G_2(z) \ge c(z - z_0)^2$ and

$g_3(z) \le \text{const.} \int_z^\infty e^{-c(\zeta - z_0)^2}\, d\zeta \qquad$ for $z \ge z_0$.

All following $g$'s are smaller than $g_3$ and hence the same inequality prevails for
$g_n(z)$ $(n \ge 3)$ and, provided the limit $\lim_{n \to \infty} g_n(z) = g(z)$ exists, also for $g(z)$.
Thus knowing that $g(z) = f''(z)$ converges strongly enough to 0 with $z \to \infty$
we again get the asymptotic formula (17) and (11), (18) for the solution

$w(z) = \kappa \cdot f(\kappa z), \qquad \kappa = (k/\beta)^{1/2}$

of our problem $(A_{1/2})$.

In proving convergence of the alternating sequence $g_n(z)$ we use the above
notations $g(z) \le g^*(z)$, $\Delta g$, $G$ etc., and write $H = H_1 + H_2$. Then

$-\Delta\Psi = \Psi\{g\} - \Psi\{g^*\} = \dfrac{H_2}{H} - \dfrac{H_2^*}{H^*} \le \dfrac{H_2 - H_2^*}{H} \le \dfrac{H - H^*}{H}.$

Thus

$-\Delta\Psi \le \int_0^\infty e^{-G(\zeta)} \left(1 - e^{-\Delta G(\zeta)}\right) d\zeta \Big/ \int_0^\infty e^{-G(\zeta)}\, d\zeta.$


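Suppose we have a constant $\mu$ such that

$0 \le \Delta g(z) \le \mu.$

Then

$1 - e^{-\Delta G(\zeta)} \le \Delta G(\zeta) \le \tfrac{1}{3} \mu \zeta^3$

and hence

$0 \le -\Delta\Psi \le q \cdot \mu$

where

$q = \int_0^\infty e^{-G(z)} \cdot \tfrac{1}{3} z^3\, dz \Big/ \int_0^\infty e^{-G(z)}\, dz.$

Another expression for the quotient $q$ is

$q = \int_0^\infty \Psi\{g\}(z) \cdot z^2\, dz$

as one verifies by substituting (28) for $\Psi\{g\}$.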

In this argument we can choose $g = g_n$ and $g^* = g_{n+1}$ or $g_{n-1}$ for any even
$n \ge 2$ and then we obtain majorizing constants $\mu_n$ for all odd and even $n \ge 1$,

(31) $|g_{n+1}(z) - g_n(z)| \le \mu_n,$

which are defined by the recursive equations

(32) $\mu_1 = 1; \qquad \mu_n = q_n \cdot \mu_{n-1}, \quad \mu_{n+1} = q_n \cdot \mu_n \qquad (n \text{ even})$

with

$q_n = \int_0^\infty e^{-G_n(z)} \cdot \tfrac{1}{3} z^3\, dz \Big/ \int_0^\infty e^{-G_n(z)}\, dz = \int_0^\infty g_{n+1}(z) \cdot z^2\, dz.$

The second expression of $q_n$ shows that the constants $q_n$ perform a pincer
movement of the same type as the functions $g_n(z)$, and hence all $q_n$ lie between $q_1$
and $q_2$. Evaluating by partial integration the integral in the numerator of

$q_1 = \int_0^\infty e^{-z^3/3} \cdot \tfrac{1}{3} z^3\, dz \Big/ \int_0^\infty e^{-z^3/3}\, dz,$

namely

$-\tfrac{1}{3} \int_0^\infty z \cdot d\, e^{-z^3/3},$

we find $q_1 = \tfrac{1}{3}$. By some rough estimates it is proved in §5 that $q_2 < 0.76$;
but the value of $q_2$ is probably not much larger than $q_1 = 0.33$. Once we are
sure that $q_2 < 1$ we see from (32) and $q_n \le q_2$ that the sequence $g_n(z)$ converges
at least as strongly as a geometric series of quotient $q_2$.





Uniqueness is assured since every solution $g$ of the equation $g = \Psi\{g\}$ is
necessarily sandwiched in between the odd and even $g_n$.

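5. Proof that $q_2 < 1$

We use the constants

$A = \int_0^\infty z \cdot e^{-z^3/3}\, dz = 3^{-1/3} \cdot \Gamma(\tfrac{2}{3}),$

$B = \int_0^\infty e^{-z^3/3}\, dz = 3^{-2/3} \cdot \Gamma(\tfrac{1}{3}) = 1.288.$

Their product is

$AB = \tfrac{1}{3} \Gamma(\tfrac{1}{3}) \Gamma(\tfrac{2}{3}) = \dfrac{2\pi}{3\sqrt{3}}.$

$q_2$ is defined as the quotient

$\int_0^\infty e^{-G_2(z)} \cdot \tfrac{1}{3} z^3\, dz \Big/ \int_0^\infty e^{-G_2(z)}\, dz.$

Since

$G_2(z) < G_1(z) = \tfrac{1}{3} z^3$

the denominator is greater than $B$. Let us split the integral of the numerator
into the parts

$\int_0^2 + \int_2^\infty$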

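and employ in the first part the initial terms of the power series of $G_2(z)$, in the
second part an asymptotic appraisal. We readily find

(33) $G_2(z) = \tfrac{1}{3} z^3 - \dfrac{1}{B} \int_0^z \tfrac{1}{3}(z - \zeta)^3 \cdot e^{-\zeta^3/3}\, d\zeta$

and, because $e^{-\zeta^3/3} \le 1$,

$G_2(z) > \tfrac{1}{3} z^3 - \dfrac{1}{12B} z^4$

and thus, as long as $z \le 2$,

$G_2(z) > c \cdot \tfrac{1}{12} z^4$ with $c = 2 - 1/B$.

Consequently

$\int_0^2 e^{-G_2(z)} \cdot \tfrac{1}{3} z^3\, dz < \int_0^2 e^{-cz^4/12} \cdot \tfrac{1}{3} z^3\, dz = \int_0^2 e^{-cz^4/12} \cdot d\, \dfrac{z^4}{12} = \dfrac{1}{c}\left(1 - e^{-4c/3}\right) = 0.6574.$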





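To find an asymptotic estimate write (33) in the form

$B \cdot G_2(z) = \int_0^\infty \tfrac{1}{3}\left[z^3 - (z - \zeta)^3\right] e^{-\zeta^3/3}\, d\zeta - \int_z^\infty \tfrac{1}{3}(\zeta - z)^3 \cdot e^{-\zeta^3/3}\, d\zeta.$

The first term

$= Az^2 - z + \tfrac{1}{3} B;$

the integral of the second term is changed by the substitution $\zeta \to \zeta + z$ into

$\int_0^\infty \tfrac{1}{3} \zeta^3 \cdot e^{-(\zeta + z)^3/3}\, d\zeta \le e^{-z^3/3} \cdot \int_0^\infty \tfrac{1}{3} \zeta^3 \cdot e^{-z^2 \zeta}\, d\zeta = 2z^{-8} e^{-z^3/3}.$

Hence

(34) $G_2(z) \ge \dfrac{A}{B} z^2 - \dfrac{1}{B} z + \left(\tfrac{1}{3} - \dfrac{1}{128B} e^{-8/3}\right) \qquad$ for $z \ge 2$.

Set

$2\sqrt{AB} = 1/b$, i.e., $(2b)^4 = 27/(2\pi)^2$;

$x = \sqrt{A/B} \cdot z - b, \qquad \tfrac{1}{3} - b^2 - \dfrac{1}{128B} e^{-8/3} = B',$

so that the right side of (34) equals $x^2 + B'$. Then, $x_0$ denoting the value of $x$
for $z = 2$,

$\int_2^\infty e^{-G_2(z)} \cdot \tfrac{1}{3} z^3\, dz \le \int_2^\infty e^{-(x^2 + B')} \cdot \tfrac{1}{3} z^3\, dz = \tfrac{1}{3}(2Bb)^4 e^{-B'} \cdot \int_{x_0}^\infty e^{-x^2} (x + b)^3\, dx.$

Developing

$(x + b)^3 = x^3 + 3x^2 b + 3xb^2 + b^3$

one readily finds

$\int_{x_0}^\infty e^{-x^2} (x + b)^3\, dx = \tfrac{1}{2} e^{-x_0^2} (x_0^2 + 1 + 3bx_0 + 3b^2) + b(\tfrac{3}{2} + b^2) \int_{x_0}^\infty e^{-x^2}\, dx.$

But

$\int_{x_0}^\infty e^{-x^2}\, dx \le \dfrac{1}{2x_0} \int_{x_0}^\infty e^{-x^2}\, d(x^2) = \dfrac{1}{2x_0} e^{-x_0^2}.$

Hence

$\int_2^\infty e^{-G_2(z)} \cdot \tfrac{1}{3} z^3\, dz \le \tfrac{1}{6}(2Bb)^4 e^{-(B' + x_0^2)} \cdot \left\{x_0^2 + 1 + 3bx_0 + 3b^2 + \dfrac{b}{x_0}(\tfrac{3}{2} + b^2)\right\} = 0.3171.$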





The numerator of $q_2$ turns out to be

$< 0.6574 + 0.3171 = 0.9745$

and $q_2$ itself

$< 0.9745/B < 0.76.$
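
The numerical constants of this section are easily re-evaluated. The sketch
below is an added check, not part of the original computation. It assumes the
readings $c = 2 - 1/B$, $b = 1/(2\sqrt{AB})$, $x_0 = 2\sqrt{A/B} - b$ and
$B' = \tfrac{1}{3} - b^2 - e^{-8/3}/(128B)$ for the quantities used above; the variable
names are illustrative.

```python
import math

A = 3 ** (-1 / 3) * math.gamma(2 / 3)   # integral of z * exp(-z^3/3) over z >= 0
B = 3 ** (-2 / 3) * math.gamma(1 / 3)   # integral of exp(-z^3/3) over z >= 0

# First part of the numerator, 0 <= z <= 2.
c = 2 - 1 / B
part1 = (1 - math.exp(-4 * c / 3)) / c

# Second part of the numerator, z >= 2 (assumed reading of the estimate).
b = 1 / (2 * math.sqrt(A * B))
x0 = 2 * math.sqrt(A / B) - b           # value of x at z = 2
B_prime = 1 / 3 - b * b - math.exp(-8 / 3) / (128 * B)
bracket = x0 ** 2 + 1 + 3 * b * x0 + 3 * b * b + (b / x0) * (1.5 + b * b)
part2 = (2 * B * b) ** 4 / 6 * math.exp(-(B_prime + x0 ** 2)) * bracket

bound_q2 = (part1 + part2) / B
print("A*B =", A * B, " 2*pi/(3*sqrt(3)) =", 2 * math.pi / (3 * math.sqrt(3)))
print("B =", B, " part1 =", part1, " part2 =", part2, " bound =", bound_q2)
```

With double precision the first part agrees with 0.6574 and the second part
comes out near 0.317, so that the bound for $q_2$ stays below 0.76.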


6. Set-up for arbitrary $\lambda$

Enriched by the experience gathered in the cases $\lambda = 0$ and $\tfrac{1}{2}$ we now make
bold to attack $(A_\lambda)$ for arbitrary $\lambda \ge 0$. We seek the solution in the form
$w(z) = \kappa \cdot f(\kappa z)$ where

(35) $f'''' + 2ff''' + 2(1 - 2\lambda) f'f'' = 0$ (for $z \ge 0$);
     $f(0) = f'(0) = 0, \quad f''(0) = 1; \quad f''(\infty) = 0,$

and introduce $f'' = g = \varphi$ as the unknown function. Hence we start with this
set-up:

(36) $f(z) = \int_0^z (z - \zeta) g(\zeta)\, d\zeta.$

(37) $\varphi'' + 2f\varphi' + 2(1 - 2\lambda) f'\varphi = 0,$

(37*) $\varphi(0) = 1, \quad \varphi(\infty) = 0.$

(38) $\varphi = g.$

More explicitly: for an arbitrarily given function $g$ we form (36) and then solve
the linear boundary value problem (37) + (37*), thus defining the functional
operator $\Phi_\lambda$ carrying $g$ into $\varphi$; at the last step (38) we ask for a fixed element $g$
of that operator,

(39) $g = \Phi_\lambda\{g\}.$

In proving the unique existence of $\varphi$ we shall fix the precise meaning of the
boundary condition $\varphi(\infty) = 0$. (The whole discussion would turn out a bit
simpler if we dealt with a finite interval $0 \le z \le a$ instead.)

Auxiliary Theorem. If $g(z)$ is any continuous non-negative function and

$f'(z) = \int_0^z g(\zeta)\, d\zeta, \qquad f(z) = \int_0^z f'(\zeta)\, d\zeta = \int_0^z (z - \zeta) g(\zeta)\, d\zeta,$

then (37) has a unique solution with the properties

(40) $\varphi(0) = 1; \quad \varphi(z) \ge 0, \quad \varphi' + 2f\varphi \le 0$ (for $z \ge 0$).

Proof. Set

$p(z) = \exp\left(2 \int_0^z f(\zeta)\, d\zeta\right)$





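so that $p' = 2fp$ and $p(z) \ge 1$, and introduce the auxiliary function

$\varphi_1 = (p\varphi)' = p(\varphi' + 2f\varphi).$

Then

$(\varphi_1/p)' = \varphi'' + 2f\varphi' + 2f'\varphi = 4\lambda f'\varphi,$

and the single equation (37) for $\varphi$ is replaced by the system

(41) $\dfrac{d\varphi}{dz} + 2f\varphi = \dfrac{1}{p}\, \varphi_1, \qquad \dfrac{d\varphi_1}{dz} - 2f\varphi_1 = 4\lambda p f' \cdot \varphi.$

It defines an infinitesimal linear transformation in two variables $\varphi$, $\varphi_1$ with the
matrix (of vanishing trace)

$\Theta(z) = \begin{pmatrix} -2f, & 1/p \\ 4\lambda p f', & 2f \end{pmatrix}.$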

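Hence the two solutions $(\varphi, \varphi_1) = (\eta, \eta_1)$ and $(\vartheta, \vartheta_1)$ satisfying the initial
conditions

$\eta = 1, \ \eta_1 = 0; \qquad \vartheta = 0, \ \vartheta_1 = 1 \qquad$ (for $z = 0$)

are given by the formula

(42) $\begin{pmatrix} \eta, & \vartheta \\ \eta_1, & \vartheta_1 \end{pmatrix} = \sum_{n=0}^{\infty} \int \cdots \int \Theta(z_1) \cdots \Theta(z_n)\, dz_1 \cdots dz_n \qquad (z \ge z_1 \ge \cdots \ge z_n \ge 0)$

(where the term $n = 0$ of the series at the right is understood to be the unit
matrix). Multiplication of (41) by $\varphi_1$, $\varphi$ respectively, followed by addition and
integration, establishes the fundamental relation

(43) $[\varphi\varphi_1]_0^z = \int_0^z \left(\dfrac{1}{p}\, \varphi_1^2 + 4\lambda p f' \varphi^2\right) dz.$

The fact that $f' \ge 0$ guarantees the positive definite character of the "Dirichlet
integral" at the right side.

Apply (43) to $\vartheta$:

$\vartheta\vartheta_1 = \int_0^z \left(\dfrac{1}{p}\, \vartheta_1^2 + 4\lambda p f' \vartheta^2\right) dz > 0 \qquad$ for $z > 0$.

This shows (1) that $\vartheta_1$ never vanishes, therefore never changes sign and, because
of $\vartheta_1(0) = 1$, stays positive throughout; and (2) that $\vartheta$ has the same sign as $\vartheta_1$.
In the same manner we find that $\eta\eta_1$, $\eta$, $\eta_1$ (in this order) are all positive,

$\vartheta > 0, \ \vartheta_1 > 0; \qquad \eta > 0, \ \eta_1 > 0 \qquad$ for $z > 0$.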





Next consider a finite interval $0 \le z \le a$ $(a > 0)$ and determine that
solution $\varphi$ for which $\varphi(a) = 0$, $\varphi_1(a) = -1$. Again we see from

$\varphi\varphi_1 = -\int_z^a \left(\dfrac{1}{p}\, \varphi_1^2 + 4\lambda p f' \varphi^2\right) dz < 0$

that $\varphi_1$ is negative and $\varphi$ positive throughout the interval $0 \le z < a$. In
particular, $\varphi(0) > 0$, so that we can divide by $\varphi(0)$, thus constructing a solution
$\varphi^{(a)}$ with the boundary values

$\varphi^{(a)}(0) = 1, \qquad \varphi^{(a)}(a) = 0.$

It satisfies the inequalities

$\varphi^{(a)} > 0, \qquad \varphi_1^{(a)} < 0 \qquad$ for $0 \le z < a$.

Clearly $\varphi^{(a)}(z)$ is of the form $\eta(z) - l_a \cdot \vartheta(z)$ with a constant $l_a$ for which we find
the positive value $\eta(a)/\vartheta(a)$. Let $a < b$ and write

$\varphi^{(b)}(z) = \varphi^{(a)}(z) + (l_a - l_b) \cdot \vartheta(z).$

Then

$l_a - l_b = \varphi^{(b)}(a)/\vartheta(a) > 0,$

or the positive coefficient $l_a$ decreases with increasing $a$ and thus tends to a
limit $l \ge 0$ for $a \to \infty$. The solution

$\omega(z) = \eta(z) - l \cdot \vartheta(z)$

is the one we wish to construct. 9 It has the properties

(44) $\omega(0) = 1; \quad \omega(z) > 0, \quad \omega_1(z) \le 0,$

and is characterized by the fact that the condition

$\varphi(z) = \omega(z) - m \cdot \vartheta(z) \ge 0$

cannot be satisfied throughout the interval $0 \le z < \infty$ for any positive
constant $m$.
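
The monotone decrease of $l_a$ can be watched numerically. The following is a
rough sketch, not the author's method: it integrates the system (41), read as
$\varphi' = -2f\varphi + \varphi_1/p$, $\varphi_1' = 4\lambda p f'\varphi + 2f\varphi_1$, by the classical Runge-Kutta
scheme for the two basic solutions and forms $l_a = \eta(a)/\vartheta(a)$. The function
names, the step count and the test function $g = 1$ are illustrative assumptions.

```python
import math


def rhs(z, y, lam, g):
    """Right-hand side for the state (f', f, log p, eta, eta_1, theta, theta_1)."""
    f1, f, log_p, eta, eta1, th, th1 = y
    p = math.exp(log_p)
    return [
        g(z),                                       # f'' = g
        f1,                                         # derivative of f
        2.0 * f,                                    # (log p)' = 2 f
        -2.0 * f * eta + eta1 / p,                  # system (41) for (eta, eta_1)
        4.0 * lam * p * f1 * eta + 2.0 * f * eta1,
        -2.0 * f * th + th1 / p,                    # system (41) for (theta, theta_1)
        4.0 * lam * p * f1 * th + 2.0 * f * th1,
    ]


def l_of_a(lam, g, a, steps=2000):
    """Integrate from 0 to a with RK4 and return l_a = eta(a) / theta(a)."""
    h = a / steps
    y = [0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 1.0]         # eta = 1, eta_1 = 0, theta = 0, theta_1 = 1
    z = 0.0
    for _ in range(steps):
        k1 = rhs(z, y, lam, g)
        k2 = rhs(z + h / 2, [u + h / 2 * k for u, k in zip(y, k1)], lam, g)
        k3 = rhs(z + h / 2, [u + h / 2 * k for u, k in zip(y, k2)], lam, g)
        k4 = rhs(z + h, [u + h * k for u, k in zip(y, k3)], lam, g)
        y = [u + h / 6 * (p1 + 2 * p2 + 2 * p3 + p4)
             for u, p1, p2, p3, p4 in zip(y, k1, k2, k3, k4)]
        z += h
    return y[3] / y[5]


def one(z):
    """The test function g = 1, an element of the admissible set."""
    return 1.0


values = [l_of_a(1.0, one, a) for a in (0.5, 1.0, 1.5, 2.0)]
print(values)
```

For $\lambda = \tfrac{1}{2}$ equation (37) reduces to $\varphi'' + 2f\varphi' = 0$, so that with $g = 1$ one
has $l_a = 1 / \int_0^a e^{-z^3/3}\, dz$; this gives an independent check of the scheme.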

It remains to show that no solution $\varphi$ except this $\omega$ satisfies (40). Indeed,
according to what has just been stated, any such solution would have to be of
the form

$\varphi(z) = \omega(z) + m \cdot \vartheta(z), \qquad m > 0.$

The required inequality $\varphi_1 \le 0$ or $(p\varphi)' \le 0$ implies $p\varphi \le 1$ for $z \ge 0$. This
remarkable relation prevails in particular for $\varphi = \omega$. On the other hand,

$(\vartheta_1/p)' = 4\lambda f'\vartheta \ge 0,$


9 A similar construction in H. Weyl, Nachr. Ges. Wissensch. Göttingen, 1909, p. 39.


therefore

$\vartheta_1/p \ge 1, \qquad \vartheta_1 \ge p;$

$(p\vartheta)' = \vartheta_1 \ge p, \qquad p\vartheta \ge \int_0^z p\, dz \ge z.$

The consequent relation $p\varphi \ge mz$ is incompatible with $p\varphi \le 1$ for positive $m$.
We have now completely and unambiguously defined the functional operator
carrying a given $g(z) \ge 0$ into the function $\omega(z)$, $\omega = \Phi_\lambda\{g\}$. Since

(45) $p\omega \le 1$, a fortiori $\omega \le 1$,

our operator $\Phi_\lambda$ obeys the law

$0 < \Phi_\lambda\{g\} \le 1.$

Hence we can and will restrict ourselves to the set $\mathfrak{G}$ of all continuous functions
$g(z)$ for which $0 \le g \le 1$. Were $\Phi_\lambda$ monotone in the sense that $\Phi_\lambda\{g\}$ decreases
while $g$ increases, then there would be some hope for successful construction of a
fixed point of the operator $\Phi_\lambda$ in the functional space $\mathfrak{G}$ by some such alternating
process of successive approximation as carried us through in the special
instances $\lambda = 0$ and $\tfrac{1}{2}$. Unfortunately this does not seem to be so, and this
calamity forces me to proceed by the general theory concerning fixed points of
functional operators which we owe to Birkhoff-Kellogg and Schauder-Leray. 10
The main point will be to establish "equi-continuity" for the images $\omega = \Phi_\lambda\{g\}$
of all elements $g \in \mathfrak{G}$ and continuity for the operator $\Phi_\lambda$. The lemmas in the
following section are so conceived as to meet this demand.

7. Solving the problem $(A_\lambda)$

In the lemmas 1-4 the function $g$ is supposed to be any element of $\mathfrak{G}$ and
$c$, $c_0$, $c_1$, $c_2$, $c_3$ are numbers not depending on $g$. The condition $g \le 1$ implies
$f' \le z$.

Lemma 1.

$0 < -\omega'(0) \le c_1.$

Proof. Denote $-\omega'(0)$ by $l$ and argue as follows:

$(\omega_1/p)' = 4\lambda f'\omega \le 4\lambda z,$
$\omega_1/p \le -l + 2\lambda z^2,$

and then, because $p \ge 1$ and $\omega_1$ negative,

$(p\omega)' = \omega_1 \le \omega_1/p \le -l + 2\lambda z^2,$

$p\omega \le 1 - lz + \tfrac{2}{3} \lambda z^3.$

10 G. D. Birkhoff and O. D. Kellogg, Trans. Am. Math. Soc. 23, 1922, pp. 96-115.
J. Schauder, Studia Math. 2, 1930, pp. 171-180. J. Leray and J. Schauder, Ann. Sc. Ec.
Norm. Sup. 51, 1934, pp. 45-78.


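Since $\omega(z) > 0$ we get

$1 - lz + \tfrac{2}{3} \lambda z^3 > 0$ or $l < \dfrac{1}{z} + \tfrac{2}{3} \lambda z^2,$

and taking $z = 1$ we have proved the lemma with $c_1 = 1 + \tfrac{2}{3}\lambda$. However, we
exploit our inequality to the full when we choose $c_1$ as the minimum of the
elementary function at the right side of the last inequality, which is given by

(46) $c_1^3 = 9\lambda/2.$

One could also argue from the equation

$(p\omega')' = 2(2\lambda - 1) f' p\omega.$

If $\lambda \le \tfrac{1}{2}$ then

$(p\omega')' \le 0, \qquad p\omega' \le -l,$

and since $p \le e^{z^3/3}$,

$\omega' \le -l \cdot e^{-z^3/3}, \qquad 0 \le \omega \le 1 - l \cdot \int_0^z e^{-\zeta^3/3}\, d\zeta,$

therefore

$l \le 1/B = 0.777$ with $B = \int_0^\infty e^{-z^3/3}\, dz.$

If, however, $\lambda \ge \tfrac{1}{2}$, then $f' \le z$, $p\omega \le 1$ yield

$(p\omega')' \le 2(2\lambda - 1) z, \qquad p\omega' \le -l + (2\lambda - 1) z^2$

and taking the value

$\int_0^\infty z^2 \cdot e^{-z^3/3}\, dz = \int_0^\infty e^{-z^3/3} \cdot d(\tfrac{1}{3} z^3) = 1$

into account, we obtain by the same argument

$l \le 2\lambda/B.$

Hence the lemma is satisfied with

$c_1 = 1/B$ for $\lambda \le \tfrac{1}{2}$, $\qquad c_1 = 2\lambda/B$ for $\lambda \ge \tfrac{1}{2}$.

For small $\lambda$ and large $\lambda$ our first appraisal (46) is better, but in a certain middle
range, namely for $0.104 \le \lambda \le 1.096$, the second gives a sharper result.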
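
The end points of the middle range follow from equating the two appraisals.
The short sketch below is an added check under the reading $c_1^3 = 9\lambda/2$ of (46)
and $c_1 = 1/B$ or $2\lambda/B$ for the second appraisal; the helper names are
illustrative.

```python
import math

B = 3 ** (-2 / 3) * math.gamma(1 / 3)    # integral of exp(-z^3/3) over z >= 0


def c1_first(lam):
    """Appraisal (46): c_1^3 = 9 * lam / 2."""
    return (4.5 * lam) ** (1 / 3)


def c1_second(lam):
    """Second appraisal: 1/B for lam <= 1/2, 2*lam/B for lam >= 1/2."""
    return 1 / B if lam <= 0.5 else 2 * lam / B


lower = (1 / B) ** 3 / 4.5               # solves (9 lam / 2)^(1/3) = 1/B
upper = math.sqrt(4.5 * B ** 3 / 8)      # solves (9 lam / 2)^(1/3) = 2 lam / B
print(round(lower, 3), round(upper, 3))  # prints 0.104 1.096
```

Between the two values `c1_second` is the smaller of the two bounds, outside
them `c1_first` is.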
Lemma 2.

(47) $0 \le -p\omega' \le c_1 + c_2 z^2,$

and thus, a fortiori,

(48) $0 \le -\omega' \le c_1 + c_2 z^2.$

Proof.

(49) $p\omega' = \omega_1 - 2pf\omega \le \omega_1 \le 0.$

$(-p\omega')' = 2(1 - 2\lambda) f' p\omega.$

Again distinguish the cases $\lambda \ge \tfrac{1}{2}$, $\lambda \le \tfrac{1}{2}$. In the first case $-p\omega'$ decreases and
therefore

$-p\omega' \le l \le c_1.$

In the second case

$(-p\omega')' \le 2(1 - 2\lambda) z,$

$-p\omega' \le l + (1 - 2\lambda) z^2 \le c_1 + (1 - 2\lambda) z^2.$

Thus we have established Lemma 2 with

$c_2 = 0$ for $\lambda \ge \tfrac{1}{2}$, $\qquad c_2 = 1 - 2\lambda$ for $\lambda \le \tfrac{1}{2}$.

The following two lemmas prepare the way for an asymptotic appraisal of
$g(z)$ when $z$ approaches infinity.

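Lemma 3.

(50) $\int_0^1 \omega(z)\, dz \ge c \quad (> 0).$

Proof. Twice integrating (48) we find the inequalities

$\omega(z) \ge 1 - c_1 z - \tfrac{1}{3} c_2 z^3,$

$\int_0^z \omega(\zeta)\, d\zeta \ge z - \tfrac{1}{2} c_1 z^2 - \tfrac{1}{12} c_2 z^4 \ge z\left(1 - \dfrac{z}{2c_3}\right)$

for $z \le 1$ where

$1/c_3 = \max\, (1, \ c_1 + \tfrac{1}{6} c_2).$

Thus

$\int_0^1 \omega(\zeta)\, d\zeta \ge \int_0^{c_3} \omega(\zeta)\, d\zeta \ge \tfrac{1}{2} c_3.$

This argument is fully exploited by choosing $z = c_0$ as the point where the
polynomial $1 - c_1 z - \tfrac{1}{3} c_2 z^3$ changes sign and then computing the area

$c = \int_0^{c_0} (1 - c_1 z - \tfrac{1}{3} c_2 z^3)\, dz = c_0 (1 - \tfrac{1}{2} c_1 c_0 - \tfrac{1}{12} c_2 c_0^3)$

with the result

$c = \tfrac{1}{4} c_0 (3 - c_0 c_1) \quad (\ge \tfrac{1}{2} c_3).$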





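Lemma 4. If $\int_0^1 g(z)\, dz \ge \gamma$, then

$0 < \omega(z) \le e^{-\gamma(z-1)^2},$
$0 \le -(\omega' + 2f\omega) \le (c_1 + c_2 z^2) \cdot e^{-\gamma(z-1)^2}$

for $z \ge 1$.

Proof. The hypothesis implies for $z \ge 1$:

$f'(z) \ge \gamma, \qquad f(z) \ge \gamma(z - 1), \qquad 2 \int_0^z f(\zeta)\, d\zeta \ge \gamma(z - 1)^2,$

$p(z) \ge e^{\gamma(z-1)^2}.$

Combine this with (45), (47) and (49): $-\omega_1 \le -p\omega'$.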

We topologize the functional space $\mathcal{F}$ consisting of all functions $g = g(z)$
defined and continuous for $z \ge 0$ by agreeing that a sequence $g_n$ approaches
zero, $g_n \to 0$ with $n \to \infty$, if $g_n(z)$ converges to zero uniformly in each finite
interval; in other words, if to every $\epsilon > 0$, $z_0 > 0$ one can assign an $N(\epsilon, z_0)$ such
that $0 \le z \le z_0$, $n \ge N(\epsilon, z_0)$ imply $|g_n(z)| \le \epsilon$. This functional space $\mathcal{F}$ is
complete, i.e. convergence of a sequence $g_n$, $g_n - g_m \to 0$ with $n, m \to \infty$, implies
convergence to some element $g$, $g_n - g \to 0$. Our domain $\mathfrak{G}$ defined by
$0 \le g(z) \le 1$ is a closed convex part of $\mathcal{F}$. (See Appendix, under 1.)

Let $\delta(\epsilon, z)$ be any positive function of the variables $\epsilon > 0$, $z > 0$. The
element $g \in \mathfrak{G}$ is said to lie in $\mathfrak{G}_\delta$ if the inequality $|g(z_1) - g(z_2)| \le \epsilon$ holds
whenever

$0 \le z_1, z_2 \le z_0$ and $|z_1 - z_2| \le \delta(\epsilon, z_0)$

(equi-continuity of type $\delta$). Clearly $\mathfrak{G}_\delta$ is a compact subset of $\mathfrak{G}$; one has only
to consider the values of $g$ for rational arguments, marching them off in Indian
file. The same simple argument of interpolation as employed by Birkhoff and
Kellogg, l.c., 10 in proving their Theorem II (p. 103) yields the following general
principle (see Appendix, under 2):

An operator $\Phi\{g\}$ defined and continuous in $\mathfrak{G}$ and mapping $\mathfrak{G}$ into $\mathfrak{G}_\delta$ necessarily
has a fixed element $g = \Phi\{g\}$.

According to (48), Lemma 2, our operator $\Phi_\lambda$ maps $\mathfrak{G}$ into the subset $\mathfrak{G}_\delta$
corresponding to the function

$\delta(\epsilon, z) = \epsilon/(c_1 + c_2 z^2).$

Hence the existence of a solution $g$ of the functional equation $g = \Phi_\lambda\{g\}$ will be
proved as soon as we can establish continuity of the operator in $\mathfrak{G}$:

Lemma 5. The image $\omega = \Phi_\lambda\{g\}$ depends continuously on $g \in \mathfrak{G}$.

This fact, which we are now going to prove, is less trivial than it appears,
because it implies that $\omega$ varies but little when the "tail" of the function $g \in \mathfrak{G}$
(i.e. its values for large values of the argument $z$) changes arbitrarily. The
explicit formula (42) clearly shows that the particular solutions $\eta$, $\eta_1$ and $\vartheta$, $\vartheta_1$
depend continuously on $g$; these functions in a finite interval $0 \le z \le z_0$ do not
depend on what $g(z)$ does for $z > z_0$. The salient point is the way in which the
constant $l = -\omega'(0)$ in

$\omega(z) = \eta(z) - l \cdot \vartheta(z)$

depends on $g$.

Let $g^{(n)} \in \mathfrak{G}$ be a sequence which in the sense of our topology tends to $g$. Using
the notations $\eta^{(n)}$, $\vartheta^{(n)}$, $l^{(n)}$, $\omega^{(n)}$ in an obvious way we have

$\eta^{(n)} \to \eta, \quad \eta_1^{(n)} \to \eta_1; \qquad \vartheta^{(n)} \to \vartheta, \quad \vartheta_1^{(n)} \to \vartheta_1 \qquad$ with $n \to \infty$.

According to Lemma 1, all $l^{(n)}$ lie between 0 and $c_1$ and thus this sequence has
at least one point of condensation $l^*$. The functions

$\omega^*(z) = \eta(z) - l^* \cdot \vartheta(z), \qquad \omega_1^*(z) = \eta_1(z) - l^* \cdot \vartheta_1(z),$

being the limits of a subsequence of $\omega^{(n)}$, $\omega_1^{(n)}$, satisfy the inequalities $\omega^* \ge 0$,
$\omega_1^* \le 0$. However, we know that $\omega$ is the only solution of this kind, and
consequently $l^* = l$. Thus the bounded sequence $l^{(n)}$ has only one condensation
point, namely $l$, and therefore converges to $l$.

Our proof for the existence of a function $g \in \mathfrak{G}$ which is its own image under
the operator $\Phi_\lambda$ is now complete. As an image it satisfies the inequalities
(49), (50),

(51) $g' \le 0, \qquad \int_0^1 g(z)\, dz \ge c,$

besides $0 \le g \le 1$, and as image of a $g$ for which (51) holds, it satisfies the
further conditions

(52) $g(z) \le e^{-c(z-1)^2},$
     $0 \le -(g' + 2fg) \le (c_1 + c_2 z^2) \cdot e^{-c(z-1)^2}$

for $z \ge 1$ (Lemma 4). Expressing everything in terms of $f$ we see that
$f(0) = f'(0) = 0$, $f''(0) = 1$ and

(53) $f''' + 2ff'' - 2\lambda f'^2 = \text{const.}$

Moreover $f''$ is monotone decreasing and $f''$ as well as $f''' + 2ff''$ tend to zero
with $z \to \infty$ essentially as strongly as $e^{-cz^2}$. Hence the positive integral

$\int_0^\infty g(z)\, dz = f'(\infty) = \beta$

converges and for the constant on the right side of (53) we find the value $-2\lambda\beta^2$
so that

$f''' + 2ff'' + 2\lambda(\beta^2 - f'^2) = 0.$

An explicit appraisal of $\beta$ is obtained from (51) and (52):

$c < \beta < 1 + \tfrac{1}{2}\sqrt{\pi/c}.$





Putting finally

$w(z) = \kappa \cdot f(\kappa z)$ with $\kappa = (k/\beta)^{1/2}$

we may formulate our chief result as follows:

Theorem. For given positive $k$ the problem $(A_\lambda)$ has a solution $w$ whose
derivative is monotone increasing from 0 to $k$ as $z$ travels from 0 to infinity; the second
derivative decreases monotonely from

$w''(0) = a k^{3/2} \qquad (a = \beta^{-3/2})$

to zero, approaching zero with $z \to \infty$ at least as strongly as a function of the type

$e^{-\gamma z^2} \qquad (\gamma > 0).$

So far so good. But I should like to see the aircraft engineer who will apply
this method to compute the boundary layer for a given profile of an aerofoil!

We have not proved that there is only one solution of the problem $(A_\lambda)$.
One could try to approach the question of uniqueness by studying the continuous
variation of the operator $\Phi_\lambda$ with $\lambda$. 11 It seems not impossible to attack the
general boundary layer equation (B*) by the method here developed.

8. Appendix 

1. The bounded continuous functions $g$ form a normed linear space $\mathcal{F}_0$ in
which the norm $\|g\|$ induces our topology if, somewhat artificially, we define
the norm by

$\|g\| = \sum_i 2^{-i} \cdot \{\max_{0 \le z \le i} |g(z)|\} \qquad (i = 1, 2, \cdots).$

Whereas $\mathcal{F}_0$ is incomplete, the part $\mathfrak{G}$ is a complete closed subset of $\mathcal{F}_0$. Hence
the "general principle" on which we base our argument will fit into Schauder's
scheme only if one slightly generalizes his central theorem (Satz 2, on p. 175 of
Studia Mathematica 2, 1930) in the following manner. Let $\mathcal{F}_0$ be a normed
linear space; a continuous mapping of a complete and closed convex subset $\mathfrak{G}$
of $\mathcal{F}_0$ into a compact subset $\mathfrak{G}^*$ of $\mathfrak{G}$ has a fixed point.

2. In adapting the proof to our conditions, I give it a more constructive twist.
For the open interval $0 \le z < \infty$ one has to combine ever more refined
subdivision with exhaustion. Let therefore $n$, $\nu$ be two positive integers. For
any given numbers

$x_m \qquad (m = 0, 1, \cdots, n\nu; \ 0 \le x_m \le 1)$

form by linear interpolation the function $g(z) = g(z; x_0, \cdots, x_{n\nu})$ with the
prescribed values

$g(m/n) = x_m \qquad (m = 0, 1, \cdots, n\nu)$

in the interval $0 \le z \le \nu$ and extrapolate it beyond $\nu$ by $g(z) = x_{n\nu}$ for $z \ge \nu$.
Denote by $x_m^*$ the values of its image $g^* = \Phi\{g\}$ at the points $z = m/n$ ($m =$


11 See E. Rothe, Bull. Am. Math. Soc. 45, 1939, pp. 606-613.


$0, 1, \cdots, n\nu$). The continuous mapping $x_m \to x_m^*$ of the $(n\nu + 1)$-dimensional
unit cube $0 \le x_0 \le 1, \cdots, 0 \le x_{n\nu} \le 1$ has a fixed point (Brouwer's Theorem);
choose one, let it have the coordinates $x_m^0$ and set

$g_{n,\nu}(z) = g(z; x_0^0, \cdots, x_{n\nu}^0).$

The image $g_{n,\nu}^*$ of $g_{n,\nu}$ takes on the same values $x_m^0$ as $g_{n,\nu}$ itself at the points
$z = m/n$. Let $\nu_0$ be a positive integer. Since $g_{n,\nu}^* \in \mathfrak{G}_\delta$,

(54) $|g_{n,\nu}^*(z) - x_m^0| \le \epsilon$ for $\dfrac{m}{n} \le z \le \dfrac{m+1}{n} \qquad (m = 0, 1, \cdots, n\nu_0 - 1)$

provided $\nu \ge \nu_0$ and $1/n \le \delta(\epsilon, \nu_0)$; in particular

$|x_{m+1}^0 - x_m^0| \le \epsilon.$

Hence, because $g_{n,\nu}(z)$ is the linear interpolation of the values $x_m^0$, the inequality

$|g_{n,\nu}(z) - x_m^0| \le \epsilon$

holds under the same conditions as (54) and thus

$|g_{n,\nu}^*(z) - g_{n,\nu}(z)| \le 2\epsilon$ for $0 \le z \le \nu_0$

as soon as $\nu \ge \nu_0$ and $n \ge 1/\delta(\epsilon, \nu_0)$. This means that $g_{n,\nu}^* - g_{n,\nu} \to 0$ with
$n$ and $\nu$ tending to infinity. A subsequence 12 of the $g_{n,\nu}^* \in \mathfrak{G}_\delta$ tends to a limit $g$,
the corresponding subsequence of the $g_{n,\nu}$ to the same limit, and, on account of
the continuity of $\Phi$, the relation $g_{n,\nu}^* = \Phi\{g_{n,\nu}\}$ yields $g = \Phi\{g\}$.
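
The interpolation step of this construction is elementary and may be sketched
as follows. The helper name `interpolant` and the sample values are
illustrative assumptions; only the rule $g(m/n) = x_m$, linear in between and
constant beyond the last node, is taken from the text.

```python
def interpolant(x, n):
    """Piecewise linear g with g(m/n) = x[m], constant beyond the last node.

    The list x holds x_0 .. x_{n*nu}, so the last node sits at z = nu.
    Arguments z are assumed to be non-negative.
    """
    last = len(x) - 1

    def g(z):
        if z >= last / n:
            return x[last]                  # extrapolation g(z) = x_{n nu}
        m = min(int(z * n), last - 1)       # index of the node to the left of z
        t = z * n - m                       # position inside the cell, 0 <= t <= 1
        return (1 - t) * x[m] + t * x[m + 1]

    return g


# Example with n = 2, nu = 2, hence n*nu + 1 = 5 prescribed values in [0, 1].
g = interpolant([0.0, 0.5, 1.0, 0.25, 0.75], n=2)
print(g(0.25), g(0.5), g(1.25), g(2.0), g(10.0))
```

Since each value of $g$ is a convex combination of two neighbouring $x_m$, the
interpolant stays between 0 and 1 whenever the $x_m$ do.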

Institute for Advanced Study 

12 By a subsequence of the pairs $(n, \nu)$ we mean a sequence $(n_i, \nu_i)$ both members of
which are monotone: $n_i < n_{i+1}$, $\nu_i < \nu_{i+1}$.




Annals of Mathematics 
Vol. 43, No. 3, July, 1942 


ABSOLUTELY CONVERGENT FOURIER EXPANSIONS FOR 
NON-COMMUTATIVE NORMED RINGS 

By S. Bochner and R. S. Phillips 
(Received February 11, 1942) 

I. The main theorem 

The set R of elements x, y, • • • will be called a normed ring if 

a) R is a Banach space over the field of complex numbers. 

b) The operation of multiplication is defined for elements of R. This operation 
possesses the usual algebraic properties. 

c) R contains a unit element e. 

d) $\|xy\| \le \|x\| \cdot \|y\|$, $\|e\| = 1$.

Since commutativity of multiplication is not assumed we shall have to dis¬ 
tinguish between left inverses, right inverses, and (two-sided) inverses. We 
note that a left inverse, if it exists, need not be unique. 

Our principal result is the following theorem: 

Theorem 1. If $R'$ is the ring of periodic functions

(1) $x(t) = \sum_{-\infty}^{\infty} a_n e^{int}, \qquad a_n \in R,$

on $0 \le t < 2\pi$ to $R$, with

(2) $\sum_{-\infty}^{\infty} \|a_n\| < \infty,$

where the product $x(\cdot) \cdot y(\cdot)$ in $R'$ is the function $x(t)y(t)$, then $x(t)$ has a left
inverse in $R'$ if $x(t_0)$ has a left inverse in $R$, for every $t_0$.

This is a generalization of the known theorem of N. Wiener 1 from complex 
numbers to a general R. 

Denoting by $L(R)$ the Banach space of strongly integrable [Bochner, 2]
functions $f(t)$ on $-\infty < t < \infty$ to $R$ with the norm

(3) $\|f(\cdot)\| = \int_{-\infty}^{\infty} \|f(t)\|\, dt$

we are able to obtain a generalization of another of Wiener's theorems.

Theorem 2. If $f(t) \in L(R)$ and if the Fourier transform

(4) $x(u) = \int_{-\infty}^{\infty} f(t) e^{iut}\, dt$

has a left inverse in $R$, for each real $u$, then the linear combinations

(5)          $A_n \in R,$


1 We shall quote from the memoir of Wiener [6] rather than from his book on the Fourier 
Integral. 




are dense in $L(R)$. Conversely, if (5) is dense in $L(R)$ then $x(u)$ has a left
inverse throughout $(-\infty, \infty)$.

The possibility of extending Wiener's results from numbers to ring elements
was suggested to us by Gelfand's new method [3] of proving Wiener's theorem
with the aid of maximal ideals in rings. It turns out that Wiener's own method
leads very directly to our version of his theorem, whereas the adaptation of
Gelfand's method to the non-commutative case is slightly more elaborate.
We shall present both methods. The normed ring approach has been found
applicable to a slightly more general type of function ring.

II. Wiener’s method 

Lemma 1. If $a \in R$, $\|a\| < 1$, then the series

$c = e - a + a^2 - a^3 + \cdots$

is absolutely convergent and

(6) $c(e + a) = (e + a)c = e.$

Relation (6) can be verified by direct multiplication.

Lemma 2. If $a$ has a left inverse $a'$ (i.e. $a'a = e$) and if $\|b\| \cdot \|a'\| < 1$,
then $a + b$ has a left inverse $c$, and

(7) $c = a'(e - ba' + (ba')^2 - (ba')^3 + \cdots).$

In fact, $a + b = (e + ba')a$, and hence, by Lemma 1,

$c(a + b) = a'(e - ba' + (ba')^2 - \cdots)(e + ba')a = a'ea = e.$

Lemma 3. The set $U_l$ of all elements in $R$ with a left inverse is an open set.
Proof. Lemma 2.
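
Formula (7) can be tried out in the ring of real $2 \times 2$ matrices with the
maximum row-sum norm, which is a normed ring in the sense of a) to d). The
following is a small sketch under that assumption; in a matrix ring every left
inverse is a two-sided inverse, so the example illustrates the series only, not
the non-uniqueness of left inverses. All names are illustrative.

```python
def matmul(p, q):
    """Product of two square matrices given as nested lists."""
    n = len(p)
    return [[sum(p[i][k] * q[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def left_inverse(a_left, b, terms=80):
    """Truncated series (7): c = a'(e - b a' + (b a')^2 - ...)."""
    n = len(b)
    e = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    ba = matmul(b, a_left)
    total = [row[:] for row in e]
    power = [row[:] for row in e]
    sign = 1.0
    for _ in range(terms):
        power = matmul(power, ba)           # next power of b a'
        sign = -sign                        # alternating signs of the series
        for i in range(n):
            for j in range(n):
                total[i][j] += sign * power[i][j]
    return matmul(a_left, total)


a = [[2.0, 1.0], [0.0, 1.0]]
a_left = [[0.5, -0.5], [0.0, 1.0]]          # a_left * a = e, row-sum norm 1.0
b = [[0.1, 0.0], [0.05, -0.1]]              # row-sum norm 0.15, so 0.15 * 1.0 < 1
c = left_inverse(a_left, b)
a_plus_b = [[a[i][j] + b[i][j] for j in range(2)] for i in range(2)]
product = matmul(c, a_plus_b)
print(product)
```

The printed product should agree with the unit matrix up to rounding.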

Since R is a Banach space, we first note that if x(t) belongs to the R' of Theo¬ 
rem 1, it is in particular strongly integrable. Therefore series (1) is its Fourier 
series, and many familiar theorems hold. For instance, expansion (1) is unique, 
no matter how arrived at. In what follows, analysis of real and complex 
variables will be applied to the value space R without special emphasis. 

Lemma 4. If $x(\cdot) \in R'$, and if $x(0)$ has a left inverse in $R$, then there exists an
element

$y(t) = \sum_{-\infty}^{\infty} c_n e^{int}$

in $R'$ with the following two properties:

(i) the coefficient $c_0$ has a left inverse $c_0'$, and

$\|c_0'\| \cdot \sum_{n=1}^{\infty} (\|c_n\| + \|c_{-n}\|) < 1;$

(ii) in some interval $-\epsilon < t < \epsilon$, $y(t) = x(t)$.





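Proof. As in Wiener [6, p. 12, Lemma IId], we introduce on the circle
$-\pi \le t < \pi$ the numerical function

$u_\epsilon(t) = \begin{cases} 1, & |t| < \epsilon \\ 2 - \dfrac{|t|}{\epsilon}, & \epsilon \le |t| < 2\epsilon \\ 0, & 2\epsilon \le |t| \le \pi; \end{cases}$

and the function

$y_\epsilon(t) = u_\epsilon(t) \cdot x(t) + [1 - u_\epsilon(t)] \cdot x(0) = \sum c_n(\epsilon) e^{int}.$

Obviously $y_\epsilon(t)$ satisfies property (ii) for each $\epsilon$. As for property (i), Wiener's
argument shows that

$\lim_{\epsilon \to 0} c_0(\epsilon) = x(0),$

$\lim_{\epsilon \to 0} \sum_{1}^{\infty} (\|c_n(\epsilon)\| + \|c_{-n}(\epsilon)\|) = 0.$

Thus by Lemma 2 property (i) holds for sufficiently small $\epsilon$.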

Lemma 5. If $y(\cdot) \in R'$, and $y(t)$ satisfies property (i) of Lemma 4, then $y(\cdot)$
has a left inverse $y'(\cdot)$ in $R'$.

In fact, putting

$B(t) = \sum_{n=1}^{\infty} (c_n e^{int} + c_{-n} e^{-int}),$

we have by Lemma 2,

$y'(t) = c_0'[e - B(t)c_0' + (B(t)c_0')^2 - (B(t)c_0')^3 + \cdots].$

Now if $x(\cdot) \in R'$ and if for all $t$, $x(t) \in U_l$, then by Lemmas 4 and 5 there
exists for each $t_0$ a function $y_{t_0}(\cdot) \in R'$ such that $y_{t_0}(t) \cdot x(t) = e$ in some interval
$(t_0 - \epsilon, t_0 + \epsilon)$. We can now piece together a finite number of these functions
$y_{t_0}(\cdot)$ and obtain a function $x'(\cdot) \in R'$ for which $x'(t) \cdot x(t) = e$ for all $t$. For
details of the argument see Wiener [6, pp. 10-11, Lemma II*]. This concludes
the proof of Theorem 1.

We now turn to the proof of Theorem 2. Our main step is

Lemma 6. Given constants $-\pi < \alpha < a < b < \beta < \pi$, if $x_1(\cdot)$, $x_2(\cdot)$ belong
to $R'$ where $x_2(u)$ has a left inverse for $\alpha < u < \beta$, and $x_1(u)$ vanishes outside
$a \le u \le b$, then there exists an element $x_3(\cdot)$ of $R'$ which vanishes outside $\alpha \le u \le \beta$
such that

(8) $x_1(u) = x_3(u)x_2(u)$

in $-\pi \le u < \pi$.

Proof. By Lemmas 4 and 5 corresponding to any $u_0$ in $a \le u \le b$, there
exists a function $y_{u_0}(\cdot) \in R'$ such that in some interval $(u_0 - \epsilon, u_0 + \epsilon)$,
$y_{u_0}(u)x_2(u) = e$. As above, we can define a function $x_2'(u)$ which belongs to $R'$





S. BOCHNER AND R. S. PHILLIPS 


and for which $x_2'(u)x_2(u) = e$ in $a \le u \le b$. Let $x_3(u) = x_1(u) \cdot x_2'(u)$. Then
$x_3(u) = 0$ in the intervals $-\pi \le u \le \alpha$, $\beta \le u < \pi$. In addition

$x_3(u) \in R'$, and $x_1(u) = x_3(u)x_2(u)$.

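Lemma 7. If $x(u)$ is strongly integrable in $(-\pi, \pi)$ and vanishes in $(-\pi, -\pi + \epsilon)$
and $(\pi - \epsilon, \pi)$, and if we put

$f(t) = \dfrac{1}{2\pi} \int_{-\pi}^{\pi} x(u) e^{-iut}\,du, \qquad a_n = \dfrac{1}{2\pi} \int_{-\pi}^{\pi} x(u) e^{-inu}\,du,$

then $\int_{-\infty}^{\infty} \|f(t)\|\,dt < \infty$ if and only if $\sum_{n=-\infty}^{\infty} \|a_n\| < \infty$.

Proof. As in Wiener [6, pp. 14-15, Lemma IIf].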

From Lemmas 6, 7 we now conclude

Lemma 8. If $g(t)$ and $f(t)$ both belong to $L(R)$, if $x_1(u) = \int_{-\infty}^{\infty} g(t) e^{-iut}\,dt$ vanishes
outside some interval $(a, b)$, and if $x_2(u) = \int_{-\infty}^{\infty} f(t) e^{-iut}\,dt$ vanishes outside some
larger interval $(\alpha, \beta)$, $\alpha < a$, $b < \beta$, $x_2(u)$ having a left inverse for each $u$ in
$\alpha < u < \beta$, then there exists an element $h(t)$ in $L(R)$ such that

(9) $g(t) = \int_{-\infty}^{\infty} h(\lambda) f(t - \lambda)\,d\lambda.$

Integral (9) can be approximated in $L(R)$ by sums of the form (5). The
transition from Lemma 8 to Theorem 2 can be effected by Fejér approximation
as in Wiener [6, pp. 16-18].

III. Irreducibility and maximal ideals 

The following lemmas are the algebraic basis for an adaptation of Gelfand’s 
methods. 

Let $V = (a, b, c, \cdots)$ be a Banach space over the complex numbers, and let
$A = (\alpha, \beta, \cdots)$ be the ring of all bounded linear transformations on $V$ to $V$.
Then $\alpha(a + b) = \alpha a + \alpha b$, $(\beta\alpha)a = \beta(\alpha a)$. If $V_0$ is a subset of $V$ and $A_0$ a
subset of $A$ then $A_0V_0$ will denote the set of all elements $(\alpha a)$ of $V$ for $\alpha \in A_0$
and $a \in V_0$.

$V$ will be said to be irreducible over the subring $A_0$ of $A$ if for every $a \in V$,
($a \ne 0$), $A_0a = V$. This means that $V$ has no subspaces invariant with
respect to $A_0$.

Lemma 9. If $V$ is irreducible over $A_0$, and if $A'$ denotes the set of all elements
of $A$ which commute with all elements of $A_0$, then $A'$ is isomorphic to the field
of complex numbers.

In fact if $\alpha', \beta' \in A'$ and $\alpha_0 \in A_0$, then $\alpha'\alpha_0 = \alpha_0\alpha'$, $\beta'\alpha_0 = \alpha_0\beta'$ implies
$(\alpha' \pm \beta')\alpha_0 = \alpha_0(\alpha' \pm \beta')$. Similarly $\alpha'\beta'\alpha_0 = \alpha'\alpha_0\beta' = \alpha_0\alpha'\beta'$. Thus $A'$ is a ring.
Clearly $A'$ contains the null element and the identity. Now, if $\alpha' \in A'$, $\alpha' \ne 0$,
consider the set $W = \alpha'V$. The irreducibility implies $A_0W = A_0V = V$,





and the commutativity implies $\alpha'A_0 = A_0\alpha'$. Therefore $W = \alpha'V = \alpha'(A_0V) =
A_0(\alpha'V) = A_0W = V$. That is, the range of the homomorphism $\alpha'$ is all of $V$.
Now suppose that for some $a \in V$, $a \ne 0$, we had $\alpha'a = 0$. Then $\alpha'V = \alpha'(A_0a)
= A_0(\alpha'a) = 0$ which is contrary to $\alpha'V = V$. $\alpha'$ is therefore a one to one
linear bounded transformation on $V$ to $V$. By a theorem due to Banach ([1],
p. 41, Theorem 5), the inverse $\beta'$ is a linear bounded transformation and hence
belongs to $A$. $\beta'\alpha_0 = \beta'\alpha_0\alpha'\beta' = \beta'\alpha'\alpha_0\beta' = \alpha_0\beta'$ for all $\alpha_0 \in A_0$. Thus $\beta'$
belongs to $A'$. $A'$ is therefore a field. Clearly a limit (in $A$) of transformations
which commute with elements of $A_0$ also commutes with these elements.
$A'$ is thus a complete normed field over the complex numbers. Such a field is
isomorphic to the field of complex numbers. Since the proof of this is the same
in the commutative and noncommutative cases we refer the reader to Gelfand
[3, pp. 6-8].

If $R$ is a ring with unit element, a left ideal $I$ is a subset with the properties:

1) if $x \in I$, $y \in I$, then $px + qy \in I$ for any elements $p$, $q$ from $R$.

2) $I$ is a proper subset of $R$.

A maximal left ideal is one not contained in a larger left ideal.

Lemma 10. If $R$ is a ring with unit, if $I$ is a maximal left ideal, if $V$ is the
addition group of the cosets $R/I$, and if $A_0$ is the ring of homomorphisms of $V$ onto
itself as produced by multiplying $V$ by elements of $R$ from the left, then $V$ is
irreducible with respect to $A_0$.

If $x \in R$, then the element of $A_0$ corresponding to $x$ will be denoted by $\mu(x)$.
For fixed $a \in V$, $a \ne 0$, we form the set $V_0 = \mu(R)a$. Since $R$ is a linear space,
$V_0$ is a linear subspace of $V$. Also, since $R$ contains a unit element $e$, $\mu(e)a = a \ne 0$.
Thus $V_0$ contains the null element and some other elements. Now form the
set $S$ of all elements of $R$ contained in the cosets composing $V_0$. Then $S$
includes $I$ as a proper part. On the other hand, $S$ is a left ideal. As $I$ was
supposed maximal, $S = R$. Thus $V_0 = V$; that is $A_0a = V$.

Lemma 11. If $R$, $I$, $V$, $A_0$ have the same meaning as in Lemma 10, and if
for fixed $x \in R$ and every maximal left ideal $I$ the corresponding element $\mu(x)$ of $A_0$
has a left inverse in $A_0$, then $x$ has a left inverse in $R$.

Proof. Since $R$ contains a unit element $e$, every left ideal is contained
in a maximal left ideal, and consequently an element $x$ of $R$ has a left inverse
$x'$ if (and only if) it is contained in no maximal ideal. See Gelfand [3, pp. 8-9].

If $x'x = e$, then $\mu(x')\mu(x) = \mu(e)$. Denoting by $a_e$ and $a_0$ the elements of $V$
which as cosets in $R/I$ contain the elements $e$ and $0$ respectively, we have in
particular

(10) $\mu(x')\mu(x)a_e = \mu(e)a_e.$

Since $e \cdot e = e$, we have $\mu(e)a_e = a_e$. On the other hand, $\mu(x)a_e$ contains $x$.
Hence if $x$ were contained in $a_0 = I$, we would have $\mu(x)a_e = a_0$, and
consequently $\mu(x')\mu(x)a_e = a_0$. This completes the proof of Lemma 11.

Finally we have to make several statements for normed rings. 

Lemma 12. Every maximal ideal of a normed ring is closed. 





For the proof see Gelfand [3, p. 8]. 

Preparatory to an amendment of Lemma 11, we first observe that if $R$ is
normed, and $X$ is a closed linear subspace, then $R/X$ is made into a Banach
space by introducing the norm

$\|Y\| = \inf_{y \in Y} \|y\|$

for any $Y$ of $R/X$ [see 3].

Lemma 13. If $R$, $I$, $V$, $A_0$ have the same meaning as in Lemma 10, then $V$
is a Banach space, $A_0$ is a normed ring, and for the norm $\|\mu(x)\|$ in $A_0$ we have

$\|\mu(x)\| \le \|x\|, \quad x \in R.$

In fact,

$\|\mu(x)\| = \sup_{a \in V} \dfrac{\|\mu(x)a\|}{\|a\|} = \sup_{a \in V} \dfrac{\inf_{y \in a} \|xy\|}{\inf_{y \in a} \|y\|} \le \|x\|.$


IV. Theorem 3 

Assumptions. Let $F$ denote a (commutative) ring of complex valued functions
$f(t)$ on a point set $[t]$. Ring multiplication is ordinary multiplication $f(t) \cdot g(t)$,
and the constant function $f(t) = 1$ belongs to $F$ and is its unit. The norm in $F$
will be denoted by ordinary bars: $|f(t)|$. By $M(f)$ we shall denote any continuous
ring homomorphism from $F$ to complex numbers. Thus in particular $M(fg) =
M(f) \cdot M(g)$.

Let $R = (x, y, \cdots)$ denote any (non-commutative) normed ring with unit $e$ and
norm $\|x\|$, and let $R'$ denote a family of functions $x(\cdot) = x(t)$ from $[t]$ to $R$ with
the following properties:

1) $R'$ is a ring under point multiplication $x(t)y(t)$.

2) If $x_1, \cdots, x_n \in R$, $f_1(t), \cdots, f_n(t) \in F$, then

(11) $x_1 f_1(\cdot) + \cdots + x_n f_n(\cdot)$

belongs to $R'$. If $x \in R$, $f(t) \in F$, then the element $xf(\cdot)$ of $R'$ will be denoted by $x^f$.

3) $R'$ is a normed ring with norm $\|x(\cdot)\|$ for which

$\|x^f\| = \|x\| \cdot |f|.$

4) The linear combinations (11) are dense in $R'$.

5) If $x(\cdot) = x_1^{f_1} + \cdots + x_n^{f_n}$, then for every homomorphism $M(f)$ of $F$,

$\|x_1M(f_1) + \cdots + x_nM(f_n)\| \le \|x(\cdot)\|.$

On the basis of 4) and 5) every $M(f)$ gives rise to a continuous homomorphism
$M(x(\cdot))$ from $R'$ to $R$, with the property

$M(x^f) = xM(f).$

We will call $M$ a generated homomorphism.

Conclusion. An element $x(\cdot) \in R'$ has a left inverse if for every generated
homomorphism $M$, the element $M(x(\cdot))$ of $R$ has a left inverse in $R$.





Proof. Replacing $R$ by $R'$ in Lemma 11, we consider an arbitrary maximal
left ideal $I$ and for any element $x(\cdot) \in R'$ the corresponding element $\mu(x(\cdot))$ of
$A_0$. The elements $e^f(\cdot) = ef(\cdot)$ of $R'$, $e$ = unit in $R$, $f \in F$, are commutative
with any element $x^g$ of $R'$ and hence by 4) with every element of $R'$. Therefore,
by Lemmas 9, 10, and 13, $\mu(e^f(\cdot))$ can be looked upon as a continuous ring
homomorphism from $F$ to complex numbers. More precisely there exists an
$M(f)$ such that

$\mu(e^f) = \epsilon_1 M(f)$ ($\epsilon_1 = \mu(e \cdot 1)$ is unit of $A$).

For an element of the form (11) we have

$\mu(x(\cdot)) = \mu(x_1^{f_1} + \cdots + x_k^{f_k}) = \mu(x_1 \cdot 1)\mu(e^{f_1}) + \cdots + \mu(x_k \cdot 1)\mu(e^{f_k})$

$= \mu(x_1 \cdot 1)M(f_1) + \cdots + \mu(x_k \cdot 1)M(f_k) = \mu([x_1M(f_1) + \cdots + x_kM(f_k)] \cdot 1)$

and thus

(12) $\mu(x(\cdot)) = \mu(M(x(\cdot)) \cdot 1).$

By property 5) in conjunction with Lemma 13 this relation is valid for all
elements $x(\cdot) \in R'$. Now if $M(x(\cdot))$ has a left inverse $x'$ in $R$, then

$\mu(x' \cdot 1) \cdot \mu(x(\cdot)) = \mu(x' \cdot x(\cdot)).$

Applying (12) to $x' \cdot x(\cdot)$ instead of $x(\cdot)$, this is

$= \mu(M(x' \cdot x(\cdot)) \cdot 1) = \mu(x' \cdot M(x(\cdot)) \cdot 1) = \mu(e \cdot 1) = \epsilon_1.$

Thus $\mu(x(\cdot))$ has a left inverse, and by Lemma 11, $x(\cdot)$ has a left inverse in $R'$.

A Particular Assumption. Corresponding to any $M(f)$ there exists a
point $t_0$ such that

$M(f) = f(t_0), \quad f \in F.$

In this case, property 5) can be replaced by the simpler property: $\|x(t_0)\| \le
\|x(\cdot)\|$, for each $t_0$.

A Particular Conclusion. The element $x(\cdot) \in R'$ has a left inverse in $R'$,
provided $x(t)$ has a left inverse in $R$, for all $t$.

The proof is obvious.

Adjunction of unit. If the ring $F$ has no unit, we make a formal adjunction
of a unit $1$, and we consider the enlarged ring $\bar{F}$: $\lambda 1 + f$, with the norm $|\lambda| + |f|$
($\lambda$ = complex number). This leads to the ring $\bar{R}'$ of elements $\bar{x}(\cdot) = \lambda e \cdot 1 +
x(\cdot)$ where $\|\bar{x}(\cdot)\| = |\lambda| + \|x(\cdot)\|$. It can be shown, Gelfand [3], that
every homomorphism $\bar{M} = \bar{M}(\lambda 1 + f)$ to the complex numbers is either $\lambda + M(f)$
where $M$ is a homomorphism of $F$, or it is the exceptional homomorphism $\bar{M}(\lambda 1
+ f) = \lambda$. We can then obtain the following conclusion: If $F$ has no unit,
then the element $\bar{x}(\cdot) = \lambda e \cdot 1 + x(\cdot)$ with $\lambda \ne 0$ and $x(\cdot) \in R'$, has a left inverse
of the form $\lambda' e \cdot 1 + x'(\cdot)$, if for every generated homomorphism $M$ the element
$\lambda e + M(x(\cdot)) \in R$ has a left inverse in $R$. In the particular circumstances
cited before, it is again sufficient that $\lambda e + x(t)$ shall have an inverse for each $t$.





V. Expansions on groups. Applications 

By known results on topological groups, and by results on numerical functions
by Gelfand [4], and Gelfand-Raikov [5], Theorem 3 leads easily to the
following theorems.

Theorem 4. If $T = (\alpha, \beta, \cdots)$ is a commutative group of addition with discrete
topology and $G = (t, s, \cdots)$ is the compact dual group of addition; if $\{\chi(\alpha, t)\}$,
$\alpha \in T$, $t \in G$ are the characters from $T$ to $G$; if $R$ is any normed ring; and if $R'$ is
the ring of functions on $G$,

(13) $x(t) = \sum_{\alpha} a_\alpha \chi(\alpha, t), \quad a_\alpha \in R,$

(14) $\sum_{\alpha} \|a_\alpha\| < \infty,$

then $x(\cdot)$ has a left inverse in $R'$, provided $x(t)$ has a left inverse in $R$, for all $t$.
More generally, if (14) is replaced by

$\sum_{\alpha} e^{q_\alpha} \|a_\alpha\| < \infty,$

where the real numbers $q_\alpha$ have the properties $q_{\alpha+\beta} \le q_\alpha + q_\beta$ and $q_0 = 0$, then the
condition is that

$\sum_{\alpha} a_\alpha e^{p_\alpha} \chi(\alpha, t)$

shall have a left inverse in $R$, for all $t$, and every system of real numbers $p_\alpha$, for which
$p_{\alpha+\beta} = p_\alpha + p_\beta$, $p_0 = 0$, $p_\alpha \le q_\alpha$.

However, if $T$ is a locally bicompact group with unique Haar measure $d\alpha$; if
$\chi(\alpha, t)$ are continuous characters; if $R'$ is formed by

$x(t) = \int_T a_\alpha \chi(\alpha, t)\,d\alpha, \quad a_\alpha \in R,$

where

$\int_T e^{q_\alpha} \|a_\alpha\|\,d\alpha < \infty;$

and if $R'$ has no unit, but $1$ is an adjoined unit, then the element $\bar{x}(\cdot) = \lambda 1 + x(\cdot)$
has a left inverse of the form $\lambda' 1 + x'(\cdot)$ provided

$\lambda e + \int_T a_\alpha e^{p_\alpha} \chi(\alpha, t)\,d\alpha$

has a left inverse for all $t$, and every continuous system $p_\alpha$ with the previous properties.

In the case of a discrete group $T$ it is often natural to consider not the total
compact group $G$ but a dense subgroup $G_0$. For instance, if $T$ is the linear
group $-\infty < \alpha < \infty$ without topology then it is natural to consider the characters
$\chi(\alpha, t) = e^{i\alpha t}$ with $-\infty < t < \infty$, dense in the total group $G$. It is not





necessary to make sure of the existence of $x'(t)$ for all $G$, but it is sufficient to
know that the left inverse $x'(t)$ exists and is bounded in norm in $(-\infty, \infty)$. This
results from the following lemma:

Lemma 14. Suppose $x_n \to x$ in $R$. If $x_n'$ is a left inverse of $x_n$ and the $\|x_n'\|$
are bounded, then $x$ possesses a left inverse.

Proof. $e - x_n'x = e - x_n'x_n + x_n'x_n - x_n'x$. Hence $\|e - x_n'x\| \le \|x_n'\| \cdot
\|x_n - x\| \to 0$. By Lemma 1 there exists for sufficiently large $n$, an inverse
$y_n$ of $x_n'x$ so that $y_nx_n'$ is a left inverse for $x$.

However, if $T$ is the continuous group $-\infty < \alpha < \infty$ with the ordinary
Lebesgue measure as measure, the line $(-\infty, \infty)$ is the complete group $G$.
Thus we obtain the following:

Theorem 5. If $x(t) = \sum a_n e^{i\alpha_n t}$, $a_n \in R$, $\sum \|a_n\| < \infty$, and if a left inverse
$y(t)$ exists for each $t$ and $\|y(t)\| < C$, then there exists a function $x'(t) = \sum b_n e^{i\beta_n t}$
with $\sum \|b_n\| < \infty$ such that $x'(t) \cdot x(t) = e$.

If $a(\alpha) \in R$, $-\infty < \alpha < \infty$, and $a(\alpha)$ is strongly integrable, $\int_{-\infty}^{\infty} \|a(\alpha)\|\,d\alpha < \infty$,
and if for complex $\lambda \ne 0$, $\lambda e + \int_{-\infty}^{\infty} a(\alpha) e^{i\alpha t}\,d\alpha$ has a left inverse for all $t$, then
there exists a left inverse of the form $\lambda' e + \int_{-\infty}^{\infty} b(\alpha) e^{i\alpha t}\,d\alpha$, with $\int_{-\infty}^{\infty} \|b(\alpha)\|\,d\alpha < \infty$.

Difference and integral equations. Now, consider functions $\varphi(t), \psi(t), \cdots$
on the line $-\infty < t < \infty$ with values in a space on which $R$ operates; and assume
that they form a Banach space $B$, and that the norm in $B$ is invariant under
translation. Then, all symbols having the same meaning as in Theorem 5,
we see that the difference equation

(15) $\sum a_n \varphi(t - \alpha_n) = \psi(t)$

has a solution

(16) $\varphi(t) = \sum b_n \psi(t - \beta_n),$

and the integral equation

$\varphi(t) + \int_{-\infty}^{\infty} a(\alpha)\varphi(t - \alpha)\,d\alpha = \psi(t)$

has a solution

$\varphi(t) = \psi(t) + \int_{-\infty}^{\infty} b(\beta)\psi(t - \beta)\,d\beta.$
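The passage from (15) to (16) can be tried out on a scalar toy case. The Python sketch below takes the hypothetical difference equation phi(t) + 0.5 phi(t - 1) = psi(t), that is, a_1 = 1, alpha_1 = 0, a_2 = 0.5, alpha_2 = 1 in (15). Expanding 1/(1 + 0.5 z) in a geometric series suggests b_n = (-0.5)^n and beta_n = n in (16). The right-hand side psi(t) = cos t, the truncation length and the function names are assumptions made only for this illustration.

```python
import math


def psi(t):
    """Hypothetical bounded right-hand side of the difference equation."""
    return math.cos(t)


def phi(t, terms=80):
    """Truncated series (16) with b_n = (-0.5)**n and beta_n = n."""
    return sum((-0.5) ** n * psi(t - n) for n in range(terms))


def residual(t):
    """Left side minus right side of phi(t) + 0.5*phi(t - 1) = psi(t)."""
    return phi(t) + 0.5 * phi(t - 1.0) - psi(t)


sample_points = [-3.0, 0.0, 0.7, 10.0]
residuals = [residual(t) for t in sample_points]
print(residuals)
```

Because the coefficients decay geometrically, the truncation after 80 terms changes the sum by far less than rounding error, so the residuals are zero up to floating-point accuracy.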

For instance, let $R$ be the space of $k$-dimensional matrices of complex numbers

$x = (x_{ij}), \quad i, j = 1, \cdots, k,$

and let $\|x\|$ be the maximum of $|\sum x_{ij} p_i q_j|$ for $\sum |p_i|^2 \le 1$, $\sum |q_j|^2 \le 1$.

Then we obtain the following result. 





If $a_n = (a_{ij}^n)$, $\sum \|a_n\| < \infty$, $\alpha_n$ = real, and if for

$D(t) = \left(\sum_n a_{ij}^n e^{i\alpha_n t}\right), \quad i, j = 1, \cdots, k,$

$\|D^{-1}(t)\| \le C < \infty, \quad -\infty < t < \infty,$

then there exist matrices $b_n = (b_{ij}^n)$, $\sum \|b_n\| < \infty$, and real numbers $\beta_n$,
such that the system of equations

$\sum_{n=1}^{\infty} \sum_{j=1}^{k} a_{ij}^n \varphi_j(t + \alpha_n) = \psi_i(t), \quad i = 1, \cdots, k$

has a solution of the form

$\varphi_i(t) = \sum_{n=1}^{\infty} \sum_{j=1}^{k} b_{ij}^n \psi_j(t + \beta_n).$

If the functions $\psi_j(t)$ belong to Lebesgue class $L_p$, to $B$, $M$, $C$ etc., then the
functions $\varphi_i(t)$ belong to the same class respectively.

Princeton University, 

Harvard University 


References 

1. S. Banach, Théorie des Opérations Linéaires, Warsaw, 1932.

2. S. Bochner, Integration von Funktionen, deren Werte die Elemente eines Vektorraumes sind, Fund. Math., vol. 20, 1933, pp. 262-276.

3. I. Gelfand, Normierte Ringe, Recueil Mathématique, vol. 9 (51), 1941, pp. 3-24.

4. I. Gelfand, Über absolut konvergente trigonometrische Reihen und Integrale, Recueil Mathématique, vol. 9 (51), 1941, pp. 51-66.

5. I. Gelfand and D. Raikov, On the Theory of Characters of Commutative Topological Groups, Comptes Rendus de l'Académie des Sciences de l'URSS, vol. 28, 1940, pp. 195-6.

6. N. Wiener, Tauberian theorems, these Annals, vol. 33, 1932, pp. 1-100.



Annals of Mathematics 
Vol. 43, No. 3, July, 1942 


ON THE LAW OF THE ITERATED LOGARITHM 


By Paul Erdős
(Received December 26, 1941)


Introduction 

Let $t$ be a real number ($0 \le t \le 1$), and let $t = 0.\epsilon_1(t)\epsilon_2(t)\cdots$ be its dyadic
expansion, or equivalently,

(0.1) $t = \dfrac{\epsilon_1(t)}{2} + \dfrac{\epsilon_2(t)}{2^2} + \cdots + \dfrac{\epsilon_n(t)}{2^n} + \cdots,$

where $\epsilon_n(t) = 0$ or $1$ according as the integral part of $2^n t$ is even or odd. It
is well known that $\{\epsilon_n(t)\}$ ($n = 1, 2, \cdots$) is an independent system in the
sense of probability,$^{1}$ and that



Let us further put

(0.3) $f_n(t) = \sum_{k=1}^{n} \epsilon_k(t) - \dfrac{n}{2}.$
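The definitions of the dyadic digits and of (0.3) translate directly into a few lines of Python. The sketch below is only an illustration: the function names `digit` and `f` and the sample point t = 5/8 are choices made here, and exact rational arithmetic is used so that the integral part of 2^n t is computed without rounding.

```python
from fractions import Fraction
from math import floor


def digit(t, n):
    """epsilon_n(t): 0 or 1 according as the integral part of 2^n t is even or odd."""
    return floor(2 ** n * t) % 2


def f(t, n):
    """f_n(t) from (0.3): the number of ones among the first n digits, minus n/2."""
    return sum(digit(t, k) for k in range(1, n + 1)) - Fraction(n, 2)


t = Fraction(5, 8)  # 0.101 in binary
digits = [digit(t, k) for k in range(1, 5)]
print(digits, f(t, 3), f(t, 4))
```

Note that f_n(t) is an integer when n is even and an integer plus 1/2 when n is odd, which is why `f` returns a `Fraction`.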


It was proved by A. Khintchine$^{2}$ and A. Kolmogoroff$^{3}$ that

(0.4) $\limsup_{n \to \infty} \dfrac{f_n(t)}{\left(\frac{n}{2} \log\log n\right)^{1/2}} = 1$

for almost all $t$.

Let $\varphi(n)$ be a monotone increasing non-negative function defined for all
sufficiently large integers. Following P. Lévy we say that $\varphi(n)$ belongs to the
upper class if, for almost all $t$, there exist only finitely many $n$ such that

(0.5) $f_n(t) > \varphi(n);$

and $\varphi(n)$ belongs to the lower class if, for almost all $t$, there exist infinitely many
$n$ such that (0.5) is true. According to the well-known law of 0 or 1, each $\varphi(n)$
must belong to one of these classes. Then the result of A. Khintchine and A.
Kolmogoroff stated above means that $\varphi(n) = (1 + \epsilon)(\tfrac{1}{2} n \log\log n)^{1/2}$ belongs to
the upper class if $\epsilon > 0$, and to the lower class if $\epsilon < 0$.

The purpose of the present paper is to give a sharpening of this result. The


$^{1}$ Cf. M. Kac and H. Steinhaus, Sur les fonctions indépendantes, Studia Math. 6 (1936),
46-58, 59-66, 89-97.

$^{2}$ A. Khintchine, Asymptotische Gesetze der Wahrscheinlichkeitsrechnung, Berlin, 1933.

$^{3}$ A. Kolmogoroff, Über das Gesetz des iterierten Logarithmus, Math. Annalen, 101
(1929), 126-135.




main results are stated in Theorems 1, 2, 3, 4, and 5 below. Among other
results, it follows from Theorem 3 that, for $k > 3$,

(0.6) $\varphi(n) = \left(\dfrac{n}{2 \log\log n}\right)^{1/2} \left(\log\log n + \tfrac{3}{4} \log_3 n + \tfrac{1}{2} \log_4 n + \cdots + \tfrac{1}{2} \log_{k-1} n + \left(\tfrac{1}{2} + \epsilon\right) \log_k n\right)$

belongs to the upper class if $\epsilon > 0$ and to the lower class if $\epsilon \le 0$.

Our proof is direct and elementary. We do not assume the result of A.
Khintchine and A. Kolmogoroff, and the paper can be read without knowledge
of any particular results concerning the law of the iterated logarithm. The only
facts we need are the notion of independence, and the well known inequality

(0.7) $c_1 \dfrac{n^{1/2}}{x} e^{-2x^2/n} < \Pr(A_n(x)) < c_2 \dfrac{n^{1/2}}{x} e^{-2x^2/n},$

where

(0.8) $A_n(x) = E[t: f_n(t) > x]$

means the set of all real numbers $t$ ($0 \le t \le 1$) satisfying $f_n(t) > x$, and $\Pr(A)$
means the ordinary Lebesgue measure of a measurable set $A$ in the interval
$0 \le t \le 1$. $c_i$ ($i = 1, 2, \cdots$) will denote positive constants.
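Inequality (0.7) can be probed numerically, since Pr(A_n(x)) is an exact binomial tail. The Python sketch below computes that tail with integer arithmetic and divides it by the comparison quantity n^(1/2)/x * exp(-2x^2/n). The sample values of n and x (two and three standard deviations, the standard deviation of f_n being n^(1/2)/2) are hypothetical choices, and the sketch says nothing about the range of x for which (0.7) is meant to hold.

```python
import math


def tail_probability(n, x):
    """Exact Pr(f_n > x), i.e. Pr(S - n/2 > x) for S distributed Binomial(n, 1/2)."""
    k_min = math.floor(n / 2 + x) + 1  # smallest integer k with k - n/2 > x
    count = sum(math.comb(n, k) for k in range(k_min, n + 1))
    return count / 2 ** n


def comparison(n, x):
    """The quantity n^(1/2)/x * exp(-2 x^2 / n) appearing on both sides of (0.7)."""
    return math.sqrt(n) / x * math.exp(-2 * x * x / n)


ratio_small = tail_probability(400, 20) / comparison(400, 20)
ratio_large = tail_probability(10000, 150) / comparison(10000, 150)
print(ratio_small, ratio_large)
```

For these two samples the ratio stays between fixed positive bounds, which is the behaviour the constants c_1 and c_2 in (0.7) express.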

Throughout the present paper, the sequence $\{m_n\}$ ($n = 1, 2, \cdots$) defined
by $m_1 = 1$ and

(0.9) $m_n = [e^{n/\log n}], \quad n = 2, 3, \cdots,$

will play a fundamental rôle. The fact that we adopt the sequence $\{m_n\}$
($n = 1, 2, \cdots$) instead of $\{a^n\}$ ($n = 1, 2, \cdots$), which was used by A. Khintchine
and A. Kolmogoroff, is essential in our proof, and will enable us to obtain
our sharper results. The following inequalities, which are easy to prove,
will be used very often:

(0.10) $m_n < m_{n+1} < c_3 m_n,$

(0.11) $c_4 \dfrac{m_n}{\log\log m_n} < m_{n+1} - m_n < c_5 \dfrac{m_n}{\log\log m_n},$

(0.12) $c_6 \left(\dfrac{m_n}{\log\log m_n}\right)^{1/2} < (m_{n+1} \log\log m_{n+1})^{1/2} - (m_n \log\log m_n)^{1/2} < c_7 \left(\dfrac{m_n}{\log\log m_n}\right)^{1/2}.$

It is not difficult to extend our results to the case in which the parameter $n$
is continuous, i.e. the case of Brownian motion.$^{4}$ We can define the upper and


$^{4}$ Cf. A. Khintchine, loc. cit. 2. Cf. also N. Wiener, Differential space, Journal of Math.
and Phys. 2 (1923), 131-174, and the book of P. Lévy quoted in footnote 5.





the lower classes in this case, and can obtain the corresponding results. It was
stated by P. Lévy$^{5}$ that A. Kolmogoroff has proved the following result: Let
$\psi(\lambda) = \varphi(\lambda)/\lambda^{1/2}$ be monotone increasing. Then a necessary and sufficient
condition that $\varphi(\lambda)$ belong to the lower class is given by the divergence of the
integral

(0.13) $\int^{\infty} \dfrac{1}{\lambda}\,\psi(\lambda)\,e^{-\frac{1}{2}(\psi(\lambda))^2}\,d\lambda.$

It is easy to see that this is equivalent to Theorem 4. As far as I know, the 
proof of A. Kolmogoroff has not been published. Recently, J. Ville 6 proved 
that the divergence of (0.13) is necessary. This corresponds to a special case 
of Theorem 1, but his proof is entirely different from ours. 
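The sequence (0.9) and the spacing estimates (0.10), (0.11) that the proofs rely on can be inspected numerically. In the Python sketch below, `m` implements (0.9) with floating-point exponentials, and `gap_ratio` evaluates (m_{n+1} - m_n) log log m_n / m_n, which (0.11) asserts to lie between two positive constants. The function names and the range of n examined are assumptions of this illustration, and small n are skipped because the first few values of (0.9) are not yet monotone.

```python
import math


def m(n):
    """m_n from (0.9): m_1 = 1 and m_n = [exp(n / log n)] for n >= 2."""
    if n == 1:
        return 1
    return int(math.exp(n / math.log(n)))


def gap_ratio(n):
    """(m_{n+1} - m_n) * log log m_n / m_n, the quantity bounded in (0.11)."""
    m_n, m_next = m(n), m(n + 1)
    return (m_next - m_n) / m_n * math.log(math.log(m_n))


ratios = [gap_ratio(n) for n in range(20, 1000)]
print(min(ratios), max(ratios))
```

The gaps m_{n+1} - m_n are thus of order m_n / log log m_n, much finer than the gaps of a geometric sequence a^n, which are of order m_n.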

1 

Theorem 1. $\varphi(n)$ belongs to the upper class if it is monotone increasing and if

(1.1) $\sum_{n=1}^{\infty} \Pr(A_{m_n}(\varphi(m_n))) < \infty.$

Proof. First we remark that we may assume that

(1.2) $\varphi(n) \le (n \log\log n)^{1/2}$

for sufficiently large $n$. Indeed, otherwise we may consider $\varphi_1(n) =
\min(\varphi(n), (n \log\log n)^{1/2})$ instead of $\varphi(n)$. It is clear that $\varphi_1(n)$ is monotone
increasing, that $\varphi_1(n)$ satisfies (1.1) if $\varphi(n)$ does (because, by (0.7), $\varphi_0(n) =
(n \log\log n)^{1/2}$ satisfies (1.1)); and that if $\varphi_1(n)$ belongs to the upper class so does
$\varphi(n)$ too.

Next we notice that, under the assumption (1.2), we have

(1.3) $\Pr(A_{m_{n+1}}(\varphi(m_n))) < c_8 \Pr(A_{m_n}(\varphi(m_n))).$

This is an easy consequence of the relations (0.7), (0.10) and (0.11). We omit
the proof.

Now assume that Theorem 1 is not true. Then there exists a constant $c_9 > 0$
such that, for any $M_0 = m_{n_0}$, there exists an $N_0 = m_{n_0'}$ ($n_0' > n_0$) such that

(1.4) $\Pr\left(\sum_{M_0 < u \le N_0} A_u(\varphi(u))\right) > c_9 > 0.$

Let us put

(1.5) $B(u) = A_u(\varphi(u)) - A_u(\varphi(u)) \cdot \sum_{M_0 < v < u} A_v(\varphi(v)) = E[t: f_u(t) > \varphi(u);\ f_v(t) \le \varphi(v),\ M_0 < v < u].$

$^{5}$ P. Lévy, Théorie de l'addition des variables aléatoires, Paris, 1937.

$^{6}$ J. Ville, Étude critique de la notion de collectif, Paris, 1937.





Then $\{B(u)\}$ ($M_0 < u \le N_0$) are mutually disjoint, and

(1.6) $\sum_{M_0 < u \le N_0} B(u) = \sum_{M_0 < u \le N_0} A_u(\varphi(u)).$

For each $u$ ($M_0 < u \le N_0$) take an $n$ ($n_0 \le n < n_0'$) such that $m_n < u \le m_{n+1}$,
and put

(1.7) $A^{+}_{u,m_{n+1}} = E[t: f_{m_{n+1}}(t) - f_u(t) \ge 0].$

Then it is clear that $B(u)$ and $A^{+}_{u,m_{n+1}}$ are independent, and hence

(1.8) $\Pr(B(u) \cdot A^{+}_{u,m_{n+1}}) = \Pr(B(u)) \Pr(A^{+}_{u,m_{n+1}}) \ge \tfrac{1}{2} \Pr(B(u)).$

On the other hand, since $t \in B(u) \cdot A^{+}_{u,m_{n+1}}$ implies $f_{m_{n+1}}(t) \ge f_u(t) > \varphi(u) \ge
\varphi(m_n)$, we have

(1.9) $B(u) \cdot A^{+}_{u,m_{n+1}} \subset A_{m_{n+1}}(\varphi(m_n))$

for $m_n < u \le m_{n+1}$. Hence, since $\{B(u) \cdot A^{+}_{u,m_{n+1}}\}$ ($M_0 < u \le N_0$) are mutually
disjoint, we have, by (1.3), (1.8), (1.6) and (1.4),

(1.10) $c_8 \sum_{M_0 \le m_n < N_0} \Pr(A_{m_n}(\varphi(m_n))) \ge \sum_{M_0 \le m_n < N_0} \Pr(A_{m_{n+1}}(\varphi(m_n)))$

$\ge \sum_{M_0 < u \le N_0} \Pr(B(u) \cdot A^{+}_{u,m_{n+1}}) \ge \tfrac{1}{2} \sum_{M_0 < u \le N_0} \Pr(B(u))$

$= \tfrac{1}{2} \Pr\left(\sum_{M_0 < u \le N_0} B(u)\right) = \tfrac{1}{2} \Pr\left(\sum_{M_0 < u \le N_0} A_u(\varphi(u))\right) > \dfrac{c_9}{2} > 0.$

Since $c_8$ and $c_9$ are positive constants, and since $M_0 = m_{n_0}$ can be arbitrarily
large, this contradicts the assumption (1.1). This proves Theorem 1.

Corollary 1. $\varphi(n) = (1/2^{1/2} + \epsilon)(n \log\log n)^{1/2}$ belongs to the upper class
for $\epsilon > 0$.

Corollary 2. The expression (0.6) belongs to the upper class for $\epsilon > 0$.
Proof. Follows immediately from Theorem 1 and (0.7).
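To see how Corollary 1 follows from Theorem 1 and (0.7), one can insert x = c (m log log m)^(1/2) with c = 1/2^(1/2) + epsilon into the upper bound of (0.7): the bound becomes (log m)^(-2c^2) / (c (log log m)^(1/2)), and the exponent 2c^2 exceeds 1 exactly when epsilon > 0. The Python sketch below evaluates this bound along m = m_n. It sets the constant c_2 to 1 and replaces log m_n by n / log n, ignoring the integer part in (0.9); both simplifications, and the function names, are assumptions of this illustration.

```python
import math


def bound_term(n, eps):
    """Upper bound of (0.7) at x = c (m_n log log m_n)^(1/2), with c_2 taken as 1."""
    log_m = n / math.log(n)          # log m_n, ignoring the integer part in (0.9)
    loglog_m = math.log(log_m)
    c = 1 / math.sqrt(2) + eps
    # n^(1/2)/x = 1 / (c (log log m)^(1/2)) and 2 x^2 / n = 2 c^2 log log m
    return math.exp(-2 * c * c * loglog_m) / (c * math.sqrt(loglog_m))


def block_sum(eps, start, stop):
    """Sum of the bound over start <= n < stop."""
    return sum(bound_term(n, eps) for n in range(start, stop))


print(block_sum(0.5, 1000, 2000), block_sum(0.0, 1000, 2000))
```

With epsilon = 0.5 the block of terms from n = 1000 to 2000 contributes almost nothing, in line with the convergence required in (1.1). With epsilon = 0 the same block of bounds is not small; this alone decides nothing about epsilon = 0, since it concerns an upper bound only.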


Theorem 2. If $\varphi(n)$ is monotone increasing, then a necessary and sufficient
condition that $\varphi(n)$ belong to the lower class is that, for almost all $t$, there exist
infinitely many $n$ such that

(2.1) $f_{m_n}(t) > \varphi(m_n).$

Proof. The sufficiency is obvious. In order to prove the necessity, let us
assume that $\varphi(n)$ belongs to the lower class. First we remark that we may
assume

(2.2) $\varphi(n) \le (n \log\log n)^{1/2}$

for sufficiently large $n$. Indeed, by Corollary 1 to Theorem 1, $\varphi_0(n) = (n \log\log n)^{1/2}$
belongs to the upper class. Hence, if we put $\varphi_1(n) = \min(\varphi(n), \varphi_0(n))$, then





$\varphi_1(n)$ belongs to the lower class if $\varphi(n)$ does; and if the necessity of the condition
is proved for $\varphi_1(n)$, then it is obviously true for $\varphi(n)$ too.

By assumption, there exists a constant $c_{10} > 0$ such that for any $M_0 = m_{n_0}$
there exists an $N_0 = m_{n_0'}$ ($n_0' > n_0$) such that

(2.3) $\Pr\left(\sum_{M_0 < u \le N_0} A_u(\varphi(u))\right) > c_{10}.$

Let us put

(2.4) $C(u) = A_u(\varphi(u)) - A_u(\varphi(u)) \cdot \sum_{u < v \le N_0} A_v(\varphi(v)) = E[t: f_u(t) > \varphi(u);\ f_v(t) \le \varphi(v),\ u < v \le N_0].$

Then $\{C(u)\}$ ($M_0 < u \le N_0$) are mutually disjoint, and

(2.5) $\sum_{M_0 < u \le N_0} C(u) = \sum_{M_0 < u \le N_0} A_u(\varphi(u)).$

For each $u$ ($M_0 < u \le N_0$) take an $n$ ($n_0 \le n < n_0'$) such that $m_n < u \le m_{n+1}$
and put

(2.6) $A^{-}_{m_n,u} = E[t: f_u(t) - f_{m_n}(t) \le 0].$

It is to be noticed that $C(u)$ and $A^{-}_{m_n,u}$ are not independent, but it can be shown
by computations$^{7}$ that there exists a constant $c_{11} > 0$ such that

(2.7) $\Pr(C(u) \cdot A^{-}_{m_n,u}) > c_{11} \Pr(C(u)).$

$^{7}$ We sketch the proof of (2.7): Let us put

$C(u, k) = E[t: f_u(t) = k;\ f_v(t) \le \varphi(v),\ u < v \le N_0],$

where $k > \varphi(u)$ is an integer or integer $+ \tfrac{1}{2}$ according as $u$ is even or odd. Then a simple
calculation with binomial coefficients shows that 

Pr( £ C(u,k)) > c t ,Pr(C(u)). 

*(u) <k£ <p(u) + u/p(u) 

Thus it suffices to show that, for <p(u) < k ^ <p(u) + u/<p(u), 

Pr(C(u, k) • Am n ,u) > c i7 Pr(C(u , k)). 

Now, it is easy to see that 

_ Pr(C{u,k)) . ^ I U \ /( 

Pr(C(u, k)-Am n ,u) Ca \l + k ) / 11 + kj’ 

and a simple calculation shows that 



which completes the proof of (2.7). 





On the other hand, since $t \in C(u) \cdot A^{-}_{m_n,u}$ implies $f_{m_n}(t) \ge f_u(t) > \varphi(u) \ge \varphi(m_n)$,
we have

(2.8) $C(u) \cdot A^{-}_{m_n,u} \subset A_{m_n}(\varphi(m_n))$

for $m_n < u \le m_{n+1}$. Hence, since $\{C(u) \cdot A^{-}_{m_n,u}\}$ ($M_0 < u \le N_0$) are mutually
disjoint, we have, by (2.7) and (2.5),

(2.9) $\Pr\left(\sum_{M_0 \le m_n < N_0} A_{m_n}(\varphi(m_n))\right) \ge \sum_{M_0 < u \le N_0} \Pr(C(u) \cdot A^{-}_{m_n,u})$

$\ge c_{11} \sum_{M_0 < u \le N_0} \Pr(C(u)) = c_{11} \Pr\left(\sum_{M_0 < u \le N_0} C(u)\right)$

$= c_{11} \Pr\left(\sum_{M_0 < u \le N_0} A_u(\varphi(u))\right) > c_{10} \cdot c_{11} > 0.$

Since $c_{10}$ and $c_{11}$ are absolute positive constants, and since $M_0 = m_{n_0}$ can be
taken arbitrarily large, this means that the set of all $t$ for which the inequality
(2.1) holds for infinitely many $n$ has positive measure. By the law of 0 or 1,
this set must have measure 1, and thus Theorem 2 is proved.

3 

Theorem 3. Let $\varphi(n)$ be monotone increasing and let us assume that

(3.1) $\varphi(m_{n+1}) - \varphi(m_n) > c_{12} (m_n/\log\log m_n)^{1/2}.$

Then a necessary and sufficient condition that $\varphi(n)$ belong to the lower class is that

(3.2) $\sum_{n=1}^{\infty} \Pr(A_{m_n}(\varphi(m_n))) = \infty.$

Proof. The necessity follows from Theorem 1, without assuming (3.1).
In order to prove that the condition (3.2) is sufficient, let us assume that $\varphi(n)$
is monotone increasing and satisfies (3.1) and (3.2). We first notice that
(3.1) and (0.12) imply

(3.3) $\varphi(m_{n+1}) - \varphi(m_n) > c_{13} ((m_{n+1} \log\log m_{n+1})^{1/2} - (m_n \log\log m_n)^{1/2}),$

and hence

(3.4) $\varphi(m_n) > c_{14} (m_n \log\log m_n)^{1/2}.$

From (3.4) and (0.7) it follows easily that

(3.5) $\lim_{n \to \infty} \Pr(A_{m_n}(\varphi(m_n))) = 0.$

Next we notice that we may assume

(3.6) $\varphi(n) \le (n \log\log n)^{1/2}$

for sufficiently large $n$. Indeed, otherwise we may consider $\varphi_1(n) =
\min(\varphi(n), (n \log\log n)^{1/2})$ instead of $\varphi(n)$. Since $\varphi_0(n) = (n \log\log n)^{1/2}$ clearly
satisfies (3.1), $\varphi_1(n)$ satisfies it too. Further, it is obvious that (3.2) is satisfied
by $\varphi_1(n)$ whenever it is satisfied by $\varphi(n)$. Moreover, since $\varphi_0(n)$ belongs to the





upper class, by the corollary to Theorem 1, $\varphi_1(n)$ belongs to the lower class at
the same time as $\varphi(n)$.

Because of the law of 0 or 1, and because of Theorem 2, it is sufficient to prove
that there exists a constant $c_{15} > 0$ such that there exists, for any $M_0 = m_{n_0}$,
an $N_0 = m_{n_0'}$ ($n_0' > n_0$) such that

(3.7) $\Pr\left(\sum_{M_0 < m_n \le N_0} A_{m_n}(\varphi(m_n))\right) > c_{15}.$

Let $\delta > 0$ be a small positive number, which we shall determine later. Then,
by (3.2) and (3.4), there exists an $N$ such that, for any $M_0 = m_{n_0} > N$, an
$N_0 = m_{n_0'}$ ($n_0' > n_0$) exists such that

(3.8) $\delta < \sum_{M_0 < m_n \le N_0} \Pr(A_{m_n}(\varphi(m_n))) < 2\delta.$

We shall prove that if $\delta$ is chosen sufficiently small (but fixed), then (3.7) is
satisfied, with the same integers $M_0$ and $N_0$ as in (3.8), by a suitable positive
constant $c_{15} > 0$.

In order to prove this, let us first put

(3.9) $D(m_n) = A_{m_n}(\varphi(m_n)) - A_{m_n}(\varphi(m_n)) \cdot \sum_{m_n < m_{n+r} \le N_0} A_{m_{n+r}}(\varphi(m_{n+r})) = E[t: f_{m_n}(t) > \varphi(m_n);\ f_{m_{n+r}}(t) \le \varphi(m_{n+r}),\ m_n < m_{n+r} \le N_0].$

Then $\{D(m_n)\}$ ($M_0 < m_n \le N_0$) are mutually disjoint, and

(3.10) $\sum_{M_0 < m_n \le N_0} D(m_n) = \sum_{M_0 < m_n \le N_0} A_{m_n}(\varphi(m_n)).$

Let us further put

(3.11) $D_1(m_n) = E\left[t: \varphi(m_n) < f_{m_n}(t) \le \varphi(m_n) + \dfrac{c_{12}}{2} \left(\dfrac{m_n}{\log\log m_n}\right)^{1/2}\right].$

Then a simple computation will show that$^{8}$


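$^{8}$ We have clearly

$\dfrac{\Pr(D_1(m_n))}{\Pr(A_{m_n}(\varphi(m_n)))} = \sum{}' \binom{m_n}{u} \Big/ \sum_{v > \varphi(m_n)} \binom{m_n}{u},$

where the dash indicates that $u$ runs only over the interval

$\varphi(m_n) < v \le \varphi(m_n) + \dfrac{c_{12}}{2} \left(\dfrac{m_n}{\log\log m_n}\right)^{1/2}.$

A simple calculation shows that

$\sum{}' \binom{m_n}{u} \Big/ \sum_{v > \varphi(m_n)} \binom{m_n}{u} > c_{16},$

which proves (3.12).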





(3.12) $\Pr(D_1(m_n)) > c_{16} \Pr(A_{m_n}(\varphi(m_n))).$

Let us put

(3.13) $D_2(m_n) = D_1(m_n) \cdot E[t: f_{m_{n+r}}(t) - f_{m_{n+r-1}}(t) \le 0,\ r = 1, 2, \ldots, h],$

where $h$ is a positive integer which we shall determine later. Then it is easy to
see that

(3.14) $\Pr(D_2(m_n)) \ge 2^{-h} \Pr(D_1(m_n)),$

and that $t \in D_2(m_n)$ implies

(3.15) $f_{m_{n+r}}(t) \le f_{m_n}(t) \le \varphi(m_{n+r}),$

for $r = 1, 2, \cdots, h$. Let us further put

(3.16) $D_3(m_n) = D_2(m_n) \cdot E[t: f_{m_{n+r}}(t) \le \varphi(m_{n+r}),\ m_{n+h} < m_{n+r} \le N_0].$

Then it is clear that $D_3(m_n) \subset D(m_n) \subset A_{m_n}(\varphi(m_n))$. In order to complete the
proof of Theorem 3, it is sufficient to prove that, if $\delta$ is chosen sufficiently small
and if $h$ is chosen sufficiently large (but both fixed), then there exists a constant
$c_{17} > 0$ such that

(3.17) $\Pr(D_3(m_n)) > c_{17} \Pr(A_{m_n}(\varphi(m_n))).$

Indeed, (3.17) will imply

(3.18) $\Pr\left(\sum_{M_0 < m_n \le N_0} A_{m_n}(\varphi(m_n))\right) = \Pr\left(\sum_{M_0 < m_n \le N_0} D(m_n)\right)$

$= \sum_{M_0 < m_n \le N_0} \Pr(D(m_n)) \ge \sum_{M_0 < m_n \le N_0} \Pr(D_3(m_n))$

$> c_{17} \sum_{M_0 < m_n \le N_0} \Pr(A_{m_n}(\varphi(m_n))) > c_{17} \cdot \delta,$

which means that (3.7) is satisfied by $c_{15} = c_{17} \cdot \delta > 0$, thus completing the proof
of Theorem 3.

The rest of the proof of Theorem 3 is devoted to establishing the relation
(3.17). For this purpose, put

(3.19) $D_{3,r}(m_n) = D_2(m_n) \cdot A_{m_{n+r}}(\varphi(m_{n+r})),$

for all integers $r$ such that $m_{n+h} < m_{n+r} \le N_0$. It is easy to see that

(3.20) $D_2(m_n) \subset D_3(m_n) + \sum_{m_{n+h} < m_{n+r} \le N_0} D_{3,r}(m_n).$

We shall evaluate $\Pr\left(\sum_{m_{n+h} < m_{n+r} \le N_0} D_{3,r}(m_n)\right)$ by decomposing the sum into
three parts: $\sum_{m_{n+h} < m_{n+r} \le 2m_n}$, $\sum_{2m_n < m_{n+r} \le m_n \log m_n}$, $\sum_{m_n \log m_n < m_{n+r} \le N_0}$.

In the first place, $t \in D_{3,r}(m_n)$ implies





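(3.21) $f_{m_{n+r}}(t) - f_{m_{n+h}}(t) \ge f_{m_{n+r}}(t) - f_{m_n}(t)$

$> \varphi(m_{n+r}) - \varphi(m_n) - \dfrac{c_{12}}{2} \left(\dfrac{m_n}{\log\log m_n}\right)^{1/2}$

$> c_{12} \sum_{k=0}^{r-1} \left(\dfrac{m_{n+k}}{\log\log m_{n+k}}\right)^{1/2} - \dfrac{c_{12}}{2} \left(\dfrac{m_n}{\log\log m_n}\right)^{1/2}$

$> \dfrac{c_{12}\, r}{2} \left(\dfrac{m_n}{\log\log m_n}\right)^{1/2}.$

Hence

(3.22) $\Pr(D_{3,r}(m_n)) \le \alpha_r \cdot \Pr(D_2(m_n)),$

where

(3.23) $\alpha_r = \Pr\left(E\left[t: f_{m_{n+r}}(t) - f_{m_{n+h}}(t) > \dfrac{c_{12}\, r}{2} \left(\dfrac{m_n}{\log\log m_n}\right)^{1/2}\right]\right)$

$= \Pr\left(A_{m_{n+r} - m_{n+h}}\left(\dfrac{c_{12}\, r}{2} \left(\dfrac{m_n}{\log\log m_n}\right)^{1/2}\right)\right)$

$< c_2 \dfrac{(m_{n+r} - m_{n+h})^{1/2}}{\dfrac{c_{12}\, r}{2} \left(\dfrac{m_n}{\log\log m_n}\right)^{1/2}} \cdot \exp\left[-\dfrac{2 \left(\dfrac{c_{12}\, r}{2}\right)^2 \dfrac{m_n}{\log\log m_n}}{m_{n+r} - m_{n+h}}\right].$

Since, on the other hand, $m_{n+h} < m_{n+r} \le 2m_n$ implies

(3.24) $m_{n+r} - m_{n+h} \le \sum_{k=0}^{r-1} (m_{n+k+1} - m_{n+k}) < c_5\, r\, \dfrac{m_{n+r}}{\log\log m_{n+r}} < 2c_5\, r\, \dfrac{m_n}{\log\log m_n},$

we have, by (0.7),

(3.25) $\alpha_r < c_{18}\, e^{-c_{19} r}$

for $m_{n+h} < m_{n+r} \le 2m_n$. Consequently,

(3.26) $\Pr\left(\sum_{m_{n+h} < m_{n+r} \le 2m_n} D_{3,r}(m_n)\right) < c_{18} \cdot \Pr(D_2(m_n)) \cdot \sum_{r=h+1}^{\infty} e^{-c_{19} r}.$

Secondly, $t \in D_{3,r}(m_n)$ and $2m_n < m_{n+r} \le m_n \log m_n$ imply

(3.27) $f_{m_{n+r}}(t) - f_{m_{n+h}}(t) > \dfrac{c_{12}}{2} \sum_{k=0}^{r-1} \left(\dfrac{m_{n+k}}{\log\log m_{n+k}}\right)^{1/2}$

$> \dfrac{c_{12}}{2c_7} \sum_{k=0}^{r-1} \left((m_{n+k+1} \log\log m_{n+k+1})^{1/2} - (m_{n+k} \log\log m_{n+k})^{1/2}\right)$

$> c_{20} (m_{n+r} \log\log m_{n+r})^{1/2}.$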





Hence 

(3.28) Pr(Dz,r(m n )) < /3 r -Pr(D 2 (m n )) 

for 2m n < m n+r ^ m n log m n , where 

P r = Pr(E[t:f mn+r (t) - U n+h (t) > c 20 (m„ +r log log m^]) 

= PrU m „ +r _ mn+A (c 2 o(Wn + r log log fflnt,) 1 )) 

(m^r - mn+h^ _j~_2r 20 m n+r log lo g m n+r "| 


(3.29) 


< c 2 


cio(m n +r log log m n+r y 


exp 


1Yl n -\-r W-n-HA J 


< c n e~ c " lo “ loem " +r < 


C 2 1 


(log m,) c,! ' 

On the other hand, the number of m n+r ’s satisfying 2m„ < m„ +r S m n log m„ 
does not exceed c 28 (log log m,) 1 .' Hence we have 

(3.30) Pr( E A>,(m„)) < Pr(Z> 2 (m n ))-c 2< ^. 

2m n <m n+r ^ m n log m n (log m n ) 21 

Lastly, $t \in D_{3,r}(m_n)$ and $m_n \log m_n < m_{n+r} \le N_0$ imply

(3.31) /mn+rW “ /mn+ ‘ (<) > ^ (m, ‘ +r) ~ ~ 2 (log log m n ) 

> <p(m n+r ) - 2(m„ log log »»«)*. 

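Hence

(3.32)    $\Pr(D_{3,r}(m_n)) < \gamma_r \cdot \Pr(D_2(m_n))$

for $m_n \log m_n < m_{n+r} \le N_0$, where

(3.33)
$\gamma_r = \Pr\bigl(E[t: f_{m_{n+r}}(t) - f_{m_{n+h}}(t) > \varphi(m_{n+r}) - 2(m_n \log\log m_n)^{1/2}]\bigr)$
$\quad = \Pr\bigl(A_{m_{n+r}-m_{n+h}}(\varphi(m_{n+r}) - 2(m_n \log\log m_n)^{1/2})\bigr)$
$\quad < c_2 \dfrac{(m_{n+r}-m_{n+h})^{1/2}}{\varphi(m_{n+r}) - 2(m_n \log\log m_n)^{1/2}} \cdot \exp\Bigl[-\dfrac{2(\varphi(m_{n+r}) - 2(m_n \log\log m_n)^{1/2})^2}{m_{n+r}-m_{n+h}}\Bigr]$
$\quad < c_2 \dfrac{(m_{n+r})^{1/2}}{\varphi(m_{n+r}) - 2(m_n \log\log m_n)^{1/2}} \cdot \exp\Bigl[\dfrac{-2(\varphi(m_{n+r}))^2 + 8\varphi(m_{n+r})(m_n \log\log m_n)^{1/2}}{m_{n+r}}\Bigr].$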


9 It follows from (0.11) that the number of $m_n$'s in the interval $(x, 2x)$ does not exceed $c_{50} \log\log x$. Thus the number of $m_n$'s in the interval $(x, x \log x)$ does not exceed

$c_{50} \log\log x \cdot \dfrac{\log\log x}{\log 2} < c_{23}(\log\log x)^2.$





On the other hand, for sufficiently large $n$, $m_{n+r} > m_n \log m_n$ implies

(3.34)    $\varphi(m_{n+r}) - 2(m_n \log\log m_n)^{1/2} > \tfrac{1}{2}\varphi(m_{n+r}),$

(3.35)    $\varphi(m_{n+r})(m_n \log\log m_n)^{1/2} < m_{n+r}.$

Hence

(3.36)    $\gamma_r < c_{25} \dfrac{(m_{n+r})^{1/2}}{\varphi(m_{n+r})} \exp\Bigl[-\dfrac{2(\varphi(m_{n+r}))^2}{m_{n+r}}\Bigr] < c_{26} \Pr(A_{m_{n+r}}(\varphi(m_{n+r}))).$

Consequently,

(3.37)    $\Pr\bigl(\sum_{m_n \log m_n < m_{n+r} \le N_0} D_{3,r}(m_n)\bigr) < \Pr(D_2(m_n)) \cdot c_{26} \sum_{M_0 < m_{n+r} \le N_0} \Pr(A_{m_{n+r}}(\varphi(m_{n+r}))) < c_{26} \cdot 2\delta \cdot \Pr(D_2(m_n)).$


Combining (3.26), (3.30) and (3.37), we have finally

(3.38)    $\Pr\bigl(\sum_{m_{n+h} < m_{n+r} \le N_0} D_{3,r}(m_n)\bigr) < \Pr(D_2(m_n)) \cdot \Bigl\{c_{18} \sum_{r=h+1}^{\infty} e^{-c_{19} r} + c_{24} \dfrac{(\log\log m_n)^2}{(\log m_n)^{c_{22}}} + c_{26} \cdot 2\delta\Bigr\}.$

Hence, if we take $h$ sufficiently large and $\delta$ sufficiently small, then we have

(3.39)    $\Pr\bigl(\sum_{m_{n+h} < m_{n+r} \le N_0} D_{3,r}(m_n)\bigr) < \theta \cdot \Pr(D_2(m_n)),$

where $\theta$ is a constant with $0 < \theta < 1$. Consequently, by (3.20),


Pr(D 3 (m n )) > (1 — 0) • Pr(D 2 (m n )) 

(3.40) > 2~*(1 - 0) • Pr(Di(m n )) > (1 - 0) -CnPriAm^mn) (by 3.12) 

> CtfPr{A mJ^W)), 

which proves (3.17). The proof of Theorem 3 is completed. 

Corollary 1. $\varphi(n) = (1/\sqrt{2} + \epsilon)(n \log\log n)^{1/2}$ belongs to the lower class for $\epsilon \le 0$.

Corollary 2. The expression (0.6) belongs to the lower class for c ^ 0. 
Proof. Follows immediately from Theorem 3 and (0.7). 


4 

Theorem 4. Let $\varphi(n)/n^{1/2}$ be monotone increasing. Then a necessary and sufficient condition that $\varphi(n)$ belong to the lower class is that

(4.1)    $\sum_{n=1}^{\infty} \Pr(A_{m_n}(\varphi(m_n))) = \infty.$

We need the following





Lemma 1. Let $M_1 < N_1 < M_2 < N_2 < \cdots < M_i < N_i < \cdots$ be a sequence of positive integers tending to infinity, and let $\varphi(n)$ be such that

(4.2)    $\varphi(m_{n+1}) - \varphi(m_n) > c_{28}\Bigl(\dfrac{m_n}{\log\log m_n}\Bigr)^{1/2}, \qquad M_i < m_n < m_{n+1} \le N_i,$

(4.3)    $\varphi(m_n) > c_{29}(m_n \log\log m_n)^{1/2}, \qquad M_i < m_n < m_{n+1} \le N_i,$

(4.4)    $\sum_{M_i < m_n \le N_i} \Pr(A_{m_n}(\varphi(m_n))) > c_{30}.$

Then $\varphi(n)$ belongs to the lower class.

We do not give the proof of Lemma 1, since it can be carried out in the same 
way as in Theorem 3. 

Proof of Theorem 4. The necessity of the condition is clear by Theorem 1. In order to prove that it is sufficient, let us assume that $\varphi(n)/n^{1/2}$ is monotone increasing and that (4.1) is satisfied. We shall prove that there exists a sequence of integers $M_1 < N_1 < M_2 < N_2 < \cdots < M_i < N_i < \cdots$ tending to infinity, which satisfies the conditions of Lemma 1.

If we have 


(4.5) <p(m n ) < ^(m n log log m n )* 

for all sufficiently large n, then the fact that win) = ^(n log log n)* belongs to 
the lower class (see Corollary 1 to Theorem 3), together with Theorem 2, will 
imply that <p{ri) belongs to the lower class. On the other hand, if 

(4.6) <p{m n ) > \{m n log log ra n )* 


for sufficiently large n, then 

(4.7) 1) ^ <p(m n ) > (l + .—-)^W 

\m n / \ log log mj 

by (0.11), and hence 


(4.8) 


<p(m n +i) 


<p(m n ) > c 5 i 


<p(mn) 

log log m n 



Consequently, by Theorem 3, $\varphi(n)$ must belong to the lower class again.

Thus, in order to prove Theorem 4, we have only to consider the case when there exist two sequences of integers tending to infinity $\{M_i\} = \{m_{n_i}\}$ $(i = 1, 2, \cdots)$ and $\{N_i\} = \{m_{n_i'}\}$ $(i = 1, 2, \cdots)$ such that $M_1 < N_1 < M_2 < N_2 < \cdots < M_i < N_i < \cdots$, and

(4.9) <p(Mi) = <p(m ni ) ^ T*(Mi log log M x )\ 

(4.10) <p(Ni) = <p(m n( ) g &(Ni log log Ni)*. 


We may assume that

(4.11)    $\varphi(m_n) < \tfrac{1}{10}(m_n \log\log m_n)^{1/2}$

for $M_i < m_n \le N_i$ (i.e. for $n_i < n \le n_i'$) $(i = 1, 2, \cdots)$.





We shall prove that the conditions of Lemma 1 are all satisfied by these {M*} 
(i = 1, 2, • • • ) and {JV<} (i = 1, 2, • • • )• Since <p(Mi)/{M^ g v {Ni)/N\ by 
assumption, we have ^(log log Af,) 1 g ^(log log N$ for i = 1, 2 , • • • . 
Since Af<, Ni —> «> as i —> oo, it follows that we have M\ jV< for sufficiently 
large 

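Let now $M_i < m_n < N_i$. Then

(4.12)
$\Pr(A_{m_n}(\varphi(m_n))) > c_1 \dfrac{m_n^{1/2}}{\varphi(m_n)} e^{-2(\varphi(m_n))^2/m_n}$
$\quad > c_1 \dfrac{10}{(\log\log m_n)^{1/2}} e^{-(\log\log m_n)/50} = c_1 \dfrac{10}{(\log\log m_n)^{1/2}(\log m_n)^{1/50}} > \dfrac{1}{(\log m_n)^{1/49}}$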


for sufficiently large $i$. Since $2 \log M_i \cdot \log\log M_i < n < 3 \log M_i \cdot \log\log M_i$ implies $\log M_i < n/\log n < 4 \log M_i$, or equivalently $M_i < e^{n/\log n} < M_i^4$, for sufficiently large $i$, we have


(4.13) 


E Pr(A mn (<p(m„))) > 

M i < ^ IV » 


E 


_ 1 _ 

(log m „) 1/49 


> 


E 


Afi<m n 5AfJ 


1 _ 

(log ra „) I/49 


> 


E 


2p»<M^3pi 


1 

n 1/49 ’ 


where = log iV^-log log Ni . Thus (4.4) is satisfied. (4.3) is clearly satisfied 
with C 29 = tjV; (4.8) shows that (4.2) is also satisfied. This completes the proof 
of Theorem 4. 


5

Theorem 5. Let $\varphi(n)$ satisfy

(5.1)    $\varphi(n) > c_{32}(n \log\log n)^{1/2},$

(5.2)    $\sum_{n=1}^{\infty} \Pr(A_{m_n}(\varphi(m_n))) = \infty.$

Then $\varphi(n)$ belongs to the lower class.

To prove Theorem 5 we need the following 

Lemma 2. Let $\varphi(n)$ be monotone increasing, and let $\{m_{n_i}\}$ $(i = 1, 2, \cdots)$ be a subsequence of $\{m_n\}$ $(n = 1, 2, \cdots)$ such that

(5.3)    $\varphi(m_{n_{i+1}}) \ge \varphi(m_{n_i}) + c_{33}\bigl((m_{n_{i+1}} \log\log m_{n_{i+1}})^{1/2} - (m_{n_i} \log\log m_{n_i})^{1/2}\bigr),$

(5.4)    $\sum_{i=1}^{\infty} \Pr(A_{m_{n_i}}(\varphi(m_{n_i}))) = \infty.$

Then $\varphi(n)$ belongs to the lower class.

Since the proof of Lemma 2 can be carried out exactly as in the proof of 
Theorem 3, we omit the proof. 





Proof of Theorem 5. As in the proof of Theorem 3, we may assume that

(5.5)    $\varphi(n) \le (n \log\log n)^{1/2}$

for sufficiently large $n$. We shall find a subsequence $\{m_{n_i}\}$ $(i = 1, 2, \cdots)$ of $\{m_n\}$ $(n = 1, 2, \cdots)$ which satisfies the conditions of Lemma 2. For this purpose we classify the integers $m_n$ into two classes. The first class I consists of all integers $m_p$ for which

(5.6)    $\varphi(m_q) \ge \varphi(m_p) + \epsilon\bigl((m_q \log\log m_q)^{1/2} - (m_p \log\log m_p)^{1/2}\bigr)$

for all $q \ge p$, where $\epsilon$ is a positive constant with $0 < \epsilon < c_{32}$ which we shall determine later. All other integers $m_p$ will belong to the second class II. We shall prove that, if we denote by $\{m_{n_i}\}$ $(i = 1, 2, \cdots;\ m_{n_i} < m_{n_{i+1}})$ the integers of the class I, then this sequence satisfies the conditions of Lemma 2. Indeed, (5.3) is clear from (5.6). In order to prove (5.4) for the $m_{n_i}$'s of the class I, let us denote by $\mathrm{II}_i$ the set of all integers $m_p$ of the class II such that $m_p < m_{n_i}$ and

(5.7)    $\varphi(m_{n_i}) < \varphi(m_p) + \epsilon\bigl((m_{n_i} \log\log m_{n_i})^{1/2} - (m_p \log\log m_p)^{1/2}\bigr).$

By definition, for each $m_p$ of the class II, there exists an $m_q$ $(m_q > m_p)$ such that

(5.8)    $\varphi(m_q) < \varphi(m_p) + \epsilon\bigl((m_q \log\log m_q)^{1/2} - (m_p \log\log m_p)^{1/2}\bigr).$

Because of (5.1) and the relation $\epsilon < c_{32}$, there exists, for each $m_p$ of II, a largest integer $m_q$ $(m_q > m_p)$ satisfying (5.8). This $m_q$ clearly belongs to I. Hence we have $\sum_i \mathrm{II}_i = \mathrm{II}$ (the $\mathrm{II}_i$ are not necessarily mutually disjoint).

Thus in order to prove (5.4), we need only prove that there exists a constant $c_{34} > 0$ such that

(5.9)    $\sum_{m_p \in \mathrm{II}_i} \Pr(A_{m_p}(\varphi(m_p))) < c_{34} \Pr(A_{m_{n_i}}(\varphi(m_{n_i}))).$

For this purpose we shall first show that

(5.10)    $m_{n_i} < c_{35} m_p$

for all $m_p \in \mathrm{II}_i$, where $c_{35}$ is independent of $i$ and $p$. Indeed, if (5.10) is false, we have

(5.11)
$\varphi(m_{n_i}) < \varphi(m_p) + \epsilon\bigl((m_{n_i} \log\log m_{n_i})^{1/2} - (m_p \log\log m_p)^{1/2}\bigr)$
$\quad < (m_p \log\log m_p)^{1/2} + \epsilon(m_{n_i} \log\log m_{n_i})^{1/2}$
$\quad < \Bigl(\dfrac{1}{\sqrt{c_{35}}} + \epsilon\Bigr)(m_{n_i} \log\log m_{n_i})^{1/2},$

and this is a contradiction to (5.1) if $c_{35}$ is sufficiently large.





By (5.5), (5.7) and (5.10), if $m_p = m_{n_i - k} \in \mathrm{II}_i$, then we have


Pr(A mp (<p(m p ))) < c 2 


C2 


(log log ra p )* 

r 2{<p(m ni ) - €((m n< log log m n <)* - (m p log log m p ) 5 )) 2 " 


•exp 


mm 


(5.12) 


Cl 


(log log m p )t 

2 (<p(m ni )) 2 - 4e^(mn i )((m n< log log m ni )* ~ log log m^) 1 


•exp 




m n< - fc-c 6l —P- 

log log m p 


< 


C 36 


(log log wi„)* 


,. exp 


< cvPrUnMm^-r? 
M ni J * 


where 


j] = exp 


~( y(m n< )) a 

m ni 


hi 


Mwi.,)) 2 - 2&p(m„,)((wn,- log log w,,.) 1 - (m, log log m p ) 1 ) 

Win.- 


win, - teas 


lOg log Mnj 


< exp 


[ (■p(win< 
Win; 


)) 2 


(5.13) 


_ (»( *” »<))* ~ 2ty(win,.)((w?n ,. lOg lOg Win-)* - (/W p lpg lOgWp) 1 ) 

Win, 


•( 


1 + Jfc 


C 39 


log log ra» 


;)] 


exp ((wi B( log log win,) 1 - (wj„ log log r« p ) J ) 


< Jn,_-V _ k __s 

L Win,. Mog log Win,/ Win,. 

< exp [(2e - C 7 c 32 — c«'Cm)*A:]. 


tew(y>(wi B .)) 2 
Wl n . log lOg Wl ni 

c«>(^(wi„,)) 2 J 


log log Win,. 






Hence, if we take $\epsilon$ sufficiently small, then

(5.14)    $\eta < e^{-c_{42} k}$

with a positive constant $c_{42}$. Hence

(5.15)    $\sum_{m_p \in \mathrm{II}_i} \Pr(A_{m_p}(\varphi(m_p))) < c_{37} \Pr(A_{m_{n_i}}(\varphi(m_{n_i}))) \cdot \sum_{k=1}^{\infty} e^{-c_{42} k} < \dfrac{c_{37}}{1 - e^{-c_{42}}} \Pr(A_{m_{n_i}}(\varphi(m_{n_i}))),$

which proves (5.9). This completes the proof of Theorem 5.

Before concluding this chapter, let us add some more results without proof. 

1). If $\varphi(n)$ is monotone increasing and belongs to the lower class, then $\varphi(n) + c(n/\log\log n)^{1/2}$ belongs to the lower class for all $c$.

This result is the best possible. For, if $\psi(n) \to \infty$, then we can find a monotone function $\varphi(n)$ belonging to the lower class such that $\varphi(n) + \psi(n)(n/\log\log n)^{1/2}$ belongs to the upper class.

2). If $\varphi(n)$ is monotone increasing and belongs to the lower class, then $\varphi(n) + c(n/\varphi(n))$ belongs to the lower class for all $c$. Since we can always assume that $\varphi(n) < (n \log\log n)^{1/2}$, 2) is slightly stronger than 1).

3). Let $\varphi(n)$ be monotone increasing, and suppose that it belongs to the upper class. Then for almost all $t$, there exist only finitely many $n$ such that for some $m < n$, $|f_n(t) - f_m(t)| > \varphi(n)$.

4) . For almost all t , we have 


(5.16) 


lim sup 


nm 

k~ 1 _ 

1 /log log »Y 

2 n (-2—; 


= 1 . 


Professor J. L. Doob suggested that if $n_1 < n_2 < \cdots$ is a sequence of integers with $n_{i+1}/n_i > c > 1$, then for almost all $t$,

(5.17)    $\lim_{p \to \infty} \dfrac{1}{p} \sum_{i=1}^{p} \dfrac{f_{n_i}(t)}{n_i^{1/2}} = 0.$

Indeed, it is not difficult to show that (5.17) holds. In fact, the condition $n_{i+1}/n_i > c > 1$ can be weakened, but it is necessary that $n_i$ tends to infinity with a certain speed (quicker than $i$).

5). There exists a continuous strictly decreasing function $\psi(x)$ defined for $0 \le x \le 1$, with $\psi(0) = 1$, $\psi(1) = 0$, such that, for almost all $t$, the upper density of the set of $n$'s for which

(5.18)    $f_n(t) \ge x\Bigl(\dfrac{n \log\log n}{2}\Bigr)^{1/2}$

is exactly $\psi(x)$.¹⁰


10 This problem was suggested by W. Ambrose. 





6 

In this final chapter, we shall construct an increasing function $\varphi(n)$ such that

(6.1)    $\sum_{n=1}^{\infty} \Pr(A_{m_n}(\varphi(m_n))) = \infty,$

and nevertheless $\varphi(n)$ belongs to the upper class. This shows that the converse of Theorem 1 is not true.

We put

(6.2)    $p_k = 2^{2^{2^k}}, \qquad k = 1, 2, \ldots,$

and define $\varphi(n)$ as follows:

(6.3)    $\varphi(n) = \log k \cdot \sqrt{p_k}, \qquad \text{for } p_{k-1} < n \le p_k.$

It follows from (0.11) that the number of $m_n$'s satisfying $\tfrac{1}{2}p_k < m_n \le p_k$ is $\ge c_{43} \log\log p_k$ and hence $\ge c_{44} 2^k$. Consequently, from (0.7) we have

(6.4)    $\sum_{\frac{1}{2}p_k < m_n \le p_k} \Pr(A_{m_n}(\varphi(m_n))) > \dfrac{c_{45}}{\log k} e^{-4(\log k)^2} \cdot c_{44} 2^k \ge c_{46} > 0.$

Since this is true for each $k$, (6.1) is proved.

Denote now

(6.5)    $M_k = E[t: \max_{1 \le n \le p_k} f_n(t) > \log k \cdot \sqrt{p_k}].$

In order to show that $\varphi(n)$ belongs to the upper class, it is clearly sufficient to prove that

(6.6)    $\sum_{k=1}^{\infty} \Pr(M_k) < \infty.$

It is easy to see that¹¹

(6.7)    $\Pr(M_k) \le 2\Pr(E[t: f_{p_k}(t) > \log k \cdot \sqrt{p_k}]).$
11 In general, we have

$\Pr(E[t: \max_{1 \le n \le p} f_n(t) > x]) \le 2\Pr(E[t: f_p(t) > x]).$

Indeed, we have

$\Pr(E[t: \max_{1 \le n \le p} f_n(t) > x]) = \Pr(E[t: f_p(t) > x])$
$\quad + \sum_{n=1}^{p-1} \Pr(E[t: f_1(t) \le x, \cdots, f_{n-1}(t) \le x, f_n(t) > x, f_p(t) \le x])$
$= \Pr(E[t: f_p(t) > x])$
$\quad + \sum_{n=1}^{p-1} \Pr(E[t: f_1(t) \le x, \cdots, f_{n-1}(t) \le x, f_n(t) > x, f_p(t) \ge 2f_n(t) - x])$
$\le \Pr(E[t: f_p(t) > x])$
$\quad + \sum_{n=1}^{p-1} \Pr(E[t: f_1(t) \le x, \cdots, f_{n-1}(t) \le x, f_n(t) > x, f_p(t) > x])$
$\le 2\Pr(E[t: f_p(t) > x]).$
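The inequality of footnote 11 can be checked exactly for small $p$ by enumerating all paths. The sketch below assumes, as an illustration only, that $f_n(t)$ is a sum of $n$ independent steps equal to $+1/2$ or $-1/2$ with equal probability (the definition of $f_n$ lies outside this passage); the function name is hypothetical.

```python
from fractions import Fraction
from itertools import product

def max_and_final_tail(p, x):
    """Enumerate all 2^p equally likely paths with steps +1/2 or -1/2
    (an assumed model for f_n) and return the exact pair
    (Pr(max_{1<=n<=p} f_n > x), Pr(f_p > x))."""
    hit_max = hit_end = 0
    for steps in product((1, -1), repeat=p):  # steps in units of 1/2
        partial = 0
        running_max = None
        for s in steps:
            partial += s
            running_max = partial if running_max is None else max(running_max, partial)
        if running_max > 2 * x:   # compare in units of 1/2
            hit_max += 1
        if partial > 2 * x:
            hit_end += 1
    total = 2 ** p
    return Fraction(hit_max, total), Fraction(hit_end, total)

# Check Pr(max > x) <= 2 Pr(f_p > x) for small p and several thresholds x >= 0.
for p in range(1, 11):
    for twice_x in range(0, 8):
        pr_max, pr_end = max_and_final_tail(p, Fraction(twice_x, 2))
        assert pr_max <= 2 * pr_end
```

The enumeration is exponential in $p$, so it only serves as a sanity check on short paths, not as a proof.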





Thus from (0.7) we have

(6.8)    $\sum_{k=1}^{\infty} \Pr(M_k) \le 2 \cdot c_2 \cdot \sum_{k=1}^{\infty} \dfrac{1}{\log k} e^{-2(\log k)^2} < \infty,$

which proves (6.6).

My indebtedness to my friend S. Kakutani is very great. In fact, he wrote 
the whole paper after listening to my rough oral exposition. 

University of Pennsylvania 
Philadelphia, Pa. 



Annals of Mathematics
Vol. 43, No. 3, July, 1942 


ON AN ELEMENTARY PROOF OF SOME ASYMPTOTIC FORMULAS 
IN THE THEORY OF PARTITIONS 


By P. Erdös

(Received January 7, 1942) 


Denote by $p(n)$ the number of partitions of $n$. Hardy and Ramanujan¹ proved in their classical paper that

$p(n) \sim \dfrac{1}{4n\,3^{1/2}}\, e^{cn^{1/2}}, \qquad c = \pi\bigl(\tfrac{2}{3}\bigr)^{1/2},$

using complex function theory. The main purpose of the present paper is to give an elementary proof of this formula. But we can only prove with our elementary method that

(1)    $p(n) \sim \dfrac{a}{n}\, e^{cn^{1/2}}$

and are unable to prove that $a = 1/(4 \cdot 3^{1/2})$.

Our method will be very similar to that used in a previous paper.² The starting point will be the following identity:

(2)    $n\,p(n) = \sum_{v=1}^{\infty} \sum_{k=1}^{\infty} v\,p(n - kv), \qquad p(0) = 1,\ p(-m) = 0.$

(We easily obtain (2) by adding up all the $p(n)$ partitions of $n$, and noting that $v$ occurs in $p(n - v)$ partitions, and occurs at least $k$ times in $p(n - kv)$ of them.) (2) is of course well known. In fact, Hardy and Ramanujan state in their paper³ that by using (2) they have obtained an elementary proof of

(3)    $\log p(n) \sim cn^{1/2}.$
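Identity (2) is easy to test numerically. The following is a minimal sketch, using the convention $p(0) = 1$ and a standard dynamic program for $p(n)$; the function names are illustrative and not taken from the paper.

```python
def partition_counts(limit):
    """Return [p(0), ..., p(limit)], adding one part size at a time."""
    p = [1] + [0] * limit
    for part in range(1, limit + 1):
        for total in range(part, limit + 1):
            p[total] += p[total - part]
    return p

def identity_rhs(n, p):
    """Right-hand side of (2): sum of v * p(n - k*v) over v, k >= 1 with k*v <= n."""
    return sum(v * p[n - k * v]
               for v in range(1, n + 1)
               for k in range(1, n // v + 1))

p = partition_counts(60)
for n in range(1, 61):
    assert n * p[n] == identity_rhs(n, p)
```

Each pair $(v, k)$ counts the partitions in which the part $v$ appears at least $k$ times, so the double sum adds up every part of every partition of $n$, which is $n\,p(n)$.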

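The proof of (3) is indeed easy. First we show that

(4)    $p(n) < e^{cn^{1/2}}.$

We use induction. (4) clearly holds for $n = 1$. By (2) and the induction hypothesis we have

$n\,p(n) < \sum_{v=1}^{\infty} \sum_{\substack{k=1 \\ kv \le n}}^{\infty} v\,e^{c(n-kv)^{1/2}} < \sum_{v=1}^{\infty} \sum_{k=1}^{\infty} v\,e^{cn^{1/2} - kcv/2n^{1/2}} = e^{cn^{1/2}} \sum_{k=1}^{\infty} \dfrac{e^{-kc/2n^{1/2}}}{(1 - e^{-kc/2n^{1/2}})^2}.$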


1 Hardy, Ramanujan, Asymptotic formulae in combinatory analysis, Proc. London Math. 
Soc. 17, (1918), pp. 75-115. 

2 Erdös, On some asymptotic formulas in the theory of factorisatio numerorum, these Annals 42, (1941), pp. 989-993.

3 Hardy, Ramanujan, ibid. p. 79.




Now it is easy to see that for all real $x$

$\dfrac{e^{-x}}{(1 - e^{-x})^2} < \dfrac{1}{x^2}.$

Thus

$n\,p(n) < e^{cn^{1/2}} \sum_{k=1}^{\infty} \dfrac{4n}{c^2 k^2} = n\,e^{cn^{1/2}},$

which proves (4).
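As a numerical illustration of (4) and of the constant in (1), one can compute $p(n)$ exactly and compare logarithms. This is only a sketch: the range $n \le 1000$ and the helper names are arbitrary choices, and $c = \pi(2/3)^{1/2}$ is the Hardy–Ramanujan constant.

```python
import math

def partition_counts(limit):
    """Return [p(0), ..., p(limit)] as exact integers."""
    p = [1] + [0] * limit
    for part in range(1, limit + 1):
        for total in range(part, limit + 1):
            p[total] += p[total - part]
    return p

C = math.pi * math.sqrt(2.0 / 3.0)

def scaled_ratio(n, p):
    """n * p(n) / exp(C * sqrt(n)), evaluated through logarithms to avoid overflow."""
    return math.exp(math.log(n) + math.log(p[n]) - C * math.sqrt(n))

p = partition_counts(1000)
# Bound (4): log p(n) < c * sqrt(n) on the whole computed range.
assert all(math.log(p[n]) < C * math.sqrt(n) for n in range(1, 1001))
for n in (10, 100, 1000):
    print(n, scaled_ratio(n, p), 1 / (4 * math.sqrt(3)))
```

The printed ratios approach $1/(4 \cdot 3^{1/2}) \approx 0.1443$ slowly from below, in agreement with the Hardy–Ramanujan value of $a$ that the elementary method does not reach.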

Similarly but with slightly longer calculations, we can prove that for every $\epsilon > 0$ there exists an $A > 0$ such that

(5)    $p(n) > \dfrac{1}{A}\, e^{(c - \epsilon)n^{1/2}}.$

(4) and (5) clearly imply (3).

To prove (1) we need the following 
Lemma 1: 


( 6 ) 


E = ZE 

v-1 *-1 
kv<n 


ve 


c(n-kv)i r 

1 + 

— kv 




for some fixed c > 0. 

Proof. We omit as many details as possible, since the proof is quite straightforward and uninteresting. We evidently have by expanding $1/(n - kv)$ and omitting the terms with $kv > n^{1/2+\epsilon}$


„ c(Tl—fct))* 1 00 00 1 00 00 

^ ^ = i £ £ \ 

A:—1 W kv Tl k** 1 W wl /c—1 - 

kv<n kv<n kv<n 


EE 


Now 



+ £2 + 0 



cnl oo a 

E = UEfe ! e 

n l r-l ib-l 



(It is easy to see that the other terms of e c(n kv)i can be neglected and that 
the summation for v and k can be extended to ».) Thus 


cn\ 


E*= -t£ 


2k 


n 


, 2 jfcZi (1 - e“* c/2ni ) 3 


+ 0 (£H"S 


2fc-8n* 
t=l k 3 c* 


+ 0 







On the other hand 

cn\ 


cn i oo oo . /^ cn *\ 

El - — E E ve- ckvlin \- ckivtlin ' + o(^r-) 

n tZi jfci \n* + 7 

— e "' /V V ,„-**'/*!•* eft*!! 1 -rte/tnA V~. ’P’' , n / 


crfci oo oo cni oo 

// C6 , 2 nx Z e -ckvl2n* _ Ce V 


Z" = —, Z E 

8n V-1 


6Af 


8n* *-i (1 — e —cfc/2 

* 6fc 2 -16" 2 

8n f A; 4 c 4 


- + o(^ 

n*) 4 ^ V« ,+ 7 


ce eB ’ A 6fcM6n z n /V"‘\ 3 e”‘ _/V^\ 

2- ,-1-4 +u ^ n r+.y cn* +U Vn H 7‘ 


cn^ oo oo cni oo -ck/2nl 

f e .. -Ckv!2n\ __ e e 


El--EE* 

W v-1 fc-1 


71 k^l (1 — e ~<*/2n*) 2 ' 

A simple calculation shows that 

e“* 1 . ^ • c-' kl2ni 


“ - s + °( 1 )> i- e - 


(1 - e _x ) 2 x : 


(l — g-ct/2" 1 ) 2 C 2 fc 2 


4?. + 0(1). 


Hence 


But 


And 


cni u a —ck/inl /„ cn *\ 

■ V £ A+£ «r-^)= + °(»4 “ -■* 

4n _ 4n 7r 2 _ 4?i yv 1 _ __ , ^/n\ 

k-ic 2 k 2 c 2 6 c 2 k>uk 2 U c l u \u 2 / 


—cfc/2ni 

V _£_ 

k>u (1 _ e -c*/2»») 


*' Z "’ _ f“ e ~cxl2nt ^ 

r'kl*ni)t J u (1 J e - cxl 2niy dX + U \ U *J 


=__ _^ + 0 ^ = — -- + 0 f- lN ). 

c(l — g-cu/2'* 1 ) c \u!y c 2 u c \n 1+< / 


Thus finally 


.i £*__ 

cn * 


E; = «*"' - + 


°(£ 


Hence 


L-E;-z!'+r.- e - , [i + o( ti ? s )' 


which proves the lemma. 






Next we show that

(7)    $0 < \liminf \dfrac{n\,p(n)}{e^{cn^{1/2}}} \le \limsup \dfrac{n\,p(n)}{e^{cn^{1/2}}} < \infty.$

To prove (7) write

(8)    $c_1^{(n)} = \max_{m \le n} \dfrac{m\,p(m)}{e^{cm^{1/2}}}.$

Clearly by (8) and (6) and (2)


c(n+l—kr)l 

(n + l)p(n+l) gcrEZ~ r ~-,-, 

i— 1 Jfc- 1 n + l — LV 
kv < « 


< c[ n) e c(n+l) 




Write 


(ft + i)p(ft + j ) 


e c 


(n+j')i 


C ‘" > ( ] + n‘+«) ’ J 1)2 ’ 


Then 

(n + r + l)p(n + r + 1) < ci* 0 £ 21 


ye 


(n+r-fl— kv)i 


v-1 1-1 n + r + 1 — A;t; 

kvg, n+r 


max bj _ c(n-fr+l—^ 

+c, <n> «&-- zi-r 




n*+* £1 n + r + 1 - 

kv^r 

l h max bj 2 c(ntr+l)i\ 

1 + A + r „£_ 

V n*+ e n*+ e n / 9 


since

$\sum_{kv \le r} v \le r^2.$

Hence

$b_{r+1} < b_1 + \dfrac{r^2 \max_{j \le r} b_j}{n}.$

We show that, for $r^2 \le n/2$, $b_{r+1} \le 2b_1$. We use induction. We have

$b_{r+1} < b_1 + \dfrac{r^2 \cdot 2b_1}{n} \le 2b_1.$


4 6i is chosen such that for every m > 0 


IE 

v k 
m—kv > 0 


e(m—kv)i 

Ve - r(m\) 


m — kv 


< e c 




kv 





Thus 


•>+(}»)»] 


< 



2b A 
n !+ '/ ‘ 


Or 


((m+l)2) 

Ol 


< 


Am*) 

Cl 



io&A 
**+•/ ’ 


and since $\prod_m (1 + 10b_1/m^{1+\epsilon})$ converges we see that $\limsup c_1^{(n)} < \infty$; i.e. $\limsup n\,p(n)/e^{cn^{1/2}} < \infty$. Similarly we can show that $\liminf n\,p(n)/e^{cn^{1/2}} > 0$, which completes the proof of (7).

Next we prove that

(9)    $\liminf \dfrac{n\,p(n)}{e^{cn^{1/2}}} = \limsup \dfrac{n\,p(n)}{e^{cn^{1/2}}},$

and this will complete the proof of (1). Suppose that (9) does not hold; write

(10)    $\liminf \dfrac{n\,p(n)}{e^{cn^{1/2}}} = d, \qquad \limsup \dfrac{n\,p(n)}{e^{cn^{1/2}}} = D.$

Now choose $n$ large and such that

$\dfrac{n\,p(n)}{e^{cn^{1/2}}} > D - \epsilon.$

Then since $p(n)$ is an increasing function of $n$ there exists a $c_2$ such that for every $m$ in the range $n \le m \le n + c_2 n^{1/2}$

$\dfrac{m\,p(m)}{e^{cm^{1/2}}} > \dfrac{d + D}{2}.$


Now we claim that for every $r_1$ there exists a $\delta_{r_1} = \delta(r_1)$ such that, for $n \le m \le n + r_1 n^{1/2}$,

(11)    $\dfrac{m\,p(m)}{e^{cm^{1/2}}} > d + \delta_{r_1}.$

We prove (11) as follows: We evidently have by our lemma 


c(m—fciO* 

mp(m) ^ d X — ~t~ 
7-itZim — kv 

kv <m 



EE 

t— 1 *-l 

« <km—kv$: n-f-co ^ 


^ ( c(m—kv)l 

m — kv 


o(e cmi ) 


5 


6 The term o(e c,ni ) is present because d is the lower limit and not the lower bound of 
mp{m) 






> de cmi + D -~^ -* £ t’ - o(e cm ') > de' mi + c,p c "‘ - o(e mi ) 

2 771 n ^ m—v ^ n+C 2 n i 


> (d + 5 ri )e cmi , ^i.e. —- f > ca^ 


which proves (11). 

Suppose $2n \ge m \ge n + s n^{1/2}$, $s$ sufficiently large; we show that


E2> 


( 12 ) 

Clearly 

c(m— hr) l c(m—k r)i 

ZZ ve —sZZ-- , 

v A: 77% kv v k 77% KV 

0<m—A-v<n kv>sni 


tv . e 

m-V,f<» m — A'y s 10 


2vf -ckvlU,i 


< e' mi z Z 

» * rn 

\m> kr> s n 4 


c(nt-kv)l 

+ Z Z --r 

v a m — kv 

vi>kv^[m 


x O, „r-divl2ml 

< e «‘ £ Z - + mV 11 ’", 

t; A- /ft 

Jm > Au > a n 4 


since 


Further 


EE.i a- 2 . 

v k 
k v < X 


zz 

vk 

\m>kv>$ n» 


-ckvl2m 4 


< Z Z Z pc' 

«»1 v A’ 

( w +1) « » 4 ^ Av > u a n 4 


—cw«n4/2m4 


< Z Z Z t'e- f “’ /4 <t(« + l) s ~ 2 ”"- c “‘ /4 


xS ft? 


«-l v-1 A=i 
kv £ (ti +1) s « 4 


Thus 


2 Z wT c *" /2ml < ms 2 £ (« + l) 2 e- f “* /4 < 


vk 


/ft 

4$ lfl 


for sufficiently large $s$. Hence finally


Ar)l cm 4 cm§ 

LZ—v< e rB + »V«-"< , - B 

v-i A-i m — kv 2s 10 s 10 

m—kv < n 

h 


for sufficiently large $m$ and $s$ (since $s < n^{1/2}$).





Consider now the intervals $n + t n^{1/2}$, $n + (t + 1)n^{1/2}$, $t > r_1$, $t + 1 < n^{1/2}$. Split it into $t^2$ equal parts. Write


. mp(m) 
min 


= d + 5“, n ^ m ^ n + (^t + ^ 


n 


and put 1 = Now let n + (t + u/t 2 )^ ^m^n + (t + (u + l)/t 2 )n h ; 
then we have 

c(m—fry) 1 r(m—fry)i 

*p(m) > d E E '- e - -r + E' E' - . - - o{<r\ 

v ~\ fr-i m — kv v * ^ m — 

fry < m 

where the primes indicate that the summation is extended only over those $v$ and $k$ for which $n \le m - kv \le n + (t + u/t^2)n^{1/2}$. Further by Lemma 1

_ r(m—frr)l 

i»p(»») ^ (d + s { r l, y mi - 8 ( r l) E" -- , 

m — kb 

c(m—kv ) 1 

- 5( Cu ~ I) E'" - o(e cmi ), 

m — kv 

where in the summation is extended only over those v and k for which 
m — kv ^ n, and in J*/" the summation is extended only over those v and k 
for which m — kv ^ n + (t + u/t 2 )nK We have by (11) 

cm! 

e 


E" < 


t"> 


Further we have 


V"' < ” 2 f n ' < &L 

^ t 4 m t 4 


Hence finally 


Hence 


mp(m) > c m ' (d + ST" - - o(e cmi ). 


g U) > ^ - o(l). 

Thus if t is fixed, independent of n, we have 

8 t+ 1 > 5,(l - «)* - o(l), 


therefore 





But JJm (1 — 3/w 4 ) m2 converges; thus, if n was sufficiently large, we have 8 t > 
8 ri /2. Now choose r 2 sufficiently large; then we have 8 rz > 8 r J2 , i.e. for n ^ m ^ 
n + r 2 n*, 

gem* 2 

Consider the interval n + tn\ n + (t + 1 )n\ t > r 2 . Split it into t 2 equal parts. 
8t u) and 8 t have the same meaning as before. Suppose n + (t + u/t 2 )n * ^ m £ 
n + (t + (u + 1 )/l 2 ) / n}; then evidently 

«p(*») > w + «< <M_I> ) Z' Z' - . » 

v * m — lev 


where the primes indicate that the summation is extended only over those v 
and k for which n^m — kv^n + n'(t + u/t 2 ). 

Now 


Z'Z 


where and TV" 
of we have 


ve 

m 


c(m—. c(m—Arr) ^ 

= Z Z ! - e -. t - Z" - Z"', 

— kv r-i m — kv 

kv<m 

ire defined as before. By (12) and the previous estimate' 



cm J 

Y" < e 
z -' t w ’ 

E"' 

2e cm> 
t* ‘ 


Hence by Lemma 1 





mp(m ) > e m \d + | 

H) 

_ h(d + a, ( “- 

‘V”* 

i.e. 

- 





d + sl u) > (d + a,'”- 1 ’) 

o-s 

\ _ 6 t (d + 

/ n H« 

-») 

> 

and 






d + 5<+i > (d + 5 ( ) ^1 

i 

rfcl 03, 

6i £ 2 (d -|- 5/ u 
n i+« 

-») 

or 






d + 5. > (d + 

n(i 

t>r t \ 

- 3V l _ M 3 

£ 4 / w*+' 

• 

For 

sufficiently large r 2 we have, 





(“ + t) ( j 


> d + ^, 


and 

if s g (log n) 2 and n is sufficiently large, 




s. 

.8 ’ 







that is, forn ^ m ^ n + n } (log n'f 

mpW > d 6ri 

(> cm ^ s 

Now suppose m > n + n*(log n) 2 ; vve shall show that 

c(m—A:r)i cm\ 

v = V V V 1 _ < ?_ 

v k Wl /cy ??2 

0 < m—kv < n 


We have 


2 < mV’* 1 < m 2 e cm *~ 10e Io,m < 6 


for sufficiently large n. Hence by Lemma 1, 


c{m—kv) 1 

E E -- rr >e 

v k m—kv 

m—kv ^ n 


•»(l_ 

V n i+ y • 


Now we continue as in the proof of (7). Suppose t > n + n*(log n) 2 ; write 
d + 8 t = min 7n ^ rt p ? n ^ m ^ t. 

e cmi 


(t + 1 )p(t + 1) ^ (d + W L Z r , > (rf + 

V * C — Acy 

/ —Art* ;> n 




Write 


(t + r)p(t + &r \ 

-^>Y - - W + W V 1 - (T+,; 


Then as in the proof of (7) we have 


_c(<+/+l-fcv)l 


(t + j 4* 1 )p(t + j + 1) > (d + 5|) 5Z ;-t- 

V k t ~r J “T 1 — KV 

t+j+l—kv n 


> « + - r r+f+I).) 

= (d + 8,)e cit+i+l)i (l - , 


max b r .2 

- (d + 5,) r -^ +r 3 - e' u+i+l)i 
max br .2 

(j -1- x\ r £i 3 _c(#+>+!)i 


w + 5 ' ) ^-i e ' 


.c(M-/+l)* | 


6 As in footnote 4 b[ is chosen such that for every m > n + nKlog n ) 2 

c{m-hv)l / 5' \ 

r^)' 


t> A; 
n < m —kv 





where 


6 ; + 1 < b'l + max b' r -i- . 

r£j t 

We show that for / < t/2 we have, b ]+1 < 2b[ . We use induction; we have 

&y+i < b[ + ^ = 26i. 


Thus 


That is, 


Therefore 


d + > (d + 5> 


■>(•-£) 


d + 5(a 4 i)2 > (d + 5«a) (1 


(-$) 


d + 5„ s > (d + ^njl - > d + 


which contradicts (10); and this completes the proof of (1). 

As can be seen, the main idea of our proof is rather simple; unfortunately the details are long and cumbersome. By the same method we can prove the following result: Let $m$ be a fixed integer. Denote by $p^{(m)}_{a_1, a_2, \cdots, a_r}(n)$ the number of partitions of $n$ into integers congruent to one of the numbers $a_1, a_2, \cdots, a_r$ (mod $m$). Then

(13)    $p^{(m)}_{a_1, a_2, \cdots, a_r}(n) \sim \dfrac{a}{n^{\alpha}}\, e^{Cn^{1/2}}, \qquad ((a_1, a_2, \cdots, a_r) = 1),$

where $C$ depends on $m$ and $r$, and $a$ and $\alpha$ depend on $m, a_1, a_2, \cdots, a_r$.

The same method will work if we consider partitions of $n$ into $r$th powers. Denote the number of partitions of $n$ into $r$th powers by $p_r(n)$. Hardy, Ramanujan and Wright⁷ proved that

(14)    $p_r(n) \sim c_1 n^{1/(r+1) - 3/2}\, e^{c_2 n^{1/(r+1)}}.$

Clearly as in the case of $p(n)$ we have

$n\,p_r(n) = \sum_{v} \sum_{\substack{k \\ v^r k \le n}} v^r p_r(n - kv^r).$


7 Hardy, Ramanujan, ibid. p. 111. Maitland Wright, Acta Math. 63, (1934), pp. 143-191. 
Wright proves a very much sharper result than (13). 





To prove (14) we should only have to prove the analogue of our lemma, namely 


( 15 ) 


E E (» - r r *) l ' lr+I) - , e cl( "-" , * >1/(r+,) 

v k 
v * A. < n 


_ l/(r-H)— \ c> 2 n l l O + l 

— 71 C 



_1 

i-(l/(r+I»+* 



If (15) is proved the proof of (14) proceeds as in the case of p(n). 

I have not worked out a proof of (15); it seems likely that a proof would be 
longer than that of Lemma 1, but would not present any particular difficulties. 

Recently Ingham 8 proved a Tauberian theorem from which (1) and (14) 
follow as corollaries. In fact his Theorem 2 gives a more general result, from 
which (13) also follows as a very special case. 

Denote by $P_r(n)$ the number of partitions of $n$ into powers of $r$. Clearly

$n\,P_r(n) = \sum_{v} \sum_{\substack{k \\ r^v k \le n}} r^v P_r(n - r^v k).$
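Both this identity and its analogue for $r$th powers are instances of one relation: for any set of allowed parts, $n$ times the number of partitions equals the sum of $a \cdot P(n - ka)$ over allowed parts $a$ and $k \ge 1$, with $P(0) = 1$. The sketch below checks this for squares and for powers of 2; the limit 200 and the function names are illustrative choices only.

```python
def restricted_partitions(limit, parts):
    """Return [P(0), ..., P(limit)], the number of partitions into the given parts."""
    ways = [1] + [0] * limit
    for part in parts:
        for total in range(part, limit + 1):
            ways[total] += ways[total - part]
    return ways

def weighted_sum(n, parts, ways):
    """Sum of a * P(n - k*a) over allowed parts a and k >= 1 with k*a <= n."""
    return sum(a * ways[n - k * a]
               for a in parts
               for k in range(1, n // a + 1))

LIMIT = 200
squares = [v * v for v in range(1, 15)]       # r-th powers with r = 2, up to 196
powers_of_two = [2 ** v for v in range(0, 8)]  # powers of r with r = 2, from 2^0 = 1 to 128

for parts in (squares, powers_of_two):
    ways = restricted_partitions(LIMIT, parts)
    assert all(n * ways[n] == weighted_sum(n, parts, ways)
               for n in range(1, LIMIT + 1))
```

The check says nothing about the asymptotic formulas themselves; it only confirms the starting identity on which the method rests.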


It might be possible to get an asymptotic formula for $P_r(n)$ by our method. I have not succeeded so far. But we can show without difficulty that

(16)    $\log P_r(n) \sim \dfrac{(\log n)^2}{2 \log r}.$

We have 

fix) = E p,(n)x n = II , 1 —. 

n =0 r=* l 1 


It is easy to see that for 0 ^ x ^ 1, 


(17) 


<■[ 



(1/(2 logo)) 1 on 1/(1— jt) 


< fix) 



(1, (2 log a) i log l/( l— «r) 


Thus 

Pr(n) - ~y < /^1 - 0 < 

that is 

J\in) < Ct n il0 ' mm *' 0 * m \ log Prin) < (1 - «) for n > m. 

2 log a 

Suppose now that for a certain large n log (P r (/fc)) < (1 — e)(log n) 2 /2 log a; 
then, since for m < n P r (?n) ^ P r (ft) we have 

(18) fix) < n) * /<2 E x k + E c, k iUtk)M lwa) x k , 

A-0 k>n 


8 A. K. Ingham, A Tauberian Theorem for Partitions, those Annals, 42 (1941), p. 1083. 






and a simple calculation shows that (18) contradicts (17). (Choose x = (1 — 6)n , 
8 = 6(c)). The same method would of course give 

$\log p(n) \sim \pi\bigl(\tfrac{2n}{3}\bigr)^{1/2}.$

We can also prove the following results: 

I. Let $a_1 < a_2 < \cdots$ be an infinite sequence of integers of density $\alpha$, such that the $a$'s have no common factor. Denote by $p'(n)$ the number of partitions of $n$ into the $a$'s. Then

(19)    $\log p'(n) \sim c(\alpha n)^{1/2}, \qquad \bigl(c = \pi(\tfrac{2}{3})^{1/2}\bigr).$

II. Let $a_1 < a_2 < \cdots$ be an infinite sequence of integers of density $\alpha$, such that every sufficiently large $m$ can be expressed as the sum of different $a$'s. Then denote by $P'(n)$ the number of partitions of $n$ into different $a$'s. Then

(20)    $\log P'(n) \sim c\bigl(\tfrac{\alpha}{2} n\bigr)^{1/2}.$

We shall sketch the proof of II; the proof of I is similar but simpler. Denote by $P(n)$ the number of partitions of $n$ into different summands: it is well known that⁹

(21)    $\log P(n) \sim c\bigl(\tfrac{n}{2}\bigr)^{1/2}.$

First we show that

(22)    $\limsup \dfrac{\log P'(n)}{c\bigl(\tfrac{\alpha}{2} n\bigr)^{1/2}} \le 1.$


To the partition $n = a_{i_1} + a_{i_2} + \cdots + a_{i_r}$ we make correspond the partition $i_1 + i_2 + \cdots + i_r$. For $i > i_0$ we have $i < a_i(\alpha + \epsilon)$; therefore $i_1 + i_2 + \cdots + i_r < n(\alpha + \epsilon) + c_4$. Thus each partition of $n$ into the $a$'s is mapped into a partition of integers $\le n(\alpha + 2\epsilon)$ with all integers as summands; hence from (20) we obtain (22). Next we prove that

(23)    $\liminf \dfrac{\log P'(n)}{c\bigl(\tfrac{\alpha}{2} n\bigr)^{1/2}} \ge 1.$

Split the sequence $a_i$ into two disjoint sequences $b_1, b_2, \cdots$ and $c_1, c_2, \cdots$, where the $b$'s have density 0 and every sufficiently large integer is the sum of different $b$'s and the $c$'s are the remaining $a$'s. It is easy to see that we can find the $b$'s with the required property; also the density of the $c$'s is clearly $\alpha$. Denote by $Q(n)$ the number of partitions of $n$ into the $c$'s. Now associate


9 A well known result of Euler states that the number of partitions of $n$ into odd integers equals the number of partitions of $n$ into different summands. Thus (20) follows from I.
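Euler's theorem quoted in footnote 9 can be confirmed for small $n$ by direct counting. The sketch below is illustrative only; the function name and the range of $n$ are arbitrary.

```python
def count_partitions(n, parts, distinct=False):
    """Number of partitions of n into the given parts;
    with distinct=True each part may be used at most once."""
    ways = [1] + [0] * n
    for part in parts:
        if part > n:
            continue
        if distinct:
            # Go downwards so that each part is used at most once.
            for total in range(n, part - 1, -1):
                ways[total] += ways[total - part]
        else:
            for total in range(part, n + 1):
                ways[total] += ways[total - part]
    return ways[n]

for n in range(1, 80):
    odd_parts = count_partitions(n, range(1, n + 1, 2))
    different_parts = count_partitions(n, range(1, n + 1), distinct=True)
    assert odd_parts == different_parts
```

Since the odd integers have density $1/2$, this equality is what lets the formula for partitions into different summands be read off from result I with $\alpha = 1/2$.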





with the partition $n = i_1 + i_2 + \cdots + i_k$, $i_1 < i_2 < \cdots < i_k$ the partition $c_{i_1} + c_{i_2} + \cdots + c_{i_k}$; as before, we have

$\dfrac{n}{\alpha + \epsilon} < c_{i_1} + c_{i_2} + \cdots + c_{i_k} < \dfrac{n}{\alpha - \epsilon}.$

Hence for at least one $m$ with $n/(\alpha + \epsilon) < m < n/(\alpha - \epsilon)$, $Q(m) > P(n)(\alpha - \epsilon)/n$. Thus there exists a sequence of integers $x_1 < x_2 < \cdots$ with $\lim x_{i+1}/x_i = 1$ and

(24)    $\liminf \dfrac{\log Q(x_i)}{c\bigl(\tfrac{\alpha}{2} x_i\bigr)^{1/2}} \ge 1.$

Now suppose $x_j \le m < x_{j+1}$. Choose $x_i$ such that $\epsilon m > m - x_i > C$. Then $m - x_i$ is a sum of different $b$'s, hence $P'(m) \ge Q(x_i)$. Thus (23) follows from (24), and this completes the proof of II.

It might be worth while to mention the following problem: Let $a_1 < a_2 < \cdots$ be an infinite sequence of integers, such that $\log P(n) \sim c(\alpha n)^{1/2}$. Does it then follow that the density of the $a$'s is $\alpha$? I cannot decide this problem. Perhaps the following result might be of some interest in this connection: Let $a_1 < a_2 < \cdots$ be an infinite sequence of integers. $f(n)$ denotes the number of $a$'s $\le n$, and $\varphi(n)$ denotes the number of solutions of $a_i + a_j \le n$. It can be shown trivially that if $\lim f(n)/n^{\alpha} = c_1$ then $\lim \varphi(n)/n^{2\alpha} = c_2$. But the converse is also true, and can be simply proved by using a Tauberian theorem of Hardy and Littlewood.¹⁰ We have

(/(z)) 2 = (£ z ) 2 = t 

and, since ^ ^ = y(n) ~ c 2 n a , we evidently have 


/(z) 

L — l 


C& 


(1 - *)“ 


and hence by the theorem of Hardy and Littlewood, 

fin) = 1 ~ c x n a . 

n 


By the same methods that were used in proving II, we can prove the following result: Denote by $R(n)$ the number of partitions of $n$ into integers relatively prime to $n$. We have

$\log R(n) \sim c(\varphi(n))^{1/2}.$

Similarly, if we denote by $R'(n)$ the number of partitions of $n$ into different integers relatively prime to $n$, we have

$\log R'(n) \sim c\Bigl(\dfrac{\varphi(n)}{2}\Bigr)^{1/2}.$


10 Hardy-Littlewood, Tauberian Theorems , Proc. London Math. Soc. 13, (1914), pp. 
174-191. 






I have not succeeded in finding asymptotic formulas for $R(n)$ and $R'(n)$.
This problem seems rather difficult. 

March 12, 1942. 

In the meantime I have proved the above conjecture. Consider

$f(x) = \sum_{n=1}^{\infty} P(n)x^n = \prod_{k=1}^{\infty} \dfrac{1}{1 - x^{a_k}}.$

If we assume that $\log P(n) \sim c(\alpha n)^{1/2}$, we obtain by a simple calculation

$\log f(x) \sim \dfrac{\alpha\pi^2}{6} \cdot \dfrac{1}{1 - x} \qquad (x \to 1).$

But

$\log f(x) = \sum_i x^{a_i} + \tfrac{1}{2} \sum_i x^{2a_i} + \cdots = \sum_{k=1}^{\infty} b_k x^k.$

Denote by $A(n)$ the number of $a$'s not exceeding $n$. We have

$B(n) = \sum_{k=1}^{n} b_k = \sum_{k=1}^{n} \dfrac{1}{k} A\Bigl(\dfrac{n}{k}\Bigr).$

But by the well known Tauberian theorem of Hardy-Littlewood,¹¹ we have

$B(n) \sim \dfrac{\alpha\pi^2}{6}\, n.$

Hence

$A(n) \sim \sum_{k=1}^{\infty} \dfrac{\mu(k)}{k^2} \cdot \dfrac{\alpha\pi^2}{6}\, n = \alpha n. \qquad \text{q.e.d.}$

Similarly we can show that if $\log P'(n) \sim c[(\alpha/2)n]^{1/2}$, the density of the $a$'s is $\alpha$.

University of Pennsylvania


11 Hardy-Littlewood, ibid.



Annals of Mathematics
Vol. 43, No. 3, July, 1942 


ON A PROBLEM OF I. SCHUR 

By P. Erdös and G. Szegö
(Received June 11, 1941) 

To THE MEMORY OF I. SCHUR 

1. Introduction 

1. Let $n \ge 3$, and let $Q_n$ denote the class of polynomials $f(x)$ of degree $n$
satisfying the condition $|f(x)| \le 1$ in the interval $-1 \le x \le +1$. Let $Q_n(x_0)$
denote the subclass of $Q_n$ characterized by the further restriction $f''(x_0) = 0$.

A well-known theorem of A. Markoff¹ states that $|f'(x)| \le n^2$ for $-1 \le x \le +1$
provided that $f(x) \in Q_n$; here $|f'(x)| = n^2$ holds if and only if $x = \pm 1$
and $f(x) = \pm T_n(x)$, where $T_n(x)$ denotes the $n$th Tchebycheff polynomial. We
observe that $T_n(x)$ does not belong to the classes $Q_n(\pm 1)$.

Some years ago I. Schur² proved the following interesting theorem: Let
$-1 \le x_0 \le +1$, and let $f(x)$ belong to $Q_n(x_0)$. Then $|f'(x_0)| < \frac12 n^2$. Moreover
he showed: Let $m_n$ be the least positive constant (depending only on $n$) such that
$|f'(x_0)| \le m_n n^2$ for all $f(x) \in Q_n(x_0)$, and $x_0$ in $-1 \le x \le +1$. If
$\mu = \limsup_{n\to\infty} m_n$, then

(1.1)    $0.217\cdots \le \mu \le 0.465\cdots$.

Obviously

(1.2)    $m_n n^2 = \max_{-1 \le x_0 \le +1}\ \max_{f(x) \in Q_n(x_0)} |f'(x_0)|$.

The main purpose of the present note is to determine the constant $\mu$ and the
polynomial $f(x)$ for which the extremum (1.2) is attained. In terms of the constant
$m_n$, we obtain a bound for the derivative $f'(x)$ of a polynomial $f(x)$ which
satisfies the condition that $|f'(x)|$ has a relative maximum at the point $x$
considered.

2. Let $u_n(x)$ be the polynomial of the class $Q_n(+1)$ for which $u_n'(1)$ is a maximum.
This polynomial $u_n(x) = u_n(x; A_n)$ can be determined from the transcendental
equations (2.5), (2.6) and (2.17) of §2 (see below). It is a special case of a
remarkable class of polynomials $u_n(x; A)$ considered first by G. Zolotareff³

1 A. Markoff, On a certain problem of D. I. Mendeleieff (in Russian), Zapiski Imperatorskoi
Akademii Nauk, vol. 62 (1889), pp. 1-24.

2 I. Schur, Über das Maximum des absoluten Betrages eines Polynoms in einem gegebenen
Intervall, Mathematische Zeitschrift, vol. 4 (1919), pp. 271-287.

3 G. Zolotareff, (a) On a question concerning a minimum value (in Russian), Dissertation
"pro venia legendi," published in lithographed form, 1868, Oeuvres, vol. 2 (1902), pp. 130-166;
(b) Application of elliptic functions to questions concerning functions which deviate the
least from zero (in Russian), Zapiski Imperatorskoi Akademii Nauk, vol. 30 (1877), Oeuvres,
vol. 2, pp. 1-59; (c) Sur l'application des fonctions elliptiques aux questions de maxima et
minima, Bulletin de l'Académie des Sciences de St.-Pétersbourg, series 3, vol. 24 (1878),
pp. 305-310, Mélanges, 5, pp. 419-426, Oeuvres, vol. 1 (1931), pp. 369-374.



playing also a role in the important investigations of W. Markoff.⁴ Recently
N. Achyeser⁵ used polynomials of the Zolotareff type in his investigations on
polynomials of least deviation in two disjoint intervals. With the previous
notation, our main result is:

Theorem 1. The extremum $m_n n^2$ in (1.2) is attained for $x_0 = +1$ and for the
Zolotareff polynomials $\pm u_n(x)$ [or for $x_0 = -1$ and for $\pm u_n(-x)$], provided $n$ is
sufficiently large. Furthermore

(1.3)    $\lim_{n\to\infty} m_n = \mu$

exists and

(1.4)    $\mu = k^{-2}(1 - E/K)^2 = 0.3124\cdots$

where $k^2$ is the only root of the transcendental equation

(1.5)    $(K - E)^3 + (1 - k^2)K - (1 + k^2)E = 0$

satisfying the condition $0 < k^2 < 1$. Here $K$ and $E$ are the complete elliptic
integrals associated with the modulus $k$.
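
The root of (1.5) and the constant in (1.4) can be cross-checked numerically. The following is a minimal sketch and not part of the original argument: it assumes the standard arithmetic-geometric-mean evaluation of the complete elliptic integrals, and the helper names `ellip_KE`, `lhs_15`, `solve_k2` and `mu_from_k2` are hypothetical. Since the printed values $k^2 = 0.84\cdots$ and $0.3124\cdots$ are quoted to limited precision, the sketch should only be expected to agree with them to about two decimal places.

```python
import math

def ellip_KE(m):
    """Complete elliptic integrals K and E for m = k^2 (0 <= m < 1), via the AGM."""
    a, b = 1.0, math.sqrt(1.0 - m)
    s = 0.5 * m          # running sum of 2^(j-1) * c_j^2, starting with c_0^2 = k^2
    weight = 1.0
    for _ in range(12):  # the AGM converges quadratically; 12 steps is ample
        c = 0.5 * (a - b)
        a, b = 0.5 * (a + b), math.sqrt(a * b)
        s += weight * c * c
        weight *= 2.0
    K = math.pi / (2.0 * a)
    return K, K * (1.0 - s)

def lhs_15(m):
    """Left-hand member of (1.5) as a function of m = k^2."""
    K, E = ellip_KE(m)
    return (K - E) ** 3 + (1.0 - m) * K - (1.0 + m) * E

def solve_k2(lo=0.5, hi=0.99):
    """Bisection for the root of (1.5); lhs_15 is negative at lo and positive at hi."""
    for _ in range(80):
        mid = 0.5 * (lo + hi)
        if lhs_15(mid) < 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def mu_from_k2(m):
    """Right-hand side of (1.4)."""
    K, E = ellip_KE(m)
    return (1.0 - E / K) ** 2 / m

if __name__ == "__main__":
    k2 = solve_k2()
    print(k2, mu_from_k2(k2))
```

Bisection is used here only because the left-hand member of (1.5) changes sign exactly once on the chosen bracket; any other root finder would serve equally well.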

A further analysis and discussion of a few special cases furnishes the more 
informative 

Theorem 2. If $n > 3$ the extremum $m_n n^2$ in (1.2) is attained in the cases
mentioned in Theorem 1, and only in these cases. If $n = 3$, it is attained for
$x_0 = 0$ and for the Tchebycheff polynomials $\pm T_3(x)$, and only then.

In §§2 and 3 of the present paper we first study as a preparation the general
polynomials $u_n(x; A)$ of Zolotareff and the special case $u_n(x) = u_n(x; A_n)$
mentioned above. The proof of Theorem 1 is then given in §§4 and 5, and that of
Theorem 2 in §§6 and 7. In §8 we consider two problems of Zolotareff in which
the polynomials $u_n(x; A)$ were first used; §9 contains another application.

The polynomials of Zolotareff occur in numerous other related extremum
problems. They satisfy a simple differential equation by means of which they
can be brought in relationship with the multiplication problem of elliptic
integrals. In what follows we have tried to reduce the use of elliptic functions
to a minimum.⁶


4 W. Markoff, Über Polynome, die in einem gegebenen Intervalle möglichst wenig von Null
abweichen, Mathematische Annalen, vol. 77 (1916), pp. 213-258. The Russian original
appeared 1892.

5 N. Achyeser, (a) Über einige Funktionen, welche in zwei gegebenen Intervallen am wenigsten
von Null abweichen, Bulletin de l'Académie des Sciences de l'URSS, Classe des sciences
mathématiques et naturelles, series 7, 1932, pp. 1163-1202; (b) Über einige Funktionen, die in
gegebenen Intervallen am wenigsten von Null abweichen, Bulletin de la Société Physico-
Mathématique de Kazan, series 3, vol. 3 (1928), pp. 1-69.

6 Zolotareff and Achyeser make extensive use of the theory of elliptic functions; however,
W. Markoff does not.





2. On the polynomials of Zolotareff 

1. It is a classical fact that there is a unique polynomial $T_n(x)$ of degree $n$
(the $n$th polynomial of Tchebycheff) having the following property: The curve
$y = T_n(x)$, $-1 \le x \le +1$, consists of $n$ monotonic arcs varying between $+1$
and $-1$; $T_n(1) = 1$, and $T_n(-1) = (-1)^n$. This polynomial satisfies the
differential equation

(2.1)    $n^2(1 - y^2) = (1 - x^2)\,y'^2$

from which follows 


( 2 . 2 ) 



2. We show that there are infinitely many polynomials $y$ of degree $n$ possessing
the following property: The curve $y$, $-1 \le x \le +1$, consists of $n - 1$ monotonic
arcs varying between $+1$ and $-1$, $y = 1$ for $x = 1$, and $y = (-1)^{n-1}$ for $x = -1$.
Such a curve necessarily has $n - 1$ roots in $-1 \le x \le +1$ and consequently
one more outside this interval. If this additional root is $> 1$, $y$ satisfies a
differential equation of the form

(2.3)    $n^2(1 - y^2) = (1 - x^2)\,y'^2\,\frac{(B - x)(C - x)}{(A - x)^2}$

where $y' = 0$ for $x = A$, $y = 1$ for $x = B$, $y = -1$ for $x = C$, and $1 < A <
B < C$. A similar differential equation holds if the additional root of $y$ mentioned
above is $< -1$. (The second case can be obtained from the first one by
replacing $x$ by $-x$.)

Solving the differential equation (2.3), we obtain

(2.4)    $y = \cos\left\{ n \int_x^1 (A - t)(B - t)^{-1/2}(C - t)^{-1/2}(1 - t^2)^{-1/2}\,dt \right\}.$

From the properties of $y$ mentioned above we find

(2.5)    $\int_{-1}^{+1} (A - t)(B - t)^{-1/2}(C - t)^{-1/2}(1 - t^2)^{-1/2}\,dt = (n - 1)\pi/n,$

(2.6)    $\int_{1}^{B} (A - t)(B - t)^{-1/2}(C - t)^{-1/2}(t^2 - 1)^{-1/2}\,dt = 0,$

(2.7)    $\int_{B}^{C} (t - A)(t - B)^{-1/2}(C - t)^{-1/2}(t^2 - 1)^{-1/2}\,dt = \pi/n.$

By a well-known application of Cauchy's theorem we see that the sum of the
integrals (2.5) and (2.7) is $\pi$, so that (2.7) is a consequence of (2.5).

Conversely, if (2.5) and (2.6) hold, an easy discussion (encircling the singular
points $-1$, $+1$, $B$, $C$) shows that (2.4) is an analytic function single-valued and
regular in the whole finite $x$-plane. If $x \to \infty$ we find $y = O(|x|^n)$, so that $y$
must be a polynomial of degree $n$. Of course it satisfies the differential equation
(2.3), and it has all the properties mentioned above.

For later purposes we note that 


(2.8)    $y' = n^2\,\frac{(A - 1)^2}{(B - 1)(C - 1)}$    at $x = 1$,

(2.9)    $(-1)^n y' = n^2\,\frac{(A + 1)^2}{(B + 1)(C + 1)}$    at $x = -1$.


These values can be obtained from the differential equation (2.3). 

3. Lemma 1. Of the three quantities $A$, $B$, $C$ ($1 < A < B < C$) satisfying
the two transcendental equations (2.5) and (2.6), any one can be prescribed arbitrarily
provided that

(2.10)    $A > 1$, or $B > 1$, or $C > c_n = 1 + 2\alpha_n = 1 + 2\tan^2[\pi/(2n)]$

respectively; the two others are then uniquely determined. As $A$ increases monotonically
from $1$ to $+\infty$, $B$ and $C$ increase likewise from $1$ to $+\infty$ and from $c_n$
to $+\infty$, respectively.

Furthermore the values of $y, y', \cdots, y^{(n)}$ for a fixed $x$ not less than one, and the
values of $(-1)^n y, (-1)^{n-1} y', \cdots, y^{(n)}$ for a fixed $x$ not greater than $-1$, are
increasing functions of $A$.

The only exceptions are $y = 1$ for $x = 1$ and $(-1)^n y = -1$ for $x = -1$.
In particular, the expressions (2.8) and (2.9) are respectively increasing and
decreasing functions of $A$.

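In order to prove this Lemma, let $B$ denote a fixed value, greater than 1,
and let $C$ be variable, such that $C > B$; we define $A = A(C)$ by (2.6) so that
$1 < A < B$. Then

$\int_1^B \left( \frac{dA}{dC} - \frac12\,\frac{A - t}{C - t} \right)(B - t)^{-1/2}(C - t)^{-1/2}(t^2 - 1)^{-1/2}\,dt = 0;$

hence

$\frac{dA}{dC} = \frac12\,\frac{A - t_0}{C - t_0}$, where $1 < t_0 < B$.

Now consider the function $\lambda(C)$ defined by the left-hand member of (2.5), where
$A = A(C)$. We find

$\lambda'(C) = \int_{-1}^{+1} \left( \frac{dA}{dC} - \frac12\,\frac{A - t}{C - t} \right)(B - t)^{-1/2}(C - t)^{-1/2}(1 - t^2)^{-1/2}\,dt$

$\qquad = \frac12 \int_{-1}^{+1} \left( \frac{A - t_0}{C - t_0} - \frac{A - t}{C - t} \right)(B - t)^{-1/2}(C - t)^{-1/2}(1 - t^2)^{-1/2}\,dt < 0,$

so that $\lambda(C)$ is decreasing. Let $C \to B$; then from (2.6) we see that $A \to B$,
so that

$\lambda(C) \to \int_{-1}^{+1} (1 - t^2)^{-1/2}\,dt = \pi.$

Since $\lim_{C\to\infty} \lambda(C) = 0$, the equation $\lambda(C) = (n - 1)\pi/n$ has precisely one
solution.⁷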

4. Further let $p(x)$ and $q(x)$ be two special cases of (2.4) corresponding to the
values $A', B', C'$ and $A'', B'', C''$ of $A$, $B$, $C$, respectively. First suppose that
$p'(1) < q'(1)$. Considering the polynomial $\delta(x) = p(x) - q(x)$ at the $n$ points
in $-1 \le x \le +1$ at which $p(x) = \pm 1$ and assuming that $\delta(x) \not\equiv 0$, a familiar
argument furnishes the existence of $n - 1$ distinct points $+1 > \eta_1 > \eta_2 >
\cdots > \eta_{n-1} > -1$ such that $\delta'(\eta_1) > 0$, $\delta'(\eta_2) < 0$, $\cdots$. Furthermore $\delta'(1) < 0$,
so that $\delta'(x)$ has $n - 1$ roots (that is, all its roots) in $-1 < x < +1$. The same
holds for $\delta''(x), \delta'''(x), \cdots$ so that $\delta(x) < 0$, $\delta'(x) < 0$, $\delta''(x) < 0$, $\cdots$ for $x \ge 1$
[except that $\delta(1) = 0$], and also $(-1)^n \delta(x) < 0$, $(-1)^{n-1} \delta'(x) < 0$,
$(-1)^{n-2} \delta''(x) < 0$, $\cdots$ for $x \le -1$ [except that $\delta(-1) = 0$]. From this we
easily conclude that the relations $A' < A''$, $B' < B''$, $C' < C''$ hold for the
constants corresponding to $p(x)$ and $q(x)$.

If $p'(1) = q'(1)$ the previous argument still holds good [unless $\delta(x) \equiv 0$],
except that $\delta'(1) = 0$ so that the roots of $\delta'(x)$ are in $-1 < x \le +1$. Consequently
$\delta'(x) < 0$ for $x > 1$. Interchanging $p(x)$ and $q(x)$ we obtain $\delta'(x) > 0$,
$x > 1$, which is a contradiction; so that in this case $p(x) \equiv q(x)$, $A' = A''$,
$B' = B''$, $C' = C''$.

From the previous considerations we conclude that $B$ and $C$ are increasing
functions of $A$. It remains to calculate the limits of $B$ and $C$ as $A \to 1$ and
$A \to +\infty$. In the former case, (2.6) shows that $B \to 1$, and from (2.5) we
obtain $C \to c_n$ since the equation

$\int_{-1}^{+1} (1 + t)^{-1/2}(\gamma - t)^{-1/2}\,dt = (n - 1)\pi/n$

has the unique solution $\gamma = c_n$. If $A \to +\infty$ it is obvious that $B \to +\infty$,
$C \to +\infty$. This completes the proof of Lemma 1.
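
The last step, that $\gamma = c_n$ solves the limiting equation, can be spot-checked by quadrature. This is a sketch under stated assumptions rather than anything from the paper: the function names are hypothetical, the substitution $t = -1 + s^2$ is introduced here only to remove the endpoint singularity, and a plain midpoint rule is used for the integral.

```python
import math

def c_n(n):
    """c_n = 1 + 2 tan^2(pi / (2n)), as in (2.10)."""
    return 1.0 + 2.0 * math.tan(math.pi / (2 * n)) ** 2

def limiting_integral(gamma, steps=20000):
    """Midpoint-rule value of the integral of (1+t)^(-1/2) (gamma-t)^(-1/2) over [-1, 1].

    With t = -1 + s^2 the integrand becomes 2 / sqrt(gamma + 1 - s^2) on [0, sqrt(2)],
    which is smooth as long as gamma > 1.
    """
    upper = math.sqrt(2.0)
    h = upper / steps
    total = 0.0
    for i in range(steps):
        s = (i + 0.5) * h
        total += 2.0 / math.sqrt(gamma + 1.0 - s * s)
    return total * h

if __name__ == "__main__":
    for n in range(3, 7):
        print(n, limiting_integral(c_n(n)), (n - 1) * math.pi / n)
```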

5. In what follows we denote the polynomial (2.4) [for which (2.5) and (2.6)
hold] by $y = u_n(x; A)$. We note that, from (2.4) and (2.10),

(2.11)    $u_n(x; +1) = \lim_{A\to 1} u_n(x; A) = \cos\left\{ n \int_x^1 (1 + t)^{-1/2}(c_n - t)^{-1/2}\,dt \right\} = -T_n\!\left( \frac{x - \alpha_n}{1 + \alpha_n} \right).$

Hence $u_n'(+1; +1) = 0$. Also

(2.12)    $u_n''(+1; +1) = -(1 + \alpha_n)^{-2}\,T_n''[\cos(\pi/n)] = -\tfrac14 n^2 \cot^2[\pi/(2n)].$


7 These considerations require only slight modifications if we replace the right-hand
members in (2.5) and (2.6) by $\nu\pi/n$ and $\pi - \nu\pi/n$, $1 \le \nu \le n - 1$. The resulting polynomials
have been used for various purposes by Achyeser; see loc. cit.





Further, let A —> + <» so that B —> + oo and C —* + oo . From (2.5) 

(A - <1 )(B - h)-\C - <,)'* = (n - l)/» 

where 6 is a suitably chosen number between 0 and 1. Hence A(BC)~* —► 
1 — 1/n, so that, from (2.4), 

(2.13)    $u_n(x; +\infty) = \lim_{A\to+\infty} u_n(x; A) = T_{n-1}(x).$

Hence

(2.14)    $u_n'(+1; +\infty) = (n - 1)^2;$

(2.15)    $u_n''(+1; +\infty) = \tfrac13 n(n - 1)^2(n - 2).$

Therefore, as $A$ increases from $1$ to $+\infty$, $u_n'(+1; A)$ increases from $0$ to
$(n - 1)^2$, and $u_n''(+1; A)$ increases from the negative value (2.12) to the positive
value (2.15), corresponding respectively to $A = 1$ and $A = +\infty$. There is
precisely one value of $A$, $A = A_n$, for which $u_n''(+1; A_n) = 0$. We denote the
corresponding values of $B$ and $C$ by $B_n$ and $C_n$. In §§4 and 5 we shall prove
that the function $u_n(x; A_n)$ furnishes the solution of I. Schur's problem formulated
above, provided $n$ is sufficiently large.

From the differential equation (2.3) we obtain 


(2.16)    $u_n''(+1; A) = \frac{n^2}{3}\,\frac{(A - 1)^2}{(B - 1)(C - 1)} \left\{ n^2\,\frac{(A - 1)^2}{(B - 1)(C - 1)} - 1 - 2\left( \frac{2}{A - 1} - \frac{1}{B - 1} - \frac{1}{C - 1} \right) \right\}$

so that the condition $u_n''(+1; A) = 0$ is equivalent to

(2.17)    $n^2\,\frac{(A - 1)^2}{(B - 1)(C - 1)} = 1 + 2\left( \frac{2}{A - 1} - \frac{1}{B - 1} - \frac{1}{C - 1} \right).$

The transcendental equations (2.5), (2.6) and (2.17) determine the constants
$A = A_n$, $B = B_n$, $C = C_n$ uniquely. These constants depend only on $n$.

The polynomial $y = u_n(x; A_n)$ is completely determined by the following
conditions: The curve $y$, $-1 \le x \le +1$, consists of $n - 1$ monotonic arcs
varying between $+1$ and $-1$, $y = 1$ for $x = 1$, $y = (-1)^{n-1}$ for $x = -1$ and
$y'' = 0$ for $x = 1$.


3. The limiting process $n \to \infty$

1. First we prove the following 

Lemma 2. The constants $A_n$, $B_n$, $C_n$ defined by the transcendental equations
(2.5), (2.6), (2.17) satisfy

(3.1)    $\lim_{n\to\infty} n^2(A_n - 1) = a^2/2$,  $\lim_{n\to\infty} n^2(B_n - 1) = b^2/2$,  $\lim_{n\to\infty} n^2(C_n - 1) = c^2/2$

where $0 < a < b < c$. The numerical values of $a$, $b$, $c$ are given in (3.17).





By Lemma 1

$\liminf_{n\to\infty} n^2(C_n - 1) \ge \pi^2/2$

and from (2.17)

$n^2\,\frac{A_n - 1}{C_n - 1} \ge \frac{2}{A_n - 1} - \frac{2}{C_n - 1},$

(3.2)    $n^4(A_n - 1)^2 + 2n^2(A_n - 1) \ge 2n^2(C_n - 1),$

so that

(3.3)    $\liminf_{n\to\infty} n^2(A_n - 1) \ge (1 + \pi^2)^{1/2} - 1.$

The same inequality holds if we replace $A_n$ by $B_n$.

On the other hand, let us assume that $n^2(C_n - 1) \to +\infty$ for a proper subsequence
$n = n_\nu$ as $\nu \to \infty$; then, from (3.2), $n^2(A_n - 1) \to +\infty$, so that
$n^2(B_n - 1) \to +\infty$. Therefore, by (2.17), for the same subsequence $n = n_\nu$,

(3.4)    $\frac{(A_n - 1)^2}{(B_n - 1)(C_n - 1)} \to 0.$

Now let $\omega$ be a fixed positive number; for large $n$, from (2.5),

$\pi = n \int_{-1}^{+1} dt\,(1 - t^2)^{-1/2}\left[ 1 - (A_n - t)(B_n - t)^{-1/2}(C_n - t)^{-1/2} \right]$

(3.5)    $\quad > n \int_0^{\omega/n} d\varphi\,\{ 1 - (A_n - \cos\varphi)(B_n - \cos\varphi)^{-1/2}(C_n - \cos\varphi)^{-1/2} \}$

$\quad = \int_0^{\omega} d\psi\,\{ 1 - (A_n - \cos(\psi/n))(B_n - \cos(\psi/n))^{-1/2}(C_n - \cos(\psi/n))^{-1/2} \}.$

Since

$(A_n - \cos(\psi/n))(B_n - \cos(\psi/n))^{-1/2}(C_n - \cos(\psi/n))^{-1/2} \le (A_n - 1)(B_n - 1)^{-1/2}(C_n - 1)^{-1/2}\left\{ 1 + \frac{1 - \cos(\psi/n)}{A_n - 1} \right\}$

and since for $n = n_\nu$, as $\nu \to \infty$,

$\frac{n^2(1 - \cos(\psi/n))}{n^2(A_n - 1)} \to 0$

uniformly in $\psi$ for $0 \le \psi \le \omega$, we find $\pi \ge \omega$. This is a contradiction if we
choose $\omega > \pi$. Thus we have proved that the points of accumulation of the
sequences $n^2(A_n - 1)$, $n^2(B_n - 1)$, $n^2(C_n - 1)$ are positive and finite.

2. Now let $n = n_\nu$ be a subsequence for which the limits (3.1) exist, where
$0 < a \le b \le c < +\infty$. From (2.5), (2.6) and (2.7) we shall derive $a < b < c$
and

(3.6)    $\int_0^{\infty} \{ 1 - (a^2 + u^2)(b^2 + u^2)^{-1/2}(c^2 + u^2)^{-1/2} \}\,du = \pi,$

(3.7)    $\int_0^{b} (a^2 - u^2)(b^2 - u^2)^{-1/2}(c^2 - u^2)^{-1/2}\,du = 0,$

(3.8)    $\int_b^{c} (u^2 - a^2)(u^2 - b^2)^{-1/2}(c^2 - u^2)^{-1/2}\,du = \pi.$

Also from (2.17) by the same limiting process ($n = n_\nu \to \infty$),

(3.9)    $\frac{a^4}{b^2 c^2} = \frac{8}{a^2} - \frac{4}{b^2} - \frac{4}{c^2}.$

Instead of (3.7) we can show more precisely

(3.10)    $\lim\, n \int_1^{A_n} (A_n - t)(B_n - t)^{-1/2}(C_n - t)^{-1/2}(t^2 - 1)^{-1/2}\,dt = \int_0^{a} (a^2 - u^2)(b^2 - u^2)^{-1/2}(c^2 - u^2)^{-1/2}\,du,$

$\qquad\ \lim\, n \int_{A_n}^{B_n} (t - A_n)(B_n - t)^{-1/2}(C_n - t)^{-1/2}(t^2 - 1)^{-1/2}\,dt = \int_a^{b} (u^2 - a^2)(b^2 - u^2)^{-1/2}(c^2 - u^2)^{-1/2}\,du,$

the two limits being the same.

First, (3.9) is obvious and this equation shows that $a = b = c$ is impossible.
In case $a < b = c$ both formulas (3.10) follow easily [writing $t = 1 + u^2/(2n^2)$];
but the first limit is finite and the second one turns out to be $+\infty$, which is a
contradiction. In case $a = b < c$ the same formulas can be easily established
again, but the first limit is positive whereas the second one is 0 [since
$\max\{ (t - A_n)(C_n - t)^{-1/2}(t^2 - 1)^{-1/2} \}$, $A_n \le t \le B_n$, is bounded]. Therefore
$a < b < c$.

Now (3.7) and (3.8) follow directly, and (3.6) can also be easily obtained.
However (3.6) follows also from (3.8) by applying Cauchy's theorem to

$f(z) = 1 - (a^2 - z^2)(b^2 - z^2)^{-1/2}(c^2 - z^2)^{-1/2}$

integrated along the half-circle $|z| = R$, $\Re z \ge 0$ and along the segment $\Re z = 0$,
$-R \le \Im z \le +R$, $R \to +\infty$.

3. Substituting $u^2 = b^2 \sin^2\varphi$ in (3.7) and $u^2 = c^2 - (c^2 - b^2)\sin^2\varphi$ in (3.8)
we find

(3.11)    $\int_0^{\pi/2} (a^2 - b^2\sin^2\varphi)(c^2 - b^2\sin^2\varphi)^{-1/2}\,d\varphi = 0,$

(3.12)    $\int_0^{\pi/2} [c^2 - a^2 - (c^2 - b^2)\sin^2\varphi]\,[c^2 - (c^2 - b^2)\sin^2\varphi]^{-1/2}\,d\varphi = \pi.$

Using the standard notation these equations can be written in the form

(3.13)    $(1 - a^2/c^2)K = E$,  $cE' - (a^2/c)K' = \pi$

where the complete elliptic integrals $K$ and $E$ belong to the modulus $k = b/c$.
Eliminating $a^2/c^2$ we find

(3.14)    $E/K + (E' - \pi/c)/K' = 1.$

Comparing this with the classical equation⁸

(3.15)    $EK' + E'K - KK' = \pi/2$

we obtain $c = 2K$. Hence

(3.16)    $a^2 = 4K(K - E)$,  $b = 2kK$,  $c = 2K$.

The relation (3.9) furnishes the transcendental equation (1.5) of Theorem 1
(see §1) for the modulus $k$. This equation has precisely one root as $k^2$ goes
from 0 to 1 [which shows that the limits (3.1) exist as $n \to \infty$ unrestrictedly].
Indeed, differentiating the left-hand member of (1.5) with respect to $k^2$,⁹ we have

$\tfrac32 E\,\{ k'^{-2}(K - E)^2 - 1 \},$

where $k'$ is the complementary modulus. The expression in the curly bracket
increases with $k^2$, as the well-known power series expansion of $K$ and $E$ shows;
it is negative for small $k^2$ and positive as $k^2$ approaches 1. Therefore the left-hand
member of (1.5) first decreases and then increases; but for $k^2 = 0$ it is
zero and for $k^2 \to 1 - 0$ it tends to $+\infty$. This establishes Lemma 2.

Using the tables of Milne-Thomson¹⁰ we find

(3.17)    $k^2 = 0.84\cdots$,  $a^2 = 11.4055\cdots$,  $b = 4.3245\cdots$,  $c = 4.7185\cdots$,  $a^4 b^{-2} c^{-2} = 0.3124\cdots$.
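
The tabulated constants can be reproduced approximately from (3.16). The following is a minimal sketch, not the authors' computation: it assumes that the table entry for $k^2 = 0.84$ is the one used, it evaluates $K$ and $E$ by the arithmetic-geometric mean instead of tables, and the names `ellip_KE` and `limit_constants` are hypothetical. Agreement with the printed digits should be expected only to about three decimal places.

```python
import math

def ellip_KE(m):
    """Complete elliptic integrals K and E for m = k^2 (0 <= m < 1), via the AGM."""
    a, b = 1.0, math.sqrt(1.0 - m)
    s = 0.5 * m          # running sum of 2^(j-1) * c_j^2, starting with c_0^2 = k^2
    weight = 1.0
    for _ in range(12):
        c = 0.5 * (a - b)
        a, b = 0.5 * (a + b), math.sqrt(a * b)
        s += weight * c * c
        weight *= 2.0
    K = math.pi / (2.0 * a)
    return K, K * (1.0 - s)

def limit_constants(m):
    """Return (a^2, b, c) from (3.16) for the parameter m = k^2."""
    K, E = ellip_KE(m)
    return 4.0 * K * (K - E), 2.0 * math.sqrt(m) * K, 2.0 * K

if __name__ == "__main__":
    a2, b, c = limit_constants(0.84)
    print(a2, b, c, a2 * a2 / (b * b * c * c))
```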

We also note that (2.4) implies that

(3.18)    $\lim_{n\to\infty} u_n(\cos(z/n); A_n) = \cos\left\{ \int_0^z (a^2 + u^2)(b^2 + u^2)^{-1/2}(c^2 + u^2)^{-1/2}\,du \right\}$

uniformly in $z$, for all complex $z$ such that $|z| \le R$.

4. Another limiting formula important for the proof of Theorem 1, is
Lemma 3. Let $A = A_n'$ be a sequence of values such that $A_n' - 1 = o(n^{-2})$.

8 See for instance, E. T. Whittaker and G. N. Watson, A Course of Modern Analysis,
Fourth edition, 1935, p. 520.

9 See Whittaker-Watson, loc. cit. p. 521.

10 L. M. Milne-Thomson, Ten-figure table of the complete elliptic integrals K, K', E, E' and
a table of $1/\vartheta_3^2(0 \mid \tau)$, $1/\vartheta_3^2(0 \mid \tau')$, Proceedings of the London Mathematical Society, series 2,
vol. 33 (1932), pp. 160-164.





Denoting the corresponding values of $B$ and $C$ determined from the equations (2.5)
and (2.6) by $B_n'$ and $C_n'$, respectively, we have

(3.19)    $\lim_{n\to\infty} u_n(\cos(z/n); A_n') = -\cos\{ (\pi^2 + z^2)^{1/2} \}.$

The last equation holds uniformly in $z$, for all complex $z$ such that $|z| \le R$.

We note that (3.19) arises from (3.18) on writing $a = b = 0$, $c = \pi$.

For the proof we use an argument similar to that of Part 1. Let $\omega$ be fixed,
$\omega > 0$; we find [see (3.5)]

(3.20)    $\pi > \int_0^{\omega} d\psi\,\{ 1 - (A_n' - \cos(\psi/n))(B_n' - \cos(\psi/n))^{-1/2}(C_n' - \cos(\psi/n))^{-1/2} \}.$

Assuming for a certain subsequence $n = n_\nu$, $\nu \to \infty$, that the limits
$\lim n^2(B_n' - 1) = \beta$, $\lim n^2(C_n' - 1) = \gamma$
exist, we have $\beta \ge 0$, $\gamma \ge \pi^2/2$. Thus we conclude from (3.20)

$\pi \ge \int_0^{\omega} d\psi\,\{ 1 - (\psi^2/2)(\beta + \psi^2/2)^{-1/2}(\gamma + \psi^2/2)^{-1/2} \},$

so that

(3.21)    $\pi \ge \int_0^{\infty} d\psi\,\{ 1 - (\psi^2/2)(\beta + \psi^2/2)^{-1/2}(\gamma + \psi^2/2)^{-1/2} \}.$

Now

(3.22)    $\pi = \int_0^{\infty} d\psi\,\{ 1 - \psi(\pi^2 + \psi^2)^{-1/2} \};$

consequently (3.21) and (3.22) involve a contradiction, unless $\beta = 0$, $\gamma = \pi^2/2$.
Further

(3.23)    $u_n(\cos(z/n); A_n') = \cos\left\{ \int_0^z (A_n' - \cos(\zeta/n))(B_n' - \cos(\zeta/n))^{-1/2}(C_n' - \cos(\zeta/n))^{-1/2}\,d\zeta \right\}.$

Now let $0 < \varepsilon < \pi < R$ and $|z| = R$. Then

$\left| \int_0^{\varepsilon} (A_n' - \cos(\zeta/n))(B_n' - \cos(\zeta/n))^{-1/2}(C_n' - \cos(\zeta/n))^{-1/2}\,d\zeta \right|$

$\qquad \le \int_0^{\varepsilon} (A_n' - \cos(\zeta/n))^{1/2}(C_n' - \cos(\zeta/n))^{-1/2}\,d\zeta \to \int_0^{\varepsilon} \zeta(\pi^2 + \zeta^2)^{-1/2}\,d\zeta$

as $n \to \infty$; the last integral is arbitrarily small with $\varepsilon$. Integrating from $\varepsilon$ to
$z$, we can assume that $\zeta \ne 0, \pm i\pi$ on the path of integration; and the assertion
follows immediately from (3.23) for $n \to \infty$.





4. Proof of Theorem 1 

In what follows, the symbols $Q_n$, $Q_n(x_0)$ defined in §1 are used.

1. Lemma 4. Suppose $-1 \le x_0 \le +1$, and let $f_0(x)$ be a polynomial of the
class $Q_n(x_0)$ for which $\max |f'(x_0)|$, $f(x) \in Q_n(x_0)$, is attained. Then $|f_0(x)|$
assumes its maximum 1 at least $n$ times in $-1 \le x \le +1$.

The proof follows the usual lines. Let $f_0'(x_0) > 0$ and let us suppose that
the assertion of Lemma 4 does not hold. Denote by $x_1, x_2, \cdots, x_l$; $l < n$,
the distinct values in $-1 \le x \le +1$ for which $|f_0(x_\nu)| = 1$ and write $\omega(x) =
\prod_{\nu=1}^{l} (x - x_\nu)$. If $-1 < x_0 < +1$ we have $x_0 \ne x_\nu$ [otherwise $f_0'(x_0)$ would
be 0]. However if $x_0 = \pm 1$ we may have $x_0 = x_\nu$, in which case $\omega(x_0) = 0$
but $\omega'(x_0) \ne 0$.

We form the polynomial

(4.1)    $r(x) = -\sum_{\nu=1}^{l} \operatorname{sgn} f_0(x_\nu)\,\frac{\omega(x)}{\omega'(x_\nu)(x - x_\nu)} + \omega(x)\{ a(x - x_0) + b \}$

and want to determine the constants $a$ and $b$ such that $r'(x_0) > 0$, $r''(x_0) = 0$;
this can certainly be done provided the linear equations

$a\,\omega(x_0) + b\,\omega'(x_0) = G,$

$2a\,\omega'(x_0) + b\,\omega''(x_0) = H$

have a determinant $\ne 0$. Now $\omega(x_0)\omega''(x_0) - 2\{\omega'(x_0)\}^2 \ne 0$ is obvious if
$\omega(x_0) = 0$ (cf. above); but if $\omega(x_0) \ne 0$,

$\frac{\omega''(x_0)}{\omega(x_0)} - 2\left( \frac{\omega'(x_0)}{\omega(x_0)} \right)^2 = \left( \frac{\omega'(x)}{\omega(x)} \right)'_{x = x_0} - \left( \frac{\omega'(x_0)}{\omega(x_0)} \right)^2 < 0.$

Obviously $r(x)$ is of degree $l + 1 \le n$ and we find for sufficiently small $\varepsilon > 0$
that $|f_0(x) + \varepsilon r(x)| \le 1$ in $-1 \le x \le +1$; hence $f_0(x) + \varepsilon r(x)$ belongs to
$Q_n(x_0)$. On the other hand $f_0'(x_0) + \varepsilon r'(x_0) > f_0'(x_0)$ which is a contradiction.
This proves Lemma 4.

2. Let the extremum (1.2) be attained for the value $x_0$ and for $f(x) = f_0(x)$,
$f_0(x) \in Q_n(x_0)$. Then $f_0''(x_0) = 0$, and $f_0(x)$ possesses the property formulated in
Lemma 4. Further we show that $f_0'''(x_0) \ne 0$. By Lemma 4, $|f_0(x)|$ attains
its relative maximum 1 in $-1 < x < +1$ for at least $n - 2$ distinct points for
which $f_0'(x) = 0$. Since $f_0''(x)$ vanishes an odd number of times between two
consecutive roots of $f_0'(x)$, we find that $f_0''(x)$ has precisely one simple root between
two consecutive roots of $f_0'(x)$, and these roots of $f_0''(x)$ are maximum points of
$|f_0'(x)|$. The number of these maximum points is at least $n - 3$. If $x_0$ is one
of these points, we must have $f_0'''(x_0) \ne 0$. If $x_0$ is different from these maximum
points (whose number in this case is $n - 3$), then we must have again $f_0'''(x_0) \ne 0$,
and thus there is a relative maximum of $|f_0'(x)|$ at $x = x_0$.

If we assume that $f_0'(x_0) > 0$ then $f_0''(x_0) = 0$, $f_0'''(x_0) < 0$, so that $f_0'(x)$ has a
relative maximum at $x = x_0$.

Now we distinguish various cases. 





(a) $x_0 = \pm 1$.

Let $x_0 = +1$ and let us denote an extremum polynomial of our problem by
$u_n(x)$, $u_n'(1) > 0$, $u_n''(1) = 0$. As we showed before, $u_n'(x)$ has at least $n - 2$
and $u_n''(x)$ at least $n - 3$ distinct roots in $-1 < x < +1$. Since $u_n''(1) = 0$,
we find that $n - 2$ is the precise number of roots of $u_n'(x)$ in $-1 < x < +1$.
Consequently $|u_n(-1)| = |u_n(+1)| = 1$; and, since $u_n'(1) > 0$, we find

$u_n(1) = 1$,  $u_n(-1) = (-1)^{n-1}$.

Thus the curve $y = u_n(x)$, $-1 \le x \le +1$, consists of $n - 1$ monotonic arcs
varying between $+1$ and $-1$, and $u_n(1) = 1$, $u_n(-1) = (-1)^{n-1}$, $u_n'(1) > 0$,
$u_n''(1) = 0$.

Hence from the last remark of §2 we conclude that $u_n(x)$ is identical with the
polynomial $u_n(x; A_n)$ defined there.

Consequently, under the assumption $x_0 = \pm 1$, the extremum polynomials of
our problem are $\pm u_n(x; A_n)$ and $\pm u_n(-x; A_n)$, respectively. The asymptotic
value of $|u_n'(1; A_n)|$ is $a^4 b^{-2} c^{-2} \cdot n^2$ [see (3.17)].

(b) $-1 < x_0 < +1$, and there exists a polynomial $g(x)$ of $Q_n$ for which
$|g'(x_0)| > |f_0'(x_0)|$. Suppose $f_0'(x_0) > 0$, $g'(x_0) > 0$.

Consider the polynomial $h_\varepsilon(x) = f_0(x) + \varepsilon\{ g(x) - f_0(x) \}$, $0 < \varepsilon < 1$. Obviously
$h_\varepsilon(x) \in Q_n$; furthermore $h_\varepsilon'(x_0) > f_0'(x_0)$. For sufficiently small $\varepsilon$ there
is a root of $h_\varepsilon''(x)$ in the neighborhood of $x_0$, $x_0'$ say, and $h_\varepsilon'(x)$ attains a positive
relative maximum at $x = x_0'$. We evidently have

$h_\varepsilon'(x_0') \ge h_\varepsilon'(x_0) > f_0'(x_0)$

which shows that $f_0(x)$ cannot be the extremum polynomial.

5. Proof of Theorem 1 (continued)

The remaining case requires a more elaborate discussion. This case is: 

(c) $-1 < x_0 < +1$ and $f_0(x)$ is the polynomial in $Q_n$ with the maximum value
of $f'(x_0)$.

Then W. Markoff has shown¹¹ that $f_0(x)$ must be one of the polynomials

(5.1)    $\pm T_n(x)$,  $\pm T_{n-1}(x)$,  $\pm T_n\!\left( \frac{x + \alpha}{1 + \alpha} \right)$,  $\pm T_n\!\left( \frac{x - \alpha}{1 + \alpha} \right)$,  $\pm u_n(\pm x; A)$

where $0 < \alpha < \alpha_n = \tan^2[\pi/(2n)]$, and $u_n(x; A)$ are the Zolotareff polynomials
defined and discussed above. As $n \to \infty$, the largest relative maximum of
$|T_n'(x)|$ in $-1 < x < +1$ is asymptotically $M \cdot n^2$ where $-M$ is the minimum
of $\sin\theta/\theta$ for real $\theta$, that is $M = 0.2172\cdots$. Comparing this result with the
asymptotic value of $u_n'(1; A_n)$, that is with $a^4 b^{-2} c^{-2} \cdot n^2$ [see (3.17)], we see that
for large values of $n$ the four first types in (5.1) can be excluded.
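
The constant $M$ can be recovered numerically. The sketch below is illustrative only; it assumes that the global minimum of $\sin\theta/\theta$ lies in the first negative lobe $[\pi, 2\pi]$, where the function has a single interior minimum, and the names `sinc` and `min_sinc_on` are hypothetical.

```python
import math

def sinc(theta):
    """sin(theta) / theta for theta != 0."""
    return math.sin(theta) / theta

def min_sinc_on(lo=math.pi, hi=2.0 * math.pi, iterations=200):
    """Ternary search for the minimum of sin(theta)/theta on [lo, hi]."""
    for _ in range(iterations):
        m1 = lo + (hi - lo) / 3.0
        m2 = hi - (hi - lo) / 3.0
        if sinc(m1) < sinc(m2):
            hi = m2
        else:
            lo = m1
    theta = 0.5 * (lo + hi)
    return theta, sinc(theta)

if __name__ == "__main__":
    theta, value = min_sinc_on()
    print(theta, -value)   # location of the minimum and the constant M
```

At the minimum the stationarity condition is $\tan\theta = \theta$, which is the same equation that reappears in §6.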


11 Loc. cit. p. 249. 





As W. Markoff has further shown¹², $f_0(x) = \pm u_n(x; A)$ if and only if (a) $x_0$
belongs to certain open intervals in $-1 < x < +1$, and (b):

(5.2)    $\frac{d}{dx}\left( \frac{(1 - x^2)\,u_n'(x; A)}{x - A} \right) = 0$    at $x = x_0$.

Since $u_n'(x_0; A) \ne 0$, and $u_n''(x_0; A) = 0$, the latter-mentioned condition implies
that

(5.3)    $x_0 = A - (A^2 - 1)^{1/2}$,  $A - 1 = (1 - x_0)^2/(2x_0)$,

so that $0 < x_0 < +1$. Now we distinguish again two cases:

(c') $0 < x_0 \le (1 - 16n^{-2})^{1/2}$. According to S. Bernstein's theorem

(5.4)    $|u_n'(x; A)| \le n(1 - x^2)^{-1/2} \le n^2/4.$

(c'') $(1 - 16n^{-2})^{1/2} < x_0 < 1$. Then $A - 1 = A_n' - 1 = O(n^{-4})$. Now
we assume that this case occurs for an infinite number of values of $n$, and we
write $x_0 = \cos(z_0/n)$; then $z_0$ is bounded. From Lemma 3 we conclude that

(5.5)    $\lim_{n\to\infty} n^{-2}\,u_n'(\cos(z/n); A_n') = -\frac{\sin((\pi^2 + z^2)^{1/2})}{(\pi^2 + z^2)^{1/2}}.$

The maximum of the absolute value of the last expression for real $z$ is
$M = 0.2172\cdots$ so that this case can be also eliminated.

The assumption $f_0(x) = \pm u_n(-x; A)$ can be dealt with similarly.

Thus for large $n$ only Case (a) remains. This completes the proof of
Theorem 1.


6. Proof of Theorem 2 

1. First we consider again the case (c) defined in §5 and let $x_0$ belong to one
of the open intervals in $-1 \le x \le +1$ in which the maximum of $f'(x_0)$, $f(x) \in Q_n$,
is attained for the Zolotareff polynomial $f(x) = u_n(x; A)$. [The argument is similar
for $-u_n(x; A)$ or $\pm u_n(-x; A)$.] Then $f(x) = u_n(x; A) = f_0(x)$, where $f_0(x)$ has
the same meaning as in §§4 and 5, so that $f_0(x) \in Q_n(x_0)$; that is, $f_0''(x_0) = 0$.
We have $f_0'(x_0) > 0$, $f_0'''(x_0) < 0$.

By an important theorem of W. Markoff¹³, to every positive $\varepsilon$ correspond
values $x_1$ such that¹⁴

($\alpha$) $0 < |x_1 - x_0| < \varepsilon$;

($\beta$) if $f_1(x) = u_n(x; A')$ denotes the polynomial of $Q_n$ for which $f'(x_1)$ becomes
a maximum, then

(6.1)    $f_1'(x_1) > f_0'(x_0).$

12 Loc. cit. pp. 233-246.

13 Loc. cit. p. 257.

14 In fact, a whole half-neighborhood of $x_0$ satisfies this condition.





Now if $\varepsilon$ is sufficiently small, $f_1''(x)$ will have a root, say $x_1'$, in the neighborhood
of $x_0$; we can assume that $-1 < x_1' < +1$. Also $f_1'''(x_1') < 0$, so that
$f_1'(x)$ has a relative maximum at $x = x_1'$; hence

(6.2)    $f_1'(x_1') \ge f_1'(x_1) > f_0'(x_0),$

which shows that $f_0(x)$ can not be the solution of our problem.

This argument leaves as the only possibilities for $f_0(x)$ either the Zolotareff
polynomials $\pm u_n(x; A_n)$ with $x_0 = \pm 1$, or the Tchebycheff polynomials $\pm T_n(x)$.

2. Let $D_n$ be the largest root of $u_n(x; A_n)$, $B_n < D_n < C_n$. Using the convexity
of $u_n(x; A_n)$ for $x > 1$, we deduce

(6.3)    $D_n - B_n > C_n - D_n.$

Further we make use of a theorem of I. Schur on the largest roots of the derivatives
of an algebraic equation with only real roots.¹⁵ Applying this theorem
to $u_n(x; A_n)$ we obtain

(6.4)    $D_n - A_n \le A_n - 1$

so that

$2(A_n - 1) \ge D_n - 1 > \tfrac12 (B_n - 1 + C_n - 1) > \{ (B_n - 1)(C_n - 1) \}^{1/2}.$

Hence, from (2.8),

(6.5)    $u_n'(1; A_n) > n^2/4.$


3. On the other hand we show that

(6.6)    $|T_n'(x)| \le n^2/4$  if  $T_n''(x) = 0$

provided $n \ge 5$ (with equality only if $n = 5$). Incidentally, I. Schur has proved
(6.6) for all large $n$.¹⁶

Let $\varphi$ be a root of the equation $\tan n\varphi = n \tan\varphi$, $0 < \varphi < \pi/2$. Then the
assertion is

(6.7)    $n \left| \frac{\sin n\varphi}{\sin\varphi} \right| = n^2 (n^2 \sin^2\varphi + \cos^2\varphi)^{-1/2} \le n^2/4$,  i.e.,  $\sin\varphi \ge \left( \frac{15}{n^2 - 1} \right)^{1/2}.$

It is sufficient to show this for the largest root $x_n = \cos\varphi_n$ of $T_n''(x)$; that is,
for the smallest positive value $\varphi_n$, $\pi < n\varphi_n < 3\pi/2$, satisfying the equation
above.

The function

(6.8)    $h(\psi) = \frac{\tan n\psi}{n \tan\psi}$

15 I. Schur, Zwei Sätze über algebraische Gleichungen mit lauter reellen Wurzeln, Journal
für die reine und angewandte Mathematik, vol. 144 (1914), pp. 75-88.

16 I. Schur, loc. cit.², p. 277.





increases from 0 to $+\infty$ as $\psi$ increases from $\pi/n$ to $3\pi/(2n)$. Let $z$ be the smallest
positive root of the equation $\tan z = z$, $\pi < z < 3\pi/2$. Since

(6.9)    $h(z/n) = \frac{z}{n \tan(z/n)} < 1,$

we have $\varphi_n > z/n$, so that (6.7) follows from

(6.10)    $\sin(z/n) \ge \left( \frac{15}{n^2 - 1} \right)^{1/2}.$

Since $n \sin(z/n)$ increases and $n^2/(n^2 - 1)$ decreases as $n$ increases, the last
inequality will be proved for $n \ge 6$ if we prove it for $n = 6$. But

(6.11)    $\sin(z/6) \ge (3/7)^{1/2} = 0.6546\cdots$

since¹⁷ $z = 4.4934\cdots$ and $\sin(z/6) = 0.6808\cdots$.
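
The numerical values used in (6.11) are easy to reproduce. The following is a small sketch rather than the tabular lookup cited in the text; the function name `root_tan_z_eq_z` is hypothetical, and the equation $\tan z = z$ is rewritten as $\sin z - z\cos z = 0$ to avoid the poles of the tangent.

```python
import math

def root_tan_z_eq_z(lo=math.pi, hi=1.5 * math.pi):
    """Bisection for the root of tan z = z in (pi, 3*pi/2)."""
    f = lambda z: math.sin(z) - z * math.cos(z)   # pole-free form of tan z - z
    for _ in range(100):
        mid = 0.5 * (lo + hi)
        if f(lo) * f(mid) <= 0.0:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)

if __name__ == "__main__":
    z = root_tan_z_eq_z()
    print(z, math.sin(z / 6.0), math.sqrt(3.0 / 7.0))
```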

In the case $n = 5$ we have

(6.12)    $T_5''(x) = 320x^3 - 120x$,  $x_5 = \cos\varphi_5 = (3/8)^{1/2}$,  $\sin\varphi_5 = (5/8)^{1/2}$.

Comparing (6.5) and (6.6) we obtain $\pm u_n(\pm x; A_n)$ as the only eligible extremum
polynomials [and $x_0 = \pm 1$ as the points at which the extremum is
obtained] provided $n \ge 5$.
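
As an informal check of (6.6) at the largest root of $T_n''$, one can solve $\tan n\varphi = n\tan\varphi$ on $(\pi/n, 3\pi/(2n))$ and evaluate the middle expression of (6.7). This sketch is an illustration for a few small $n$ only, not a proof; the helper names are hypothetical.

```python
import math

def largest_root_angle(n):
    """Smallest phi in (pi/n, 3*pi/(2n)) with tan(n*phi) = n*tan(phi)."""
    # Pole-free form: sin(n*phi) cos(phi) - n sin(phi) cos(n*phi) = 0.
    g = lambda p: math.sin(n * p) * math.cos(p) - n * math.sin(p) * math.cos(n * p)
    lo, hi = math.pi / n, 1.5 * math.pi / n   # g(lo) > 0 > g(hi) for n >= 4
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if g(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def abs_T_prime(n, phi):
    """|T_n'(cos phi)| at a root of T_n'', using the identity in (6.7)."""
    return n * n / math.sqrt(n * n * math.sin(phi) ** 2 + math.cos(phi) ** 2)

if __name__ == "__main__":
    for n in range(5, 13):
        phi = largest_root_angle(n)
        print(n, abs_T_prime(n, phi), n * n / 4.0)
```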

7. Proof of Theorem 2 (continued) 

The previous result holds also for $n = 4$, as a direct discussion shows; however,
it fails for $n = 3$.

1. We have for $n = 4$:

(7.1)    $T_4(x) = 8x^4 - 8x^2 + 1$,  $T_4'(x) = 32x^3 - 16x$,  $T_4''(x) = 96x^2 - 16$,

so that, with the same notation as before, $x_4 = 6^{-1/2}$ and

(7.2)    $|T_4'(x_4)| = (16/3)(2/3)^{1/2} = 4.3546\cdots$.

On the other hand, let us denote by $y_1$ and $y_2$ the values of $x$ for which the
relative extrema of $u_4(x; A_4)$ in $-1 \le x \le +1$ are attained; thus $-1 < y_1 <
y_2 < +1$, say. Then

(7.3)    $u_4(x; A_4) = 1 - \lambda(1 - x)(B_4 - x)(y_1 - x)^2$

must satisfy the following conditions:

(7.4)
($\alpha$):  $u_4(-1; A_4) = -1$,    $\lambda(B_4 + 1)(y_1 + 1)^2 = 1$,
($\beta$):  $u_4(y_2; A_4) = -1$,    $\lambda(1 - y_2)(B_4 - y_2)(y_1 - y_2)^2 = 2$,
($\gamma$):  $u_4'(y_2; A_4) = 0$,    $\frac{1}{y_2 - 1} + \frac{1}{y_2 - B_4} + \frac{2}{y_2 - y_1} = 0$,
($\delta$):  $u_4''(1; A_4) = 0$,    $2B_4 + y_1 = 3$.

17 See, for instance, E. Jahnke-F. Emde, Funktionentafeln, 1933, p. 30. 





Hence $B_4 < 2$. Let

(7.5)    $1 - y_2 = h(B_4 - 1)$,

then

$B_4 - y_2 = (h + 1)(B_4 - 1)$,  $y_2 - y_1 = (2 - h)(B_4 - 1)$;

and ($\gamma$) becomes:

(7.6)    $\frac{1}{h} + \frac{1}{h + 1} - \frac{2}{2 - h} = 0$;  i.e.,  $h = (1 + (33)^{1/2})/8 = 0.8430\cdots$.

Further, writing $v(x) = x(x + 1)(x - 2)^2$, we obtain from ($\alpha$) and ($\beta$)

(7.7)    $v\!\left( \frac{2}{B_4 - 1} \right) = v(h).$

Since $v(x) = v(h)$ has $h$ as a double root, it can be reduced to a quadratic equation
giving

(7.8)    $\frac{2}{B_4 - 1} = 3/2 - h + \tfrac12 (10h + 5)^{1/2} = 2.4893\cdots$.

Now

(7.9)    $u_4'(1; A_4) = \lambda(B_4 - 1)(1 - y_1)^2 = \cdots = 4.7881\cdots$.

Comparison of this value with (7.2) furnishes $u_4(x; A_4)$ as the solution.
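
The chain (7.5) to (7.9) can be followed numerically. The sketch below is only a consistency check under stated assumptions: it takes $h$ from (7.6) and $2/(B_4 - 1)$ from (7.8), reads condition ($\delta$) as $2B_4 + y_1 = 3$ (which is what $u_4''(1) = 0$ gives for the form (7.3)), and uses hypothetical function names. The derivative value it produces should be expected to match the printed $4.7881\cdots$ only to about three significant figures.

```python
import math

def n4_constants():
    """Return (h, B4, y1, y2, lam) for u_4(x; A_4) in the form (7.3)."""
    h = (1.0 + math.sqrt(33.0)) / 8.0                     # (7.6)
    ratio = 1.5 - h + 0.5 * math.sqrt(10.0 * h + 5.0)     # 2 / (B4 - 1), from (7.8)
    B4 = 1.0 + 2.0 / ratio
    y1 = 3.0 - 2.0 * B4                                   # condition (delta), as read above
    y2 = 1.0 - h * (B4 - 1.0)                             # (7.5)
    lam = 1.0 / ((B4 + 1.0) * (y1 + 1.0) ** 2)            # condition (alpha)
    return h, B4, y1, y2, lam

def u4_prime_at_1(B4, y1, lam):
    """First expression in (7.9)."""
    return lam * (B4 - 1.0) * (1.0 - y1) ** 2

if __name__ == "__main__":
    h, B4, y1, y2, lam = n4_constants()
    print(h, B4, y1, y2, lam, u4_prime_at_1(B4, y1, lam))
```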

2. Finally in the case $n = 3$,

(7.10)    $T_3(x) = 4x^3 - 3x$,  $T_3'(x) = 12x^2 - 3$,  $T_3''(x) = 24x$,  $x_3 = 0$,  $|T_3'(x_3)| = 3$.

On the other hand,

(7.11)    $u_3(x; A_3) = 1 - \lambda(1 - x^2)(B_3 - x)$

with a relative minimum at $x = y_1$, $-1 < y_1 < +1$, satisfies the following
conditions:

(7.12)
($\alpha$):  $u_3(y_1; A_3) = -1$,    $\lambda(1 - y_1^2)(B_3 - y_1) = 2$,
($\beta$):  $u_3'(y_1; A_3) = 0$,    $3y_1^2 - 2B_3 y_1 - 1 = 0$,
($\gamma$):  $u_3''(1; A_3) = 0$,    $B_3 = 3$,

so that

(7.13)    $u_3(x; A_3) = 1 - 3^{3/2}(1 - x^2)(3 - x)/8$,  $u_3'(1; A_3) = 3^{3/2}/2 < 3$.

This completes the proof of Theorem 2.
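
The case $n = 3$ is explicit enough to be checked directly, including against the general formula (2.8). In the sketch below the values $A_3 = 1 + 2/\sqrt{3}$ and $C_3 = 1 + 4/\sqrt{3}$ are not taken from the text; they are computed here from (7.13) as the root of $u_3'$ beyond 1 and the point beyond $B_3$ where $u_3 = -1$, and the function names are hypothetical.

```python
import math

LAM = 3.0 * math.sqrt(3.0) / 8.0   # lambda = 3^(3/2) / 8 in (7.13)

def u3(x):
    return 1.0 - LAM * (1.0 - x * x) * (3.0 - x)

def du3(x):
    return LAM * (1.0 + 6.0 * x - 3.0 * x * x)

def d2u3(x):
    return LAM * (6.0 - 6.0 * x)

# Constants derived from (7.13) for this illustration (see the lead-in).
Y1 = 1.0 - 2.0 / math.sqrt(3.0)    # interior minimum point
A3 = 1.0 + 2.0 / math.sqrt(3.0)    # root of u3' to the right of 1
B3 = 3.0
C3 = 1.0 + 4.0 / math.sqrt(3.0)    # point beyond B3 where u3 = -1

def formula_2_8(n, A, B, C):
    """Right-hand side of (2.8): n^2 (A-1)^2 / ((B-1)(C-1))."""
    return n * n * (A - 1.0) ** 2 / ((B - 1.0) * (C - 1.0))

if __name__ == "__main__":
    print(du3(1.0), formula_2_8(3, A3, B3, C3), 3.0 ** 1.5 / 2.0)
```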





8. Two problems of Zolotareff 

1. The previous considerations permit a very simple approach to the following
interesting theorem of Zolotareff:¹⁸

Theorem 3. Let $\sigma$ be a given positive number and $f(x)$ an arbitrary polynomial
of degree $n$ of the form

(8.1)    $f(x) = x^n - \sigma x^{n-1} + \cdots$.

Then $\max |f(x)|$, $-1 \le x \le +1$, is minimized if and only if

(a) $f(x) = \text{const.}\ u_n(x; A)$ provided $\sigma \ge n\alpha_n$,

(b) $f(x) = 2^{1-n}(1 + \sigma/n)^n\,T_n\!\left( \frac{x - \sigma/n}{1 + \sigma/n} \right)$ provided $0 < \sigma \le n\alpha_n$.

Here $u_n(x; A)$ denotes the polynomial (2.4); and in case (a) $A = A(\sigma)$ is a uniquely
determined function which increases monotonically from $1$ to $+\infty$ as $\sigma$ increases
from $n\alpha_n$ to $+\infty$; $\alpha_n = \tan^2[\pi/(2n)]$.

A corresponding result holds for negative $\sigma$, obtained by replacing $f(x)$ by
$(-1)^n f(-x)$. For $\sigma = 0$ the extremum is given by Tchebycheff's polynomial.
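From (2.4) we obtain, for x > C,

$$-u_n(x; A) = Rx^n - Sx^{n-1} + \cdots = \cosh\Big\{ n \int_C^x (t - A)(t - B)^{-1/2}(t - C)^{-1/2}(t^2 - 1)^{-1/2}\, dt \Big\}$$

$$= \cosh\Big\{ n(\log x - \log C) + n \int_C^{\infty} \big[(t - A)(t - B)^{-1/2}(t - C)^{-1/2}(t^2 - 1)^{-1/2} - t^{-1}\big]\, dt - n \int_x^{\infty} \big[(t - A)(t - B)^{-1/2}(t - C)^{-1/2}(t^2 - 1)^{-1/2} - t^{-1}\big]\, dt \Big\};$$

so that, as $x \to +\infty$,

$$-u_n(x; A) = \tfrac12 (x/C)^n \exp\Big\{ n \int_C^{\infty} \big[(t - A)(t - B)^{-1/2}(t - C)^{-1/2}(t^2 - 1)^{-1/2} - t^{-1}\big]\, dt - n \int_x^{\infty} \big[(\tfrac12(B + C) - A)\,t^{-2} + O(t^{-3})\big]\, dt \Big\} + O(x^{-n}).$$

Consequently

(8.2)  $R = \tfrac12 C^{-n} \exp\Big\{ n \int_C^{\infty} \big[(t - A)(t - B)^{-1/2}(t - C)^{-1/2}(t^2 - 1)^{-1/2} - t^{-1}\big]\, dt \Big\} > 0,\qquad S/R = n\{\tfrac12(B + C) - A\} > 0,$

so that R and S are continuous functions of A.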


18 Loc. cit.* (a), (b), (c).





From the results of Lemma 1,

(8.3)  $\left(\dfrac{d}{dx}\right)^{n} u_n(x; A) = -n!\,R$

is an increasing function of A. Let $A_1 < A_2$, and let $R_1, S_1, R_2, S_2$ be the corresponding values of R and S, $R_1 > R_2$. Considering $R_1^{-1} u_n(x; A_1) - R_2^{-1} u_n(x; A_2)$ at the extremum points of $u_n(x; A_2)$ in $-1 \le x \le +1$ we see that it cannot be of degree n − 2, so that $S_1/R_1 \ne S_2/R_2$. Hence S/R is monotonic. Its minimum value is attained for

$$u_n(x; +1) = -T_n\!\left(\frac{x - a_n}{1 + a_n}\right),$$

so that $\min (S/R) = n a_n$. Its maximum value is attained for $u_n(x; +\infty) = T_{n-1}(x)$, so that $\max (S/R) = +\infty$.

Now let f(x) be a polynomial of the form (8.1), and let $\sigma \ge n a_n$. Then there exists a definite polynomial $u_n(x; A)$, $A = A(\sigma) \ge 1$, for which S/R = σ so that

(8.4)  $d(x) = f(x) + R^{-1} u_n(x; A)$

is of degree n − 2. Let $\max |f(x)| \le R^{-1}$, $-1 \le x \le +1$. Then the polynomial (8.4) is alternately $\ge 0$ and $\le 0$ at the points at which $u_n(x; A) = \pm 1$. Unless $d(x) \equiv 0$ this gives n − 1 distinct points at which $d'(x)$ is alternately > 0 and < 0, and hence n − 2 roots for $d'(x)$, which is impossible.

2. The argument is similar in the other case, $0 < \sigma < n a_n$, since the polynomial

(8.5)  $2^{1-n}(1 + \sigma/n)^n\, T_n\!\left(\dfrac{x - \sigma/n}{1 + \sigma/n}\right) = x^n - \sigma x^{n-1} + \cdots$

assumes its maximum modulus $2^{1-n}(1 + \sigma/n)^n$ precisely n times in $-1 \le x \le +1$.

Replacing $-R^{-1} u_n(x; A)$ in (8.4) by the left-hand side of (8.5), we obtain the desired result.
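The normalization in (8.5) can be confirmed with exact rational arithmetic. This is a minimal sketch, not part of the paper: the helper names `chebyshev_coeffs`, `compose_linear` and `zolotareff_case_b` are hypothetical, and the argument $(x - \sigma/n)/(1 + \sigma/n)$ of $T_n$ is the one for which the two leading coefficients agree with (8.1).

```python
from fractions import Fraction

def chebyshev_coeffs(n):
    """Coefficients of T_n, lowest degree first, via T_{k+1} = 2x T_k - T_{k-1}."""
    prev, cur = [Fraction(1)], [Fraction(0), Fraction(1)]
    if n == 0:
        return prev
    for _ in range(n - 1):
        nxt = [Fraction(0)] + [2 * c for c in cur]   # 2x * T_k
        for i, c in enumerate(prev):
            nxt[i] -= c                               # minus T_{k-1}
        prev, cur = cur, nxt
    return cur

def poly_mul(p, q):
    out = [Fraction(0)] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return out

def compose_linear(p, a, b):
    """Coefficients of p(a*x + b), lowest degree first."""
    out = [Fraction(0)]
    power = [Fraction(1)]                             # (a*x + b)^k
    for c in p:
        out = out + [Fraction(0)] * (len(power) - len(out))
        for i, d in enumerate(power):
            out[i] += c * d
        power = poly_mul(power, [b, a])
    return out

def zolotareff_case_b(n, sigma):
    """Coefficients of 2^(1-n) (1 + s)^n T_n((x - s)/(1 + s)) with s = sigma/n."""
    s = Fraction(sigma) / n
    scale = Fraction(2) ** (1 - n) * (1 + s) ** n
    shifted = compose_linear(chebyshev_coeffs(n), 1 / (1 + s), -s / (1 + s))
    return [scale * c for c in shifted]

coeffs = zolotareff_case_b(4, Fraction(1, 3))
print(coeffs[4], coeffs[3])   # leading coefficient and coefficient of x^(n-1)
```

The coefficient check does not depend on the restriction $\sigma \le n a_n$; that restriction is only needed for the polynomial to attain its maximum modulus n times in the interval.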

3. Another theorem of Zolotareff is the following¹⁹:

Theorem 4. Let $x_0$, $y_0$ be arbitrary real numbers, of which $x_0 > 1$, and let f(x) be an arbitrary polynomial of degree n satisfying the conditions

(8.6)  $f(x) = x^n + \cdots,\qquad f(x_0) = y_0.$

Then $\max |f(x)|$, $-1 \le x \le +1$, is a minimum if and only if f(x) is one of the polynomials

(8.7)  $-R^{-1} u_n(x; A),\qquad 2^{1-n}(1 + a)^n\, T_n\!\left(\dfrac{x - a}{1 + a}\right),\qquad 2^{1-n}(1 + a)^n\, T_n\!\left(\dfrac{x + a}{1 + a}\right),\qquad (-1)^{n-1} R^{-1} u_n(-x; A).$

19 Loc. cit.* (b), p. 27, (c), p. 371.






Here $A \ge 1$, $0 \le a \le a_n = \tan^2[\pi/(2n)]$ are certain numbers uniquely determined by $x_0$ and $y_0$.

The values of the polynomials (8.7) at $x = x_0$ increase

from $-\infty$ to $2^{1-n}(1 + a_n)^n\, T_n\!\left(\dfrac{x_0 - a_n}{1 + a_n}\right) = \beta$ as A decreases from $+\infty$ to +1;

from β to $2^{1-n} T_n(x_0)$ as a decreases from $a_n$ to 0;

from $2^{1-n} T_n(x_0)$ to $2^{1-n}(1 + a_n)^n\, T_n\!\left(\dfrac{x_0 + a_n}{1 + a_n}\right) = \beta'$ as a increases from 0 to $a_n$;

from β' to $+\infty$ as A increases from 1 to $+\infty$,

respectively. These facts determine for a given $y_0$ the extremum polynomial $f_0(x)$ in question. Indeed, consider the difference $f(x) - f_0(x)$ at the points in $-1 \le x \le +1$ at which $f_0(x) = \pm 1$, and in addition at $x = x_0$. Since this difference is alternately $\ge 0$ and $\le 0$ at these n + 1 points, the usual argument gives n − 1 distinct roots for its derivative [unless $f(x) \equiv f_0(x)$], which is impossible.

4. The problem defined by the condition

(8.8)  $f^{(k)}(x_0) = y_0$

where $1 \le k \le n - 1$, $x_0 > 1$, and $y_0$ is arbitrary, can be treated in a similar manner. For k = n − 1 we obtain the first problem dealt with above.

9. A further application 

The previous considerations furnish another property of the polynomials $u_n(x; A)$ of Zolotareff which play a role in the interesting investigations of W. Markoff [see 4].

1. We prove the following application of Lemma 1:

Theorem 5. Let

(9.1)  $1 > x_1 > z_1 > x_2 > z_2 > \cdots > x_{n-2} > z_{n-2} > x_{n-1} > -1$

be the values of x characterized by the conditions

(9.2)  $u_n(x_\nu; A) = 0,\qquad \nu = 1, 2, \ldots, n - 1,$

(9.3)  $u_n'(z_\nu; A) = 0,\qquad \nu = 1, 2, \ldots, n - 2;$

then the functions $x_\nu = x_\nu(A)$ and $z_\nu = z_\nu(A)$ increase as A increases.²⁰

The roots $x_\nu$ of $u_n(x; A)$ satisfy the equation

(9.4)  $\displaystyle\int_{x_\nu}^{1} (A - t)(B - t)^{-1/2}(C - t)^{-1/2}(1 - t^2)^{-1/2}\, dt = (\nu - \tfrac12)\pi/n,\qquad \nu = 1, 2, \ldots, n - 1.$

20 Concerning $x_\nu$, see W. Markoff, loc. cit. p. 242. The largest root D = D(A) also increases, as can be concluded from the result of §2, No. 4.





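We can assume that A = A(p), B = B(p), C = C(p) are increasing functions of a parameter p, p > 0, all these functions having continuous derivatives. Then $x_\nu = x_\nu(p)$ and

$$(A - x_\nu)(B - x_\nu)^{-1/2}(C - x_\nu)^{-1/2}(1 - x_\nu^2)^{-1/2}\, x_\nu'(p)$$

$$= \int_{x_\nu}^{1} \frac{\partial}{\partial p}\Big\{(A - t)(B - t)^{-1/2}(C - t)^{-1/2}(1 - t^2)^{-1/2}\Big\}\, dt$$

$$= \int_{x_\nu}^{1} (B - t)^{-1/2}(C - t)^{-1/2}(1 - t^2)^{-1/2}\Big\{A'(p) - \tfrac12 B'(p)\frac{A - t}{B - t} - \tfrac12 C'(p)\frac{A - t}{C - t}\Big\}\, dt$$

$$> \Big\{A'(p) - \tfrac12 B'(p)\frac{A + 1}{B + 1} - \tfrac12 C'(p)\frac{A + 1}{C + 1}\Big\} \int_{x_\nu}^{1} (B - t)^{-1/2}(C - t)^{-1/2}(1 - t^2)^{-1/2}\, dt$$

$$> 0$$

since the expression (2.9) increases with p.

The assertion about $z_\nu$ can be proved in a similar manner.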

2. The assertion about $z_\nu$ follows also from the following general remark. Suppose the roots of an algebraic equation are real and distinct, and that they are increasing functions of a parameter; then the same holds for the roots of the derivative. Indeed, using the notation above:

$$\frac{1}{z_\nu - x_1} + \frac{1}{z_\nu - x_2} + \cdots + \frac{1}{z_\nu - x_n} = 0,\qquad x_n = D.$$

[Here $x_n = D$ denotes the only root of $u_n(x; A)$ which is > 1.] Differentiating this relation,

$$\sum_{k=1}^{n} \frac{z_\nu' - x_k'}{(z_\nu - x_k)^2} = 0,$$

so that $x_k' > 0$ implies $z_\nu' > 0$.

Repeated application of this argument shows that the roots of all derivatives $u_n^{(k)}(x; A)$ increase as A increases.


University of Pennsylvania 
Stanford University 



Annals of Mathematics 
Vol. 43, No. 3, July, 1942 


ON ABEL’S CONVERSE THEOREM 

By Guido Fubini 

(Received March 20, 1942) 

1. Introduction 

In studying Lie's famous papers¹ on the surfaces of translation, I began to think that it was perhaps useful to consider them as the statement of a theorem which, in Lie's case, is the converse of Abel's theorem, so fundamental in the theory of Abelian integrals. Lie generalized his results to other manifolds of translation. I think it will be possible to look for a generalization in another direction; in other words, it is perhaps possible to find new theorems which may be considered as the converse of Abel's theorem. If we start from a curve of genus p, we obtain some particular manifolds of translation of p − 1 dimensions. This has no geometrical meaning if, for instance, p = 1; but the question of stating for p = 1 a theorem which is the converse of Abel's theorem is always interesting, at least in my opinion. Poincaré also occupied himself with Lie's results and looked for some generalizations of the surfaces of translation. Though my point of view is different, there are some analogies between Poincaré's problem² and the problem to which this paper is devoted. But the methods of this paper are, I think, quite new; Lie starts from a system of two partial differential equations with one unknown, whereas I start from one ordinary differential equation with four unknowns (two of which are functions of a parameter u, and two functions of an independent parameter v). And I succeed in transforming this equation into a reduced equation which contains only the first derivatives with respect to one of the parameters u, v. After this the problem is easily solved, but there are many particular cases to be considered (see the conclusion of this paper). On the other hand it is not possible to take advantage of Poincaré's methods. We do not suppose from the beginning that the functions which determine our curves are analytic. Their analyticity will be a consequence of the differential equation referred to above. But even if we supposed that they are analytic, it would not be possible to make use of Poincaré's method. Let us suppose that $c_1$ is a curve defined by analytic functions. I choose on it a point $A_1$; afterward I define another analytic curve $c_2$ by giving the coordinates of its points $A_2$ as analytic functions of the slope m of the line $A_1A_2$ joining $A_2$ (variable on $c_2$) with the point $A_1$ chosen on $c_1$. These functions may exist in a certain region of the plane of m, which we now consider as a complex variable. If this region does not contain the slope of the line t which is tangent to $c_1$ at $A_1$, this tangent will neither intersect $c_2$, nor be

1 See Lie: Berichte ü. die Verhandl. der Kön. Sächs. Ges. der Wissensch. zu Leipzig, 1896, Bd. 48, p. 141. Comptes Rendus 1892 (1st Sem.), pp. 277, 331.

2 See Bull. de la Soc. Mathém. de France, 1901, tome 29, p. 61. (See also Journal de Liouville, 1895, tome 1, p. 219.)




tangent to it; and it is possible that all the straight lines of a certain neighborhood of t (for instance, the tangents t' to $c_1$ at the points $A_1'$ of a certain neighborhood of $A_1$) have the same property. In this case, and in other analogous cases, it is not possible to make use of Poincaré's methods, and I have not succeeded in demonstrating directly that the preceding suppositions are absurd for the curves which we shall study in our problem. In order to state this problem clearly, I recall that the simplest form of Abel's theorem for the curve of genus 1 is the following. For a non-rational cubic we can choose the Abelian integral of first kind in such a way that its values at three points of the cubic have a sum equal to zero if and only if the three points are collinear. (I shall later speak of the generalizations to six points lying on the same conic etc.) I now state the following converse problem. Let us suppose we have three real arcs of curves c, γ, C defined by parametric equations in which x, y are Cartesian, or, more generally, non-homogeneous projective coordinates, and u, v, w are the parameters:

$$(c)\ \begin{cases} x = f_1(u), \\ y = f_2(u), \end{cases}\qquad (\gamma)\ \begin{cases} x = \varphi_1(v), \\ y = \varphi_2(v), \end{cases}\qquad (C)\ \begin{cases} x = F_1(w), \\ y = F_2(w), \end{cases}$$

and that three points, one on every curve, are collinear if and only if

(1.1)  $u + v + w = 0.$

Can we deduce that the three curves belong to the same curve Γ of third degree and that the corresponding parameters are identical with the Abelian integral of first kind corresponding to this curve? The answer will be given in this paper; it is positive, but we must take into account the cases where the cubic degenerates (into a conic and a straight line or into three straight lines) or is rational (possesses a double point or a cusp).

We could also state a more general problem by substituting

$$\Phi_1(u) + \Phi_2(v) + \Phi_3(w) = 0$$

for (1.1) and by supposing that the Φ are continuously differentiable functions of the corresponding parameter; but, by a suitable change of parameters, we realize that the new problem is equivalent to the preceding.

2. The fundamental equation

I choose at random two of the three curves c, γ, C, for instance c, γ, and I suppose that neither the former nor the latter is the line at infinity (x = ∞, y = ∞). (By means of a collineation we can always satisfy this condition.) On the straight line joining a point $(f_1, f_2)$ of c with a point $(\varphi_1, \varphi_2)$ of γ I must find a point $(F_1, F_2)$ of C. Its coordinates will be given by

(1.2)  $F_1 = k f_1 + (1 - k)\varphi_1,\qquad F_2 = k f_2 + (1 - k)\varphi_2,$





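in which k is unknown. This point belongs to C; therefore its coordinates must be functions of w = −(u + v) (see (1.1)). Therefore

$$\frac{\partial F_1}{\partial u} = \frac{\partial F_1}{\partial v},\qquad \frac{\partial F_2}{\partial u} = \frac{\partial F_2}{\partial v},$$

or

$$(f_i - \varphi_i)\left(\frac{\partial k}{\partial u} - \frac{\partial k}{\partial v}\right) + k\,(f_i' + \varphi_i') = \varphi_i'\qquad (i = 1, 2)\qquad \left[f_i' = \frac{df_i}{du},\ \varphi_i' = \frac{d\varphi_i}{dv}\right].$$

We have obtained two linear equations in two unknowns: k and $\partial k/\partial u - \partial k/\partial v$. By solving these equations, we obtain

(2.2)  $\Delta k = \varphi_2'[f_1 - \varphi_1] - \varphi_1'[f_2 - \varphi_2],$

(3.2)  $\Delta\left(\dfrac{\partial k}{\partial u} - \dfrac{\partial k}{\partial v}\right) = \varphi_1'(f_2' + \varphi_2') - \varphi_2'(f_1' + \varphi_1') = \varphi_1' f_2' - \varphi_2' f_1'$

in which

(4.2)  $\Delta = (f_1 - \varphi_1)(f_2' + \varphi_2') - (f_2 - \varphi_2)(f_1' + \varphi_1').$

By differentiating (2.2) with respect to u or to v, and by subtracting, we obtain:

$$\left(\frac{\partial \Delta}{\partial u} - \frac{\partial \Delta}{\partial v}\right) k + \Delta\left(\frac{\partial k}{\partial u} - \frac{\partial k}{\partial v}\right) = \varphi_2' f_1' - \varphi_1' f_2' + \varphi_1''(f_2 - \varphi_2) - \varphi_2''(f_1 - \varphi_1)\qquad \left[f_i'' = \frac{d^2 f_i}{du^2},\ \varphi_i'' = \frac{d^2 \varphi_i}{dv^2}\right].$$

By recalling (3.2) we deduce:

$$\left(\frac{\partial \Delta}{\partial u} - \frac{\partial \Delta}{\partial v}\right) k = 2(\varphi_2' f_1' - \varphi_1' f_2') + \varphi_1''(f_2 - \varphi_2) - \varphi_2''(f_1 - \varphi_1).$$

By multiplying by Δ, by comparing with (2.2), and by remarking that

$$\frac{\partial \Delta}{\partial u} - \frac{\partial \Delta}{\partial v} = (f_2'' - \varphi_2'')(f_1 - \varphi_1) - (f_1'' - \varphi_1'')(f_2 - \varphi_2)$$

we conclude that

(5.2)  $\{(f_2'' - \varphi_2'')(f_1 - \varphi_1) - (f_1'' - \varphi_1'')(f_2 - \varphi_2)\}\,\{\varphi_2'(f_1 - \varphi_1) - \varphi_1'(f_2 - \varphi_2)\}$
$\qquad = \{(f_1 - \varphi_1)(f_2' + \varphi_2') - (f_2 - \varphi_2)(f_1' + \varphi_1')\}\,\{2(\varphi_2' f_1' - \varphi_1' f_2') + \varphi_1''(f_2 - \varphi_2) - \varphi_2''(f_1 - \varphi_1)\},$

or

(6.2)  $\{\varphi_2'(f_1 - \varphi_1) - \varphi_1'(f_2 - \varphi_2)\}\,\{f_2''(f_1 - \varphi_1) - f_1''(f_2 - \varphi_2) - 2(\varphi_2' f_1' - \varphi_1' f_2')\}$
$\qquad = \{f_2'(f_1 - \varphi_1) - f_1'(f_2 - \varphi_2)\}\,\{\varphi_1''(f_2 - \varphi_2) - \varphi_2''(f_1 - \varphi_1) + 2(\varphi_2' f_1' - \varphi_1' f_2')\}.$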



The second member can be deduced from the first by interchanging the f, φ and changing the sign. This is the fundamental equation of our problem. If this equation is satisfied, and if Δ ≠ 0, then (2.2) gives us a value of k such that the functions $F_1$, $F_2$ determined by (1.2) are functions of w = −(u + v) and the three curves c, γ, C satisfy the conditions of our problem. If Δ = 0, the same method which led us to (5.2) or (6.2) proves that these equations are satisfied, which can be checked directly.

If Δ = 0, we have

$$(f_1 - \varphi_1)\varphi_2' - (f_2 - \varphi_2)\varphi_1' = -\{(f_1 - \varphi_1) f_2' - (f_2 - \varphi_2) f_1'\},$$

so that (5.2) or (6.2) are equivalent to the equation

$$\{\varphi_2'(f_1 - \varphi_1) - \varphi_1'(f_2 - \varphi_2)\}\,\{(f_2'' - \varphi_2'')(f_1 - \varphi_1) - (f_1'' - \varphi_1'')(f_2 - \varphi_2)\} = 0$$

or

$$\{\varphi_2'(f_1 - \varphi_1) - \varphi_1'(f_2 - \varphi_2)\}\left(\frac{\partial \Delta}{\partial u} - \frac{\partial \Delta}{\partial v}\right) = 0,$$

which is a consequence of the equation Δ = 0. Consequently the fundamental equations are satisfied if Δ = 0; but, in this case, (2.2) does not determine the value of k. But, if we neglect the case without any geometrical meaning (at least for our problem) that c and γ are pieces of the same straight line, the slope

$$m = \frac{f_2 - \varphi_2}{f_1 - \varphi_1}$$

of the line joining the points $(f_1, f_2)$ of c and $(\varphi_1, \varphi_2)$ of γ cannot be a constant. And the equation Δ = 0 is equivalent to the equation

$$\frac{\partial m}{\partial u} = \frac{\partial m}{\partial v},$$

which proves that m is a function of w = −(u + v). In the exceptional case Δ = 0, the conditions of our problem are satisfied too; but the third line C is the line at infinity of the plane (x, y). Therefore (if neither c, nor γ is the line at infinity) the fundamental equation gives us all the necessary and sufficient conditions for these lines c, γ.


3. Remarks on the fundamental equation 

We can arrive at this equation also by another way. If three points $(f_1, f_2)$ of c, $(\varphi_1, \varphi_2)$ of γ and $(F_1, F_2)$ are collinear, we can suppose that y = mx + n is the equation of the line joining them. Therefore

(1.3) $f_2 = m f_1 + n$,  (2.3) $\varphi_2 = m\varphi_1 + n$,  (3.3) $F_2 = m F_1 + n$,

from which we deduce

(4.3)  $m = \dfrac{f_2 - \varphi_2}{f_1 - \varphi_1}.$





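By differentiating (1.3) with respect to v, and (2.3) with respect to u, we deduce that

(5.3)  $\dfrac{\partial n}{\partial v} = -f_1 \dfrac{\partial m}{\partial v},\qquad \dfrac{\partial n}{\partial u} = -\varphi_1 \dfrac{\partial m}{\partial u}.$

Since $F_i$ are functions of w = −(u + v), we have (if i = 1, or 2):

$$\frac{\partial F_i}{\partial u} = \frac{\partial F_i}{\partial v}.$$

From (3.3) and (5.3) we deduce consequently

(6.3)  $F_1\left(\dfrac{\partial m}{\partial u} - \dfrac{\partial m}{\partial v}\right) = \varphi_1 \dfrac{\partial m}{\partial u} - f_1 \dfrac{\partial m}{\partial v}$  or  $F_1 = \dfrac{\varphi_1\, \partial m/\partial u - f_1\, \partial m/\partial v}{\partial m/\partial u - \partial m/\partial v}.$

By comparing with (1.2) we deduce that

$$k = -\frac{\partial m/\partial v}{\partial m/\partial u - \partial m/\partial v},$$

and this equation is identical with (2.2). The fundamental equation (6.2) states only that $F_1$ defined by (6.3) is a function of w = −(u + v).

The same method which led us to the fundamental equation proves that this equation is invariant under the group of collineations in the plane (x, y). In order to check this statement, we shall make use of homogeneous projective coordinates, by writing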





I shall indicate by 


Otf'f) or (x"£x), etc., 

the determinants of third order, the r th row of which (r 

av , £r, or x" , £ r , x r , etc. 
The fundamental equation can be written in the form 



1, 2, 3) is 


/l — <P\ <pl 
/ 

fl <Pl fl 

// 

+ 

fl — <pl /l 

/ " 
/I — ^>1 <pl 

J2 <P2 *P2 

fi ~ <p2 y*2 


/2 ““ <P2 f2 

J 2 ““ ^2 ^2 


/I 

! 

<Pl 

fl — ^1 

/i + vl 

fz 

/ 

<P2 

«/*2 ~ <P2 

/£ + ¥>S 





or, by supposing at first x* *» & = 1 and consequently z* = /», (i = 1, 2) 


(xt'()(xx"() + (xx'$)(xt"ti = 2(x'x() 

fi V\ A. 
ft V* B 

+ 2{x?& 

ft Vi c 

ft Vi D 


0 0 1 


0 0 1 


in which A, B, C, D are arbitrary. By supposing A = ifi , B = ^> 2 , C = /i, 
Z) = / 2 , we find 

(7.3) (atf'*)(»c"*) + 2(m)(€*'*) - + 2(xtt)(xr*). 

But it is very easy to see that if we substitute $\rho x_i$ and $\sigma \xi_i$ for $x_i$, $\xi_i$, supposing that ρ is a function of u only, and σ a function of v only, this equation does not change, because both of its members are multiplied by the same factor $\rho^3\sigma^3$. It is consequently useless to suppose $x_3 = \xi_3 = 1$. They can be arbitrary functions of u, v, respectively.

Moreover, if I transform the x, ξ by means of the same linear homogeneous transformation,³ both the members of (7.3) are multiplied by the same factor (the square of the determinant of the transformation). Therefore the form (7.3) of the fundamental equation proves that it is invariant under the group of collineations. The two members divided, for instance, by

(zxv'x&ro 

are absolute projective invariants of the pair of curves c, γ; they have a geometrical meaning if the parameters u, v have a geometrical meaning (for instance, if they are the projective length of an arc of the curves). I think that nobody has noticed these invariants heretofore.


4. Reduction to the first order 


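It does not seem to be an easy matter to study directly the fundamental equation; but we can succeed in transforming it into an equation in which there are only derivatives of first order, for instance with respect to u. In the following lines, if Φ is a function of u, v, I shall indicate by Φ' the derivative $\partial\Phi/\partial u$. For short, I shall write

$$t_i = f_i - \varphi_i,\qquad t_i' = f_i',\qquad t_i'' = f_i''\qquad (i = 1, 2).$$

The fundamental equation can be written in the form

$$(\varphi_2' t_1 - \varphi_1' t_2)(t_1 t_2'' - t_2 t_1'') - 2(\varphi_2' t_1' - \varphi_1' t_2')(t_1 t_2' - t_2 t_1')$$
$$= 2(\varphi_2' t_1 - \varphi_1' t_2)(\varphi_2' t_1' - \varphi_1' t_2') + (t_1 t_2' - t_2 t_1')(\varphi_1'' t_2 - \varphi_2'' t_1).$$

Dividing both members by $(\varphi_2' t_1 - \varphi_1' t_2)^3$ I obtain

$$\frac{\partial}{\partial u}\,\frac{t_1 t_2' - t_2 t_1'}{(\varphi_2' t_1 - \varphi_1' t_2)^2} = 2\,\frac{\varphi_2' t_1' - \varphi_1' t_2'}{(\varphi_2' t_1 - \varphi_1' t_2)^2} + \frac{t_1 t_2' - t_2 t_1'}{(\varphi_2' t_1 - \varphi_1' t_2)^2}\cdot\frac{\varphi_1'' t_2 - \varphi_2'' t_1}{\varphi_2' t_1 - \varphi_1' t_2}.$$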

3 It is now sufficient to suppose that the coefficients of the transformation are constant.





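Let A, B be any two functions of v only. It is obvious that

$$\frac{\partial}{\partial u}\,\frac{A t_1 - B t_2}{\varphi_2' t_1 - \varphi_1' t_2} = (A\varphi_1' - B\varphi_2')\,\frac{t_1 t_2' - t_2 t_1'}{(\varphi_2' t_1 - \varphi_1' t_2)^2},$$

$$\frac{\varphi_1'' t_2 - \varphi_2'' t_1}{\varphi_2' t_1 - \varphi_1' t_2} = \frac{\varphi_1''\varphi_2' - \varphi_1'\varphi_2''}{A\varphi_1' - B\varphi_2'}\cdot\frac{A t_1 - B t_2}{\varphi_2' t_1 - \varphi_1' t_2} + \frac{B\varphi_2'' - A\varphi_1''}{A\varphi_1' - B\varphi_2'},$$

so that our equation becomes

$$\frac{\partial^2}{\partial u^2}\,\frac{A t_1 - B t_2}{\varphi_2' t_1 - \varphi_1' t_2} = -2(A\varphi_1' - B\varphi_2')\,\frac{\partial}{\partial u}\,\frac{1}{\varphi_2' t_1 - \varphi_1' t_2}$$
$$+ \left(\frac{\varphi_1''\varphi_2' - \varphi_1'\varphi_2''}{A\varphi_1' - B\varphi_2'}\cdot\frac{A t_1 - B t_2}{\varphi_2' t_1 - \varphi_1' t_2} + \frac{B\varphi_2'' - A\varphi_1''}{A\varphi_1' - B\varphi_2'}\right)\frac{\partial}{\partial u}\,\frac{A t_1 - B t_2}{\varphi_2' t_1 - \varphi_1' t_2}.$$

By integrating with respect to u, we deduce

(1.4)  $\dfrac{\partial}{\partial u}\,\dfrac{A t_1 - B t_2}{\varphi_2' t_1 - \varphi_1' t_2} = -2(A\varphi_1' - B\varphi_2')\,\dfrac{1}{\varphi_2' t_1 - \varphi_1' t_2} + \dfrac{B\varphi_2'' - A\varphi_1''}{A\varphi_1' - B\varphi_2'}\cdot\dfrac{A t_1 - B t_2}{\varphi_2' t_1 - \varphi_1' t_2} + \dfrac12\,\dfrac{\varphi_1''\varphi_2' - \varphi_1'\varphi_2''}{A\varphi_1' - B\varphi_2'}\left(\dfrac{A t_1 - B t_2}{\varphi_2' t_1 - \varphi_1' t_2}\right)^{2} + D$

in which D is a new (unknown) function of v, whereas A, B are functions of v, arbitrarily chosen, such that

(2.4)  $A\varphi_1' - B\varphi_2' \ne 0.$

(If we change A and B, the function D also will change.)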

We will give to (1.4) the name of reduced fundamental equation. This reduction allows us to solve our problem. If v is a point of γ at which $\varphi_1' = 0$, I may suppose B = 1, because we can always suppose that we consider points at which the direction cosines of the tangent to γ are determined, by stating that they are proportional to $\varphi_1'$, $\varphi_2'$; so that, if $\varphi_1' = 0$, we conclude that $\varphi_2' \ne 0$, and (2.4) is satisfied if B = 1.

We may also suppose A = 0; it is easy to check that a change of A corresponds only to a change of D, and therefore we conclude:

(3.4)  $\dfrac{\partial}{\partial u}\left(\dfrac{f_2 - \varphi_2}{f_1 - \varphi_1}\right) = \dfrac{\partial}{\partial u}\left(\dfrac{t_2}{t_1}\right) = -2\varphi_2'\,\dfrac{1}{f_1 - \varphi_1} - \dfrac{\varphi_2''}{\varphi_2'}\,\dfrac{f_2 - \varphi_2}{f_1 - \varphi_1} + \dfrac12\,\dfrac{\varphi_1''}{\varphi_2'}\left(\dfrac{f_2 - \varphi_2}{f_1 - \varphi_1}\right)^{2} + D_1.$

In this equation v is no longer a variable, because we have considered a particular point of γ at which $\varphi_1' = 0$; therefore $\varphi_2'$, $\varphi_1''$, $\varphi_2''$, $D_1$ are constants in (3.4).





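We could consider v variable in (3.4) only if $\varphi_1' = 0$ is an identity, or if $x = \varphi_1 = \text{const.}$ is the equation of γ. In the same manner, we find that

(3.4) bis  $\dfrac{\partial}{\partial u}\left(\dfrac{f_1 - \varphi_1}{f_2 - \varphi_2}\right) = -2\varphi_1'\,\dfrac{1}{f_2 - \varphi_2} - \dfrac{\varphi_1''}{\varphi_1'}\,\dfrac{f_1 - \varphi_1}{f_2 - \varphi_2} + \dfrac12\,\dfrac{\varphi_2''}{\varphi_1'}\left(\dfrac{f_1 - \varphi_1}{f_2 - \varphi_2}\right)^{2} + D_2$

(if $\varphi_2' = 0$). In the general case $\varphi_1'\varphi_2' \ne 0$, the equation (1.4) may be written in the form: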


tit 2 ~ t 2 1[ = 


— 2{(p 2 t\ — (p\t 2 ) 
, 1 


(A<p[ — B<p 2 ) 2 

_l_ (^1 <P2 ~~ ^1^2)(A/i ~ B/2) 2 1 _j_ 


(5</?2 “ A^iXAZi — Bt 2 )(<p 2 ti — ^ 1 ^ 2 ) 


2 ) 


A^>i — B<p 2 


7 (<P2 tl—<pit 2 ) 2 


The second member must obviously be independent of A, B ‘(but D may change 
if we change the functions A, B). In order to check this statement it is suffi¬ 
cient to remark that the equation 

2{B(p 2 — A<pi)(Ati — Bt 2 )((p 2 ti — (pifa) + (<pi <P2 ^ 1 ^ 2 )(AZi — BU) 2 

= —(1 A 2 4~ 2mAB -f- nB 2 )(<p 2 ti — <p\ t 2 )~ + (nt 2 + 2intit 2 -f* lt 2 )(<p 2 B — <p\A) 2 


is an identity if l, m , n are functions of v satisfying the two following linear 
equations: 


(4.4) 


7 /2 >2 ft f . ft f t It 7 t 

Up 2 — rupi = (pi <p 2 -+- <p 2 <p 1 ; rrupi = <pi — Up 2 

(or m<p 2 = —<p 2 —• rup[). 


Our equation becomes consequently 
( fat 2 — ^ 2 ^ 1 ) = — 2(^2 h — ^1 £2) 4 " ^ (nt\ -j" 2 mtit 2 4 " U2) 

IA 2 + 2 mAB 


+ 


D IA 2 + 2mAB + nB 2 ~\ , t ,, . 2 

_Av[ - B<p' t 2(A<p[ — Bip' 2) 2 J {<Pi 1 2 


But I may suppose that l , m, n satisfy not only (4.4) but also 
(5.4) 2D(A<p[ - B<p 2 ) = 14 2 + 2mA B 4- rcB 2 . 


The equations (4.4) and (5.4) are a system of three linear equations in l , m, n; 
and it is easy to see that one can solve these equations with respect to l , m, n 
if <Pi<P 2 j* 0 and A<p[ — B<p 2 ^ 0. We obtain 

t\t 2 — Ufa = — 2((p 2 ti — <p\U ) 4" 2(^1 4- 2mtiU 4“ ft*) 


in which Z, m, n are functions of v satisfying (4.4). (We can deduce nothing 
from (5.4), because D is unknown.) By writing 

hp? + rufi[ 2 = 






we deduce: 

iW = l(<Pl(P2 + <p2<p[) + H<p'i<p ' 2 , \rupi = — %(<Pl<P2 + + H<p[<p2 , 

\rrupnp2 = i<pi <?2 — h ^2 = \(<P\<P 2 — vvpi) — H<p[(p ' 2 , 

so that our equation becomes 

t\t 2 — tit l = ““ 2(<p2 t\ — <piti) h (~7 — - 7 ^ *Pl <P 2 

\<pl #2/ 


+ K<Pl <P2 + <P2 <Pl) ^ 


<£> 2 / \<pl 

If we divide this equation by <^2 and put 

/1 — <^1 _ ^ y *2 


<P 2 ) > • 

^> 2 


(6.4) 

we obtain 




= * 1 , 


^2 


= *2 


^2 


2i2 2 — Z2 Z\ 2(^2 — Zi) + H(zi — Z2) 2 + i (—7 + ^7^ (z 2 — zj) 
(7.4) Vl ^ , „ 


^ 2 / 

( A/ //V 

<Pl <f>2/ 


ZlZ 2 . 


This equation is equivalent to the reduced fundamental equation (if we inter¬ 
change the indices 1, 2, we have to change only the sign of H). If we put 

H. Df w, -\(i + 4 ) 

* Vl <£>2 / 

this equation becomes 

(/l — <Pl)f2 ~ (/2 — <P2)/l = ~2[<^ 2 (/l — (fl ) — <^l(/ 2 

- n / . n f 

_ I <Pl <P2 + <P2 <P\ i 

2 


v’s)] 


(8.4) 


(*[) 




■ (fi ~ Vi)* 


+ (fi — ~ *pz) + DlipUfi — <pi) — <pi(fi — ^)} 2 > 


Vi 


or 


(9.4) 


d fi — <pi _ _2 j 

du a a ^ 1 " r 


n t / n 

<P 1 ¥>2 — 


yip* 


<p 1 J 1 — ^1 
fi 


+ 


2 *x \ ** / v?x 

[fi = <p 2 (/l ~ <pl) ~ <£>l(/2 ~ <^ 2 )]. 

5. Remarks on the reduced fundamental equation 

We could repeat our considerations by substituting the curve C for the curve c. In this manner we arrive at another equation which may be deduced from (7.4) or (6.4) by substituting $F_1$, $F_2$, w for $f_1$, $f_2$, u, and by changing, if necessary, the function H or the function D. We can demonstrate that it is sufficient to substitute $F_1$, $F_2$, w for $f_1$, $f_2$, u, without any change of the functions H or D. Since w = −(u + v), we deduce that

$$\frac{d}{dw} = -\frac{d}{du}$$

(if v is considered as a constant). If we write the equations

Fl — <pi n F 2 ~~~ <p 2 

-— > ^2 -/- 

V?1 <p2 


z ' = r = -f (kz) ’ 

dw du. 


[which are analogous to (6.4)], we deduce from (1.2) that 
Zi = kzi , Z 2 = A:z 2 , 

and therefore 

~ dZ 2 ~ dZi , 2 / dZ2 dzA ,2/ f f \ 

dw; dw; \ dw dw / 

The equation deduced from (7.4) by substituting Z, w for z, w, will be 

-(ZlZi - Z2Z1) = | (z 2 - 3!) + #(Zl - Z2) 2 


+ 


5 (4 r + 4) (A -zb +1 (4 - 4) * *2 

* \<pl (f2 / * \<pl <P2 / 


in which // plays, for the curve C, the same role played by H for the curve c. 
In order to prove that H = H, it is consequently necessary to prove that 

— (ziz 2 — z 2 Zx') — | (z 2 — zi) = ziz 2 — z 2 z[ — 2(z 2 — zi), 


or 


Z2Z1 - ZxZ 2 + (Z2 ~ Zl) = 7 (Z 2 — Zl), 

K 


and this is an obvious consequence of ( 2 . 2 ). 

The reduced equation for $f_1$, $f_2$ depends on a parameter v. In the most general case we can suppose it is possible to give to v three different values such that the three equations deduced in this way from the reduced equation are independent of each other (none of them is a consequence of the other two). (Obviously this supposition has to be demonstrated.) If it is so, by eliminating the two derivatives $f_1'$, $f_2'$, we obviously obtain an algebraic equation between $f_1$, $f_2$; and therefore the curve c would be algebraic. But the curves c and C satisfy the same reduced equation, as we have just proved. Therefore C and c would belong to the same algebraic curve. And since c and C are any two curves chosen from among the three curves c, C, γ we can conclude that, some particular cases excepted, the three curves c, C, γ are arcs of the same algebraic curve.⁴

But it is necessary to study the question more deeply in order to find the exceptional cases and to prove that the quoted algebraic curve is a cubic.


Other remarks on the reduced fundamental equation . 

If we apply a collineation, the reduced fundamental equation has to be transformed into another equation which may be deduced from the initial one by changing, if necessary, the function D or H. We shall check this statement in a case of great importance for us. Instead of writing that the coordinates x, y (of a point of c) and the coordinates ξ, η (of a point of γ) are equal to $f_1$, $f_2$, $\varphi_1$, $\varphi_2$ respectively, we suppose that


/1 1 . x - 1 . (pi 1 £ 

* y V f 1 /l j ft ~ 1 £ 1 V 1 ~ t 

h h y y <pt w v 

We shall write also 

u = -n'ix = f) - ?(y - n), 
and we find, without difficulty, that 

fi — <pi _ ft'v — tv' * — a $\ 

a "l - ? + ?)' 

£(?—*) - ,) " y ' (x ~ () 




n t t n 



O) 2 

n 

<P 1 __ 

ton - 

/ 

<Pl 

- 

[>_ 

1 + 

U 

£ 


2 <p[ 2 rj £'r? - £77' 

— 2 g = 2 g fo ~ - (to - ft')^ - p + 

12 (A) L W £ C W J 

Our equation (9.4) becomes: 

- 2 (f, - (,') (i + 1 ' IrJ) 

\« £ « / 


,<r, - tv') —— ~ - 


+ 

+ 


vto" - vVf/to -w*- tY , - €,') x - 

2 i'u - ft' i\ e « v + to 

(w^jf - - W) 


4 We can arrive at the same results by starting from the initial (non-reduced) fundamental equation (6.2). We ought to suppose that we can choose five values of v such that the corresponding five equations are independent. Therefore we could eliminate the four derivatives $f_1'$, $f_2'$, $f_1''$, $f_2''$ and obtain an algebraic equation between $f_1$, $f_2$. This equation would also be satisfied by $F_1$, $F_2$. But, when one follows this course, the research is much more difficult and long.






(in which A depends on v only), or 


or 


x'(y —’n) — y'b - $) 

w 2 




r x - g 

CO 


+ A(v), 


- - A(®)r, 


which can be deduced from (9.4) by changing the value of D and by substituting x, y, ξ, η, ω for $f_1$, $f_2$, $\varphi_1$, $\varphi_2$, Ω respectively. Moreover ω can be deduced from Ω by substituting x, y, ξ, η for $f_1$, $f_2$, $\varphi_1$, $\varphi_2$ respectively. We have consequently checked that (9.4) is invariant with respect to the collineation which we have considered.


6. Two of the three curves c, γ, C are straight lines

In this case I may suppose that c, γ are straight lines and suppose moreover (by using a collineation) that they are the lines y = 0 and x = 0, so that $f_2 = \varphi_1 = 0$. The fundamental equation becomes

$$\frac{f_1''}{f_1'} - 2\,\frac{f_1'}{f_1} = 2\,\frac{\varphi_2'}{\varphi_2} - \frac{\varphi_2''}{\varphi_2'}.$$

Since the first member is independent of v, and the second of u, both of them are equal to the same constant h, so that

(1.6)  $\dfrac{d}{du}\log\dfrac{f_1'}{f_1^2} = h = -\dfrac{d}{dv}\log\dfrac{\varphi_2'}{\varphi_2^2}.$

If

$$\Delta = f_1'\varphi_2 + \varphi_2' f_1$$

should be equal to zero, we already know that the third line C would be the line at infinity. We have therefore only to study our problem if Δ ≠ 0. (Moreover, by means of a collineation, we could always exclude the case that C is the line at infinity.) From (1.2) and (2.2) we deduce that

(2.6)  $X = F_1 = k f_1 = \dfrac{1}{\dfrac{1}{f_1} + \dfrac{\varphi_2 f_1'}{\varphi_2' f_1^2}}$,  and  $Y = F_2 = (1 - k)\varphi_2 = \dfrac{1}{\dfrac{1}{\varphi_2} + \dfrac{f_1 \varphi_2'}{f_1' \varphi_2^2}}$

are the parametric equations of the third curve C. If h = 0 we deduce from (1.6) that

(3.6)  $\dfrac{1}{f_1} = Au + L,\qquad \dfrac{1}{\varphi_2} = Bv + M\qquad (A, L, B, M = \text{const.}),$

$$X = \frac{1}{A(u + v) + \dfrac{MA + LB}{B}},\qquad Y = \frac{1}{B(u + v) + \dfrac{MA + LB}{A}},$$




which are functions of w = −(u + v). The third line has the equation BY − AX = 0 and is therefore a third straight line belonging to the pencil determined by the lines c, γ. In this case the coordinates of the points of every one of our curves are rational functions of the corresponding parameters. The three curves c, γ, C form together a cubic with a triple point (which degenerates consequently into three straight lines).

If h ≠ 0, we obtain analogously from (1.6) and (2.6),

(4.6)  $\dfrac{1}{f_1} = A e^{hu} + M,\qquad \dfrac{1}{\varphi_2} = B e^{-hv} + N,$

$$X = \frac{1}{M - \dfrac{NA}{B}\, e^{h(u + v)}},\qquad Y = \frac{1}{N - \dfrac{MB}{A}\, e^{-h(u + v)}}$$

(A, B, M, N = const.). The third curve satisfies the linear equation MX + NY = 1, and is again a straight line which does not belong to the pencil determined by c, γ; the coordinates of the points of every one of our lines are not rational functions of the corresponding parameters: it is necessary to make use of the exponential function. (The same thing is true if Δ = 0.) The three lines c, γ, C form a cubic with three double points. In any case, if two of our curves are straight lines, the third line is also a straight line.
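Both parametrizations of this section can be checked numerically. The sketch below is an illustration under stated assumptions: the function names and the sample constants are arbitrary choices, not taken from the paper. It evaluates a point $(f_1, 0)$ of c, a point $(0, \varphi_2)$ of γ and the point (X, Y) of C from (3.6) or (4.6), and tests that the three points are collinear, which for points on the two axes means $X/f_1 + Y/\varphi_2 = 1$.

```python
import math

def third_point_h0(u, v, A, L, B, M):
    """Case h = 0, formulas (3.6)."""
    f1 = 1.0 / (A * u + L)            # point (f1, 0) on c
    phi2 = 1.0 / (B * v + M)          # point (0, phi2) on gamma
    X = 1.0 / (A * (u + v) + (M * A + L * B) / B)
    Y = 1.0 / (B * (u + v) + (M * A + L * B) / A)
    return f1, phi2, X, Y

def third_point_h(u, v, h, A, M, B, N):
    """Case h != 0, formulas (4.6)."""
    f1 = 1.0 / (A * math.exp(h * u) + M)
    phi2 = 1.0 / (B * math.exp(-h * v) + N)
    X = 1.0 / (M - (N * A / B) * math.exp(h * (u + v)))
    Y = 1.0 / (N - (M * B / A) * math.exp(-h * (u + v)))
    return f1, phi2, X, Y

def collinearity_defect(f1, phi2, X, Y):
    # (f1, 0), (0, phi2), (X, Y) are collinear iff X/f1 + Y/phi2 = 1
    return X / f1 + Y / phi2 - 1.0

f1, phi2, X, Y = third_point_h0(0.3, 0.5, A=2.0, L=1.0, B=3.0, M=1.0)
print(collinearity_defect(f1, phi2, X, Y), 3.0 * Y - 2.0 * X)    # both near 0

g1, psi2, X2, Y2 = third_point_h(0.3, 0.5, h=0.7, A=1.0, M=3.0, B=2.0, N=-1.0)
print(collinearity_defect(g1, psi2, X2, Y2), 3.0 * X2 - 1.0 * Y2)  # near 0 and near 1
```

Since X and Y depend on u and v only through u + v, the point (X, Y) is a function of w = −(u + v), as the problem requires.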


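7. Only one of the three curves c, γ, C is a straight line

We suppose that γ is a straight line so that, by means of a collineation I may suppose $\varphi_2 = 0$ (identically). The corresponding reduced fundamental equation (3.4) becomes (if we write φ for $\varphi_1$):

$$\frac{\partial}{\partial u}\left(\frac{f_1 - \varphi}{f_2}\right) = -\frac{2\varphi'}{f_2} - \frac{\varphi''}{\varphi'}\,\frac{f_1 - \varphi}{f_2} + \Phi(v)$$

[Φ a function of v only] or

(1.7)  $\left(\dfrac{f_1}{f_2}\right)' - \varphi\left(\dfrac{1}{f_2}\right)' = -\dfrac{\varphi''}{\varphi'}\,\dfrac{f_1}{f_2} + \dfrac{\varphi\varphi'' - 2\varphi'^2}{\varphi'}\,\dfrac{1}{f_2} + \Phi(v).$

By giving to v two values $v_1$, $v_2$ such that $\varphi(v_1) \ne \varphi(v_2)$, we obtain two linear equations in $(f_1/f_2)'$ and $(1/f_2)'$. By solving them, we deduce that

(2.7)  $\left(\dfrac{f_1}{f_2}\right)' = a\,\dfrac{f_1}{f_2} + b\,\dfrac{1}{f_2} + g,\qquad \left(\dfrac{1}{f_2}\right)' = l\,\dfrac{f_1}{f_2} + m\,\dfrac{1}{f_2} + n$

(a, b, g, l, m, n = const.) and, by substituting in (1.7) we deduce that

$$(a - \varphi l)\,\frac{f_1}{f_2} + (b - \varphi m)\,\frac{1}{f_2} + (g - \varphi n) = -\frac{\varphi''}{\varphi'}\,\frac{f_1}{f_2} + \frac{\varphi\varphi'' - 2\varphi'^2}{\varphi'}\,\frac{1}{f_2} + \Phi(v).$$

Since only the curve γ is a straight line, this equation must be an identity in $f_1/f_2$ and $1/f_2$ and consequently

$$\varphi\varphi'' - 2\varphi'^2 = (b - \varphi m)\varphi',\qquad -\varphi'' = (a - l\varphi)\varphi'$$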




from which we deduce successively 

b — <pm = —2 <p' -f* <p(bp — cl), 

2<p' = bp + (m — a)(f> — b, 

2 <p" — {2lf> + [m — a])<p' = 2 (Up — a)<p'. 
Since <p is not a constant, m — a = — 2a, or 

(3.7) a + m = 0. 

Our equations become: 


(4.7) 


<p' = + m<p — -, 


(s)-- m s + ^ +9 ’ G;) ■ i f +m i +n - 


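Let us now consider the collineation

\[ \bar x = \frac{Ax + By + E}{Lx + My + N}, \qquad \bar y = \frac{y}{Lx + My + N}. \]

It transforms the line $\gamma$ (y = 0) into itself. The point $(f_1, f_2)$ of c is
transformed into the point $(\bar f_1, \bar f_2)$ determined by

\[ \frac{\bar f_1}{\bar f_2} = A\,\frac{f_1}{f_2} + B + \frac{E}{f_2}, \qquad \frac{1}{\bar f_2} = L\,\frac{f_1}{f_2} + M + \frac{N}{f_2} \qquad (A, B, \ldots, N = \text{const.};\ AN - LE \neq 0). \tag{5.7} \]

The new function $\bar\varphi$ will be given by

\[ \bar\varphi = \frac{A\varphi + E}{L\varphi + N}, \tag{6.7} \]

and therefore

\[ \bar\varphi' = \tfrac12\,\bar l\,\bar\varphi^2 + \bar m\,\bar\varphi - \tfrac12\,\bar b \tag{7.7} \]

if

\[ \bar l = \frac{1}{AN - LE}\,(lN^2 - 2mLN - bL^2), \]
\[ \bar m = \frac{1}{AN - LE}\,(-lEN + m[LE + AN] + bLA), \]
\[ \bar b = \frac{1}{AN - LE}\,(-lE^2 + 2mAE + bA^2), \]

from which follows

\[ \bar l\,\bar b + \bar m^2 = lb + m^2. \]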
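The invariance of $lb + m^2$ under the substitution (6.7) can be checked in exact rational arithmetic. The following Python sketch is only an illustration, not part of the original argument: the function names are hypothetical, and the formulas for the barred coefficients are the ones listed under (7.7) as read here.

```python
from fractions import Fraction as F

def transformed_coefficients(l, m, b, A, E, L, N):
    """Barred coefficients of (7.7) after phi -> (A*phi + E)/(L*phi + N).

    Assumes the reading of (7.7) given in the text; AN - LE must be nonzero.
    """
    delta = A * N - L * E
    if delta == 0:
        raise ValueError("AN - LE must be different from zero")
    l_bar = (l * N**2 - 2 * m * L * N - b * L**2) / delta
    m_bar = (-l * E * N + m * (L * E + A * N) + b * L * A) / delta
    b_bar = (-l * E**2 + 2 * m * A * E + b * A**2) / delta
    return l_bar, m_bar, b_bar

def invariant(l, m, b):
    """The quantity l*b + m^2 that the collineation should leave unchanged."""
    return l * b + m**2

# Arbitrary illustrative values (assumptions, not taken from the paper).
l, m, b = F(3), F(-2), F(5)
A, E, L, N = F(2), F(1), F(1), F(3)
l_bar, m_bar, b_bar = transformed_coefficients(l, m, b, A, E, L, N)
print(invariant(l, m, b), invariant(l_bar, m_bar, b_bar))
```

Exact fractions are used so that equality of the two printed values is a genuine identity check rather than a floating-point coincidence.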





We could prove the same equations by studying (5.7) instead of (6.7). If
$lb + m^2 \neq 0$, we can choose the values of N:L and A:E such that $\bar l = \bar b = 0$
even if it is necessary (as in our case) that $AN - EL \neq 0$, or $N:L \neq E:A$.
If $lb + m^2 = 0$, I can choose N:L such that $\bar l = 0$; since $\bar l\,\bar b + \bar m^2 = lb + m^2 = 0$,
$\bar m$ also will be equal to zero. Therefore, by changing notation and disregarding
the dashes, I can always suppose that

\[ l = b = 0,\ m \neq 0 \qquad \text{or} \qquad l = m = 0,\ b \neq 0. \tag{8.7} \]

In the latter case I have added the condition $b \neq 0$. If it were not satisfied
and l, m, b were equal to zero, from (4.7) we could deduce

\[ \left(\frac{f_1}{f_2}\right)' = g, \qquad \left(\frac{1}{f_2}\right)' = n. \]

The line c would be a straight line $f_1/f_2$ = const., if g = 0, a straight line $f_2$ =
const., if n = 0, or a straight line $g/f_2 - n f_1/f_2$ = const., if $ng \neq 0$; which is
contrary to our hypothesis that among our three curves only $\gamma$ is a straight line.
I have now to study the two cases (8.7). In the former I can multiply u (and
consequently also v) by the same constant factor (which is of no importance for
our problem), such that m = 1; in the latter I can analogously suppose b = 1.
Therefore in the first case:

\[ \varphi' = \varphi, \qquad \left(\frac{f_1}{f_2}\right)' = -\frac{f_1}{f_2} + g, \qquad \left(\frac{1}{f_2}\right)' = \frac{1}{f_2} + n; \]

in the second case:

\[ \varphi' = -\tfrac12, \qquad \left(\frac{f_1}{f_2}\right)' = \frac{1}{f_2} + g, \qquad \left(\frac{1}{f_2}\right)' = n. \]

In the first case

\[ \frac{f_1}{f_2} - g = Ae^{-u}, \qquad \frac{1}{f_2} + n = Be^{u}, \qquad \varphi = De^{v} \qquad (A, B, D = \text{const.}), \]
\[ (1 + nf_2)(f_1 - gf_2) = ABf_2^2. \]

The line c is the conic

\[ (1 + ny)(x - gy) = ABy^2. \]

By means of (4.2), (2.2), (1.2) we calculate the coordinates $X = F_1$, $Y = F_2$
of a point of C. We find

\[ X = F_1 = \frac{-g + BDe^{u+v}}{\frac{A}{D}e^{-(u+v)} + n}, \qquad Y = F_2 = \frac{-1}{\frac{A}{D}e^{-(u+v)} + n} \]

[both functions of w = -(u + v)], and we remark that the point (X, Y) generates
the same conic c, so that the curves c, C are identical. There are two
distinct points of intersection of the conic c = C and of the line $\gamma$ (y = 0): the
point x = y = 0 and the point at infinity of $\gamma$. The representation of the
coordinates of the points of our lines by means of the corresponding parameter
requires the exponential function. The conic c and the line $\gamma$ form together a
cubic with two double points. In the second case I will apply, for short, such a
collineation that g = 0. (I have to substitute

\[ \bar f_1 = \frac{f_1}{gf_2 + 1}, \qquad \bar f_2 = \frac{f_2}{gf_2 + 1}, \qquad \bar\varphi_1 = \frac{\varphi_1}{g\varphi_2 + 1}, \qquad \bar\varphi_2 = \varphi_2 = 0 \]

for $f_1$, $f_2$, $\varphi_1$, $\varphi_2$ respectively, and to neglect the dashes afterward.) We find

\[ \frac{1}{f_2} = nU, \qquad \frac{f_1}{f_2} = \tfrac12 nU^2 + B, \qquad \varphi = -\tfrac12 V, \]

where U = u + const., V = v + const., B = const. Therefore the line c
generated by the point $x = f_1$, $y = f_2$ is again a conic

\[ 2n(xy - By^2) = 1. \]

If W = -(U + V), we find that the point (X, Y) of the third line C is the point

\[ X = \tfrac12 W + \frac{B}{nW}, \qquad Y = \frac{1}{nW}, \]

so that it generates the same conic c = C. In this case rational functions were
sufficient, but the conic c and the line $\gamma$ (y = 0) are now tangent to each other
(at infinity). In conclusion: If one of the curves c, $\gamma$, C is a straight line, then
either the other lines are straight lines too, or they belong to the same conic. The
coordinates of their points are rational functions of their parameters if the curves
are lines of the same pencil, or are a straight line and a conic tangent to each other;
otherwise it is necessary to make use of the exponential function.
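The tangent case (conic and line, rational parameters) lends itself to an exact check. The Python sketch below is an illustration only: it assumes the parametrisation $x = U/2 + B/(nU)$, $y = 1/(nU)$ of the conic $2n(xy - By^2) = 1$ and the point $(-V/2, 0)$ of the line y = 0, as read from the text; the helper names are hypothetical.

```python
from fractions import Fraction as F

def det3(p1, p2, p3):
    """Determinant of the rows (x, y, 1); it vanishes exactly for collinear points."""
    (x1, y1), (x2, y2), (x3, y3) = p1, p2, p3
    return x1 * (y2 - y3) - y1 * (x2 - x3) + (x2 * y3 - x3 * y2)

def conic_point(t, n, B):
    """Point of the conic 2n(xy - B y^2) = 1 with parameter t (assumed form)."""
    return (t / 2 + B / (n * t), 1 / (n * t))

def line_point(V):
    """Point of the line y = 0 with phi = -V/2 (assumed form)."""
    return (-V / 2, F(0))

def on_conic(point, n, B):
    x, y = point
    return 2 * n * (x * y - B * y * y) == 1

# Illustrative rational values (assumptions).
n, B = F(3), F(2)
U, V = F(5, 7), F(-2, 3)
W = -(U + V)  # the three parameters sum to zero

p_c = conic_point(U, n, B)
p_gamma = line_point(V)
p_C = conic_point(W, n, B)
print(on_conic(p_c, n, B), on_conic(p_C, n, B), det3(p_c, p_gamma, p_C))
```

With U + V + W = 0 the determinant comes out exactly zero, which is the collinearity condition of the problem in this degenerate case.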


8. General deductions 


From now on we can suppose that none of the three lines is a straight line.

I shall now choose as origin a point O of the curve $\gamma$. This point has to
satisfy the following condition. If $\gamma$ and c belong to algebraic curves $\bar\gamma$ and $\bar c$,
and these algebraic curves are not identical (or, if c, $\gamma$ do not belong to the same
algebraic curve), the point O must not belong to $\bar c$ (is not an intersection of $\bar c$, $\bar\gamma$).
If the curves $\bar c$, $\bar\gamma$ are identical, one obviously cannot satisfy this condition, and
O is an arbitrary point of $\gamma$. Afterwards I choose as y-axis (x = 0) the tangent
to $\gamma$ at O. Therefore $\varphi_1 = \varphi_2 = \varphi_1' = 0$ at O and consequently $\varphi_2' \neq 0$. Since $\gamma$
is not a straight line I can also suppose that O is not a point of inflection of $\gamma$,
so that $\varphi_1'' \neq 0$ at O. The reduced equation (2.4) reads

\[ \left(\frac{f_2}{f_1}\right)' = A\left(\frac{f_2}{f_1}\right)^2 + l\,\frac{f_2}{f_1} + \frac{m}{f_1} + n, \tag{1.8} \]





in which the constants A, l, m,n are the values at 0 of <p" /<p 2 , — vi/vi , —2^ , 
Di . Therefore in our case 


\[ Am \neq 0. \tag{2.8} \]

I write now the reduced equation (8.4) for any other point $\Omega = (\varphi_1, \varphi_2)$ of $\gamma$.
And I remark that one can deduce from (1.8) that the left-hand member of
(8.4) is

\[ (f_1 - \varphi_1)f_2' - (f_2 - \varphi_2)f_1' = (f_1 - \varphi_1)\left[f_1\left(\frac{f_2}{f_1}\right)' + \frac{f_2}{f_1}f_1'\right] - (f_2 - \varphi_2)f_1' \]
\[ = (f_1 - \varphi_1)(lf_2 + m + nf_1) + (f_2 - \varphi_2)Af_2 + (\varphi_2 f_1 - \varphi_1 f_2)\left[A\,\frac{f_2}{f_1} + \frac{f_1'}{f_1}\right]. \]

The right-hand member of (8.4) is a polynomial of second degree in $f_1$, $f_2$.
Consequently we deduce from (8.4) that

\[ \left(\varphi_2 - \varphi_1\frac{f_2}{f_1}\right)\left[\left(\frac{1}{f_1}\right)' - A\,\frac{f_2}{f_1^2}\right] \]

is a polynomial P of second degree in $f_2/f_1$, $1/f_1$, the coefficients of which are
functions of the (arbitrary) value of v corresponding to the (arbitrary) point
$\Omega$ of $\gamma$. Therefore

\[ \left(\frac{1}{f_1}\right)' = A\,\frac{f_2}{f_1^2} + \frac{P}{\varphi_2 - \varphi_1 f_2/f_1}. \tag{3.8} \]

If P were divisible by $\varphi_2 - \varphi_1 f_2/f_1$ the quotient would be a polynomial of first
degree:

\[ a\,\frac{f_2}{f_1} + \frac{b}{f_1} + g. \tag{4.8} \]

Its coefficients a, b, g ought to be independent of v (and therefore constant)
because, if it were not so, by equating the values of (4.8) corresponding to two
different values of v, we should find an equation of first degree in $f_2/f_1$, $1/f_1$,
and the curve c would be a straight line; which is contrary to our hypotheses.
Therefore in this case


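\[ \left(\frac{f_2}{f_1}\right)' = A\left(\frac{f_2}{f_1}\right)^2 + l\,\frac{f_2}{f_1} + \frac{m}{f_1} + n, \qquad \left(\frac{1}{f_1}\right)' = A\,\frac{f_2}{f_1^2} + a\,\frac{f_2}{f_1} + \frac{b}{f_1} + g \tag{5.8} \]

(A, l, m, n, a, b, g = const.) or

\[ -f_1' = f_1(af_2 + gf_1) + bf_1 + Af_2, \]
\[ -f_2' = f_2(af_2 + gf_1) - nf_1 + (b - l)f_2 - m. \tag{6.8} \]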





If P is not divisible by $\varphi_2 - \varphi_1 f_2/f_1$, we remark that, by giving to v two different
values $v_1$, $v_2$, we can deduce from (3.8) that

\[ \frac{P_1}{\varphi_2(v_1) - \varphi_1(v_1)\,f_2/f_1} = \frac{P_2}{\varphi_2(v_2) - \varphi_1(v_2)\,f_2/f_1}. \tag{7.8} \]

[$P_1$ and $P_2$ are the polynomials deduced from P by supposing $v = v_1$ or $v = v_2$.]
We can deduce from our supposition that (7.8) cannot be an identity in
$f_2/f_1$, $1/f_1$, because the denominators are not proportional to each other. (If
they were proportional, the straight lines $L_i$ defined by

\[ \varphi_2(v_i) - \varphi_1(v_i)\,\frac{f_2}{f_1} = 0 \qquad (i = 1, 2) \]

would be the same line; but these lines are the straight lines joining the origin
with the points $v = v_1$ and $v = v_2$ of $\gamma$; these three points are not collinear,
because $\gamma$ is not a straight line; and therefore $L_1$, $L_2$ cannot be identical.)
Therefore (7.8) is an equation satisfied by all the points of c. By means of a
collineation I can suppose

\[ x = \frac{1}{f_1}, \qquad y = \frac{f_2}{f_1}. \tag{8.8} \]

The equation (7.8) becomes

\[ yS = T \tag{9.8} \]

if

\[ S = \varphi_1(v_1)P_2(y, x) - \varphi_1(v_2)P_1(y, x); \qquad T = \varphi_2(v_1)P_2 - \varphi_2(v_2)P_1. \]

The S, T are polynomials in x, y of a degree not greater than two. The curve c
will satisfy the equation (9.8) and the equations [deduced from (1.8) and (3.8)]

\[ y' = Ay^2 + ly + mx + n, \qquad x' = Axy + \frac{Q(y, x)}{y - \eta}. \tag{10.8} \]

[Q is a polynomial in x, y of a degree not greater than 2. Its coefficients are
functions of v and $\eta = \varphi_2/\varphi_1$.] We have consequently to study two systems of
equations: the system (5.8) and the system (9.8) and (10.8).

9. The equations (5.8) or (6.8) 

These equations demonstrate that the first member of (8.4) is equal to

\[ [f_1(f_2 - \varphi_2) - f_2(f_1 - \varphi_1)](af_2 + gf_1) + (f_2 - \varphi_2)(bf_1 + Af_2) + (f_1 - \varphi_1)\{nf_1 + (l - b)f_2 + m\}. \tag{1.9} \]

This quantity must be equal to the second member of (8.4). We obtain in this
way an equation in $f_1$, $f_2$ (of a degree not greater than two). I demonstrate
that I can suppose that this is an identity in $f_1$, $f_2$. If one at least of the curves
c, $\gamma$, C does not belong to a conic, I can choose this curve as curve c. Since c
is neither a conic nor a straight line (according to our actual hypothesis), its
points cannot satisfy an equation of a degree not greater than two; and therefore
the preceding equation is an identity in $f_1$, $f_2$. If every one of the arcs c, $\gamma$, C
belongs to a conic, these three conics cannot be identical to each other, because
it is not possible that three points of the same conic are collinear. Therefore at
least two of these conics are different from each other; and I can suppose that
c, $\gamma$ belong to different conics. But, by writing that (1.9) is equal to the second
member of (8.4) I obtain an equation of second degree in $x = f_1$, $y = f_2$, which
is satisfied when we suppose that $x = \varphi_1$, $y = \varphi_2$.⁵ If this equation were not an
identity, it would be the equation of a conic, to which the point $x = \varphi_1$, $y = \varphi_2$
belongs; but this point is any point whatsoever of $\gamma$; and therefore every point
of $\gamma$ would lie on the same conic to which c belongs; which is contrary to our
supposition. Therefore (1.9) and the second member of (8.4) are identically
equal to each other. I order both according to powers of $f_1 - \varphi_1$, $f_2 - \varphi_2$;
by comparing the coefficients of similar terms, I conclude that

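\[ D\varphi_1' = a\varphi_1 + A, \qquad \frac{\varphi_1''}{\varphi_1'} - 2D\varphi_2' = l + g\varphi_1 - a\varphi_2, \]
\[ -\frac{\varphi_1''\varphi_2' + \varphi_2''\varphi_1'}{2(\varphi_1')^2} + D\,\frac{\varphi_2'^2}{\varphi_1'} = -g\varphi_2 + n, \tag{2.9} \]
\[ -2\varphi_2' = -\varphi_2(a\varphi_2 + g\varphi_1) + m + n\varphi_1 + (l - b)\varphi_2, \]
\[ 2\varphi_1' = \varphi_1(a\varphi_2 + g\varphi_1) + b\varphi_1 + A\varphi_2, \]

and, by eliminating D from the first three equations:

\[ \varphi_1'' = 2\varphi_2'(A + a\varphi_1) + (l + g\varphi_1 - a\varphi_2)\varphi_1', \]
\[ \varphi_2'' = \varphi_2'(a\varphi_2 - g\varphi_1 - l) + 2(g\varphi_2 - n)\varphi_1'. \tag{3.9} \]

By comparing (3.9) with the values of $\varphi''$ deduced from (2.9) by differentiating,
we obtain

\[ (b - 2l + 3a\varphi_2)\varphi_1' - 3(A + a\varphi_1)\varphi_2' = 0, \tag{4.9} \]
\[ (b + l + 3g\varphi_1)\varphi_2' + 3(n - g\varphi_2)\varphi_1' = 0. \tag{5.9} \]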

If $a \neq 0$, the equation (4.9) gives

\[ \frac{\varphi_2 + \dfrac{b - 2l}{3a}}{\varphi_1 + \dfrac{A}{a}} = \text{const.}, \]


5 This would be contrary to what we supposed at the beginning of §8.





and the line $\gamma$ would be a straight line, which is now absurd. Therefore a = 0,
and the equation (4.9) proves that

\[ (b - 2l)\varphi_1 - 3A\varphi_2 = \text{const.} \]

Since $\gamma$ is not a straight line, we deduce

\[ b - 2l = 3A = 0. \]

(We do not now take into account that $A \neq 0$ [according to (2.8)].) If $g \neq 0$,
we deduce from (5.9)

\[ \frac{\varphi_1 + \dfrac{b + l}{3g}}{\varphi_2 - \dfrac{n}{g}} = \text{const.}, \]

and we deduce, as above, that g = 0; and, as above, (5.9) proves that b + l =
n = 0. Therefore A = a = g = b = l = n = 0; and therefore, from (2.9)
we deduce $\varphi_1$ = const., which is absurd because $\gamma$ is not a straight line.
Consequently we can disregard the equations (5.8) and (6.8).

10. The equations (9.8) and (10.8) 

I have already supposed that none of the three lines c, $\gamma$, C is a straight line.
If every one of the lines c, $\gamma$, C belongs to a conic, I have already remarked that
these three conics cannot be identical, and that I am therefore allowed to
suppose that the conics to which c and $\gamma$ belong are not identical. In this case I
am therefore allowed to choose as origin O ($\varphi_1 = \varphi_2 = 0$) a point of $\gamma$ which
does not lie on the conic to which c belongs. If

\[ A_{11}f_1^2 + 2A_{12}f_1f_2 + A_{22}f_2^2 + 2A_{13}f_1 + 2A_{23}f_2 + A_{33} = 0 \qquad (A_{ij} = \text{const.}) \]

is the equation of the conic to which c belongs, we can therefore admit that
$A_{33} \neq 0$. By means of the collineation (8.8) the equation of this conic is
transformed into the equation

\[ T = a_{11}x^2 + 2a_{12}xy + a_{22}y^2 + 2a_{13}x + 2a_{23}y + a_{33} = 0 \tag{1.10} \]

($a_{11} = A_{33}$, $a_{12} = A_{23}$, etc.), in which

\[ a_{11} = A_{33} \neq 0. \tag{2.10} \]

The equations (10.8) can be written in the form

\[ y' = Ay^2 + ly + mx + n, \qquad x' = Axy + \frac{Q}{y - \eta}, \tag{3.10} \]

in which $\eta = \varphi_2/\varphi_1$, and Q is a polynomial in y, x (of a degree not greater than
two), the coefficients of which are functions of v only.





By differentiating (1.10) and taking into account (3.10) we find

\[ (a_{11}x + a_{12}y + a_{13})\left(Axy + \frac{Q}{y - \eta}\right) + (a_{21}x + a_{22}y + a_{23})(Ay^2 + ly + mx + n) = 0, \]
\[ (y - \eta)\left[Axy + \frac{a_{21}x + a_{22}y + a_{23}}{a_{11}x + a_{12}y + a_{13}}\,(Ay^2 + ly + mx + n)\right] = -Q(x, y). \tag{4.10} \]

By giving to v two different values, we obtain two equations; if $\eta_i$, $Q_i$ (i = 1, 2)
are the corresponding values of $\eta$ and of Q, we find by subtracting that

\[ Axy + \frac{a_{21}x + a_{22}y + a_{23}}{a_{11}x + a_{12}y + a_{13}}\,(Ay^2 + ly + mx + n) = \frac{Q_1 - Q_2}{\eta_1 - \eta_2} = R, \]

where R is a polynomial in x, y, of a degree not greater than two, with constant
coefficients. From (4.10) we can deduce that yR also is, on the curve c belonging
to the conic T = 0, equal to such a polynomial, and therefore we can find
an identity (in x, y)

\[ yR = (px + qy + r)T + (\text{a polynomial of a degree not greater than two}) \]

(p, q, r = const.). By comparing the terms in $x^3$, and by recalling that $a_{11} \neq 0$,
we find p = 0; and, since the preceding equation is an identity, we deduce that

\[ R = qT + (\text{a polynomial of first degree in } x, y). \]

Therefore, on the curve c (which satisfies the equation T = 0) R is equal to a
polynomial of first degree in x, y; its coefficients must be constant, because
$\gamma$ is not a straight line. And therefore we deduce that, on c,

\[ \frac{Q}{y - \eta} = -R \]

can be considered equal to a polynomial of first degree in x, y with constant
coefficients. The equations (3.10) are therefore identical with equations which
we have already studied; and therefore we can suppose, from now on, that
none of the three curves c, $\gamma$, C belongs to a straight line, and that at least one
of them does not belong to a conic.


11. The equations (9.8) and (10.8) in the general case 

We can also suppose that c does not belong to a conic, and consequently the
equation (9.8) is precisely of third degree. If

\[ yS_1 = T_1 \qquad (S_1, T_1 \text{ polynomials in } x, y) \]

is another equation of third degree, satisfied by the points of c, this equation
must be identical to (9.8) up to a constant factor k so that

\[ y(S_1 - kS) = T_1 - kT \qquad (\text{identically}). \]

Therefore $T_1 - kT$ must be divisible by y; if Lx + My + N is the quotient
(L, M, N = const.), we conclude that

\[ S_1 = kS + Lx + My + N, \qquad T_1 = kT + y(Lx + My + N). \]

Since x' is independent of v, we deduce from (10.8) that $Q/(y - \eta)$ is independent
of v. We give to v two new values $v_3$, $v_4$, and suppose that $Q_3$, $Q_4$ are the
polynomials deduced from Q in this manner. We deduce that

\[ Q_3[y - \eta(v_4)] = Q_4[y - \eta(v_3)] \]

is another non-identical equation satisfied by the points of c. We deduce
therefore that

\[ Q_3 - Q_4 = kS + Lx + My + N, \]
\[ Q_3\eta(v_4) - Q_4\eta(v_3) = kT + y(Lx + My + N), \]

in which k, L, M, N depend on $v_3$, $v_4$. Therefore, by recalling that yS = T
on the curve c, we find that, on the curve c:

\[ Q_3 = \frac{1}{\eta(v_4) - \eta(v_3)}\,\{k[T - S\eta(v_3)] + [y - \eta(v_3)](Lx + My + N)\}, \]
\[ \frac{Q_3}{y - \eta(v_3)} = \frac{1}{\eta(v_4) - \eta(v_3)}\,\{kS + Lx + My + N\}, \]

which must be independent of both $v_3$, $v_4$ [see (10.8)]. Therefore, since c does
not belong to a conic,

\[ \frac{k}{\eta(v_4) - \eta(v_3)}, \qquad \frac{L}{\eta(v_4) - \eta(v_3)}, \qquad \frac{M}{\eta(v_4) - \eta(v_3)}, \qquad \frac{N}{\eta(v_4) - \eta(v_3)} \]

must be constant, and therefore

\[ \frac{Q}{y - \eta} = \frac{Q_3}{y - \eta(v_3)} = hS + px + qy + r \qquad (h, p, q, r = \text{const.}). \]

I shall write

\[ \bar S = hS + px + qy + r, \qquad \bar T = hT + y(px + qy + r). \]

The equations (9.8), (10.8) become:

\[ y\bar S = \bar T; \qquad y' = Ay^2 + ly + mx + n, \qquad x' = Axy + \bar S \]

or, by changing notation and disregarding the dashes:

\[ yS = T, \tag{1.11} \]
\[ y' = Ay^2 + ly + mx + n, \qquad x' = Axy + S. \tag{2.11} \]





According to our hypotheses (§8) the origin ($\varphi_1 = \varphi_2 = 0$) does not belong
to the cubic yS = T, if the curve $\gamma$ does not belong to this cubic. We have
applied a projective transformation (8.8); if we make use of homogeneous
projective coordinates $x_i$ such that $x_1 : x_2 : x_3 = x : y : 1$, this transformation is
defined by

\[ x_1 : x_2 : x_3 = 1 : f_2 : f_1 \qquad (\text{for the points of } c), \]

and the analogous equations

\[ x_1 : x_2 : x_3 = 1 : \varphi_2 : \varphi_1 \qquad (\text{for the points of } \gamma). \]

The equation (1.11) is turned into an equation

\[ x_2 \sum a_{ik}x_ix_k = x_3 \sum t_{ik}x_ix_k, \tag{3.11} \]

($a_{ik}$ and $t_{ik}$ are the coefficients of S, T) which is satisfied by the point $x_2 = x_3 = 0$
(which is the transform of the former origin). Therefore the initial origin lies
on our cubic; and consequently all the curve $\gamma$ belongs, like c, to this cubic.

It may perhaps be interesting to prove directly that the curves c, $\gamma$ have
the same tangent at the origin O ($x_2 = x_3 = 0$). The tangent to $\gamma$ at this point
was the line $\varphi_1 = 0$, which is now defined by $x_3 = 0$. By writing $x_3 = 0$ in
the equation (3.11) of c, I find

\[ x_2(a_{11}x_1^2 + 2a_{12}x_1x_2 + a_{22}x_2^2) = 0. \]

We shall have proved that the line $x_3 = 0$ is tangent also to c, if we prove
that $a_{11} = 0$. By differentiating the equation (1.11) we find [by
taking into account the values of x', y' given by (2.11)] that, on the curve c,

\[ 0 = [2a_{11}xy + 2a_{12}y^2 + 2a_{13}y - 2(t_{11}x + t_{12}y + t_{13})](Axy + S) \]
\[ \quad + [a_{11}x^2 + 4a_{12}xy + 3a_{22}y^2 + 2a_{13}x + 4a_{23}y + a_{33} - 2(t_{21}x + t_{22}y + t_{23})][Ay^2 + ly + mx + n]. \]

Therefore the polynomial of fourth degree (in the right member) must be
equal to the product of yS - T by a polynomial $\pi$ of first degree in x, y. By
comparing the terms of fourth degree in the preceding polynomials and the
terms of third degree in yS - T, we conclude easily that

\[ \pi = 2a_{11}x + (3A + 2a_{12})y + s \qquad (s = \text{const.}). \]

By comparing the coefficients of $x^3$, we deduce

\[ -2t_{11}a_{11} + a_{11}m = -2a_{11}t_{11}, \]

and consequently $a_{11} = 0$, because $m \neq 0$ [see (2.8)]. The right-hand members
of (2.11) are polynomials in x, y of a degree not greater than two.

What happens if we make a change of projective not-homogeneous
coordinates? If

\[ \bar x = \frac{Bx + Dy + F}{Px + Qy + R}, \qquad \bar y = \frac{Lx + My + N}{Px + Qy + R} \tag{4.11} \]

(B, D, ..., Q, R constants, the determinant of which is different from zero)
are the new coordinates, we find, for instance:

\[ \bar x' = \frac{(BQ - DP)(x'y - xy') + (RB - FP)x' + (RD - FQ)y'}{(Px + Qy + R)^2}. \tag{5.11} \]

From (2.11) we deduce that x', y' and

\[ x'y - y'x = yS - x(ly + mx + n) = T - x(ly + mx + n), \]

and therefore also the numerators of the right members of (5.11) are, on our
curve c, equal to polynomials in x, y, the degree of which is not greater than 2.
But the equations (4.11) can be written in the form

\[ \bar x = B\,\frac{x}{Px + Qy + R} + D\,\frac{y}{Px + Qy + R} + F\,\frac{1}{Px + Qy + R}, \]
\[ \bar y = L\,\frac{x}{Px + Qy + R} + M\,\frac{y}{Px + Qy + R} + N\,\frac{1}{Px + Qy + R}, \]
\[ 1 = P\,\frac{x}{Px + Qy + R} + Q\,\frac{y}{Px + Qy + R} + R\,\frac{1}{Px + Qy + R}, \]

and we can deduce from these equations that

\[ \frac{x}{Px + Qy + R}, \qquad \frac{y}{Px + Qy + R}, \qquad \frac{1}{Px + Qy + R} \]

are linear integral functions of $\bar x$, $\bar y$. Therefore the right-hand member of
(5.11) is a polynomial in $\bar x$, $\bar y$ of a degree not higher than two; and we can obtain
an analogous result for $\bar y'$. Therefore: For any system of not-homogeneous
projective coordinates x, y, their derivatives x', y' are (on c) equal to polynomials in
x, y of a degree not higher than two.
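The numerator in (5.11) is nothing but the quotient rule applied to (4.11). A small Python sketch can confirm that the two expressions agree for arbitrary rational values; it is only an illustration, the function names are hypothetical, and the constant F of (4.11) is called `Fc` in the code to avoid a clash with the `Fraction` alias.

```python
from fractions import Fraction as Fr

def numerator_quotient_rule(x, y, dx, dy, B, D, Fc, P, Q, R):
    """Numerator of d/du[(Bx + Dy + Fc)/(Px + Qy + R)] by the quotient rule."""
    top = B * x + D * y + Fc
    bottom = P * x + Q * y + R
    return (B * dx + D * dy) * bottom - top * (P * dx + Q * dy)

def numerator_formula(x, y, dx, dy, B, D, Fc, P, Q, R):
    """Numerator as written in (5.11); dx, dy stand for x', y'."""
    return ((B * Q - D * P) * (dx * y - x * dy)
            + (R * B - Fc * P) * dx
            + (R * D - Fc * Q) * dy)

# Arbitrary illustrative values (assumptions).
args = dict(x=Fr(1, 2), y=Fr(-3), dx=Fr(5, 7), dy=Fr(2),
            B=Fr(2), D=Fr(-1), Fc=Fr(4), P=Fr(3), Q=Fr(1, 3), R=Fr(-2))
print(numerator_quotient_rule(**args), numerator_formula(**args))
```

Since x' and y' enter only through x', y' and x'y - xy', the degree bound claimed in the text follows once these three quantities are known to be polynomials of degree at most two on c.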

12. The canonical equation of the cubic 

I can now choose such not-homogeneous projective coordinates that the
equation of the cubic is

\[ y^2 = 4x^3 + 2gx + r \qquad (g, r = \text{const.}) \tag{1.12} \]

(according to Weierstrass, $2g = -g_2$, $r = -g_3$). If

\[ 8g^3 + 27r^2 \neq 0, \tag{2.12} \]

the cubic is not rational and has no double point. According to what we have
already proved, we can write

\[ x' = P_2 + P_1 + P_0, \qquad y' = Q_2 + Q_1 + Q_0, \tag{3.12} \]

($P_s$ and $Q_s$ are homogeneous polynomials of degree s in x, y) (s = 0, 1, 2). By
differentiating (1.12) we deduce that

\[ (6x^2 + g)(P_2 + P_1 + P_0) - y(Q_2 + Q_1 + Q_0) \tag{4.12} \]

must be equal to zero at the points of the cubic, and that therefore the
polynomial (4.12) must be identically equal to the product

\[ (ax + by + k)(4x^3 + 2gx + r - y^2) \tag{5.12} \]

if we choose the constants a, b, k in a suitable manner. If I compare in (4.12)
and (5.12) the terms of fourth and third degree, I obtain:

\[ P_2 = \tfrac23 x(ax + by), \qquad 6x^2P_1 - yQ_2 = 4kx^3 - y^2(ax + by). \tag{6.12} \]

From the latter equation one deduces:

\[ P_1 = \tfrac23 kx + py; \qquad Q_2 = 6px^2 + y(ax + by) \qquad (p = \text{an unknown constant}). \tag{7.12} \]

By comparing the terms of second degree in (4.12) and (5.12), one finds:

\[ 6x^2P_0 - yQ_1 + gP_2 = 2gx(ax + by) - ky^2. \]

The value of $P_2$ is given by (6.12); we deduce

\[ P_0 = \tfrac29 ga, \qquad Q_1 = ky - \tfrac43 gbx. \tag{8.12} \]

By comparing the other terms of (4.12) and (5.12), we obtain

\[ gP_1 - yQ_0 = r(ax + by) + 2kgx, \qquad gP_0 = kr, \]

and from (7.12), (8.12) we deduce:

\[ Q_0 = gp - rb, \tag{9.12} \]
\[ ra + \tfrac43 kg = 0, \qquad 2g^2a = 9kr. \tag{10.12} \]

These last equations are linear in a, k; if (2.12) is satisfied and the cubic is not
rational, we deduce a = k = 0, and therefore

\[ x' = P_2 + P_1 + P_0 = y(\tfrac23 bx + p), \tag{11.12} \]
\[ y' = Q_2 + Q_1 + Q_0 = 6px^2 + by^2 - \tfrac43 gbx + gp - rb = (6x^2 + g)(\tfrac23 bx + p). \tag{12.12} \]
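With a = k = 0 the identity between (4.12) and (5.12) reduces to a statement that can be tested at arbitrary rational points (x, y), not only on the cubic. The Python sketch below is an illustration under that assumption; the function names are hypothetical, and the expressions for x' and y' are the polynomial forms (11.12) and (12.12) before the cubic is used to simplify y'.

```python
from fractions import Fraction as F

def expression_4_12(x, y, g, r, b, p):
    """(6x^2 + g) x' - y y' with x', y' taken from (11.12), (12.12), a = k = 0."""
    x_prime = y * (F(2, 3) * b * x + p)
    y_prime = 6 * p * x**2 + b * y**2 - F(4, 3) * g * b * x + g * p - r * b
    return (6 * x**2 + g) * x_prime - y * y_prime

def expression_5_12(x, y, g, r, b):
    """(ax + by + k)(4x^3 + 2gx + r - y^2) with a = k = 0."""
    return b * y * (4 * x**3 + 2 * g * x + r - y**2)

# Arbitrary rational sample points and constants (assumptions).
samples = [
    (F(1), F(1), F(0), F(0), F(1), F(0)),
    (F(2, 3), F(-5, 4), F(7), F(-3), F(2), F(1, 5)),
    (F(-4), F(9, 2), F(-1, 3), F(6), F(-3, 7), F(2)),
]
for x, y, g, r, b, p in samples:
    print(expression_4_12(x, y, g, r, b, p) == expression_5_12(x, y, g, r, b))
```

The constant p drops out of the difference, which is why it remains undetermined at this stage of the argument.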


Therefore, if

\[ U = \int \frac{dx}{y} \tag{13.12} \]

is the Abelian integral of first kind connected with our cubic, we deduce that

\[ \frac{dU}{du} = \tfrac23 bx + p. \tag{14.12} \]

But the equation (8.4) is invariant under the group of collineations; I can
consequently write it by substituting x, y, $\xi$, $\eta$ for $f_1$, $f_2$, $\varphi_1$, $\varphi_2$ respectively. It
proves that, on c

\[ (x - \xi)y' - (y - \eta)x' \]

is equal to a polynomial of second degree in x, y. Since y' and x', and
consequently also $\xi y'$, $\eta x'$ are equal to such polynomials, xy' - yx' also is a
polynomial of second degree in x, y. From (11.12) and (12.12) we deduce that on
our cubic

\[ (\tfrac23 bx + p)(6x^3 + gx - y^2) = (\tfrac23 bx + p)(\tfrac12 y^2 - 2gx - \tfrac32 r) \]

is such a polynomial; and this can be true only if b = 0. The equation (14.12)
proves that u can also be considered as the Abelian integral of first kind (for
our cubic), because this integral is defined up to a multiplicative and an
additive constant.

We can now develop the same consideration for the curve $\gamma$, which is identical
with c; and we should find that the corresponding parameter v is given by

\[ \frac{dV}{dv} = q \qquad (q = \text{const.}). \]

[See the analogous equations (13.12) and (14.12).] If p = q our theorem is
completely demonstrated, because it is a consequence of Abel's theorem; the
third curve is identical with c, $\gamma$; the corresponding parameters are the same
integral of first kind. Consequently we have to prove only that p = q. If
p = q, our theorem is true and therefore the reduced equation (9.4) is satisfied,
even if we substitute, as before, x, y, $\xi$, $\eta$ for $f_1$, $f_2$, $\varphi_1$, $\varphi_2$. If our problem
could be solved also for another value of $p \neq q$, the equation (9.4) would be
satisfied also if we write (q/p) du for du, and if we change, if necessary, the
function D. By subtracting the two equations which we deduce in this way
from (9.4) we obtain that

\[ \frac{d}{du}\,\frac{x - \xi}{\eta'(x - \xi) - \xi'(y - \eta)} \]

is only a function of v. Since $(x - \xi)/[\eta'(x - \xi) - \xi'(y - \eta)]$ cannot be independent
of u, because c is not a straight line, we would find, by integrating, that u is a
linear (non-integer) function of x, y; which is absurd. Our theorem is therefore
completely proved.


13. The cubic is rational and possesses a double point 

Let us now suppose that (2.12) is not satisfied and that the cubic possesses
a double point $x = \rho$, y = 0. The equation of the cubic is now

\[ y^2 = 4(x - \rho)^2(x + 2\rho) \qquad (g = -6\rho^2;\ r = 8\rho^3). \tag{1.13} \]

At first we suppose $\rho \neq 0$. The equations (10.12) state only that $k = a\rho$ so
that we obtain

\[ x' = \tfrac23 x(ax + by) + \tfrac23 a\rho x + py - \tfrac43 a\rho^2, \]
\[ y' = 6px^2 + y(ax + by) + a\rho y + 8\rho^2 bx - 6\rho^2 p - 8\rho^3 b. \]

By writing that xy' - yx' is, on the cubic, equal to a polynomial of second
degree in x, y, we find that

\[ 6px^3 + \tfrac13 xy(ax + by) \]

must be equal to such a polynomial if we take into account the equation of c.
Therefore a = b = 0,

\[ x' = py, \qquad y' = 6p(x^2 - \rho^2), \qquad du = \frac{1}{p}\,\frac{dx}{y}, \tag{2.13} \]

which is completely analogous to the definition (13.12) of the integral of first
kind for not rational cubics. In the same manner we shall find, for the
parameter v corresponding to the curve $\gamma$

\[ dv = \frac{1}{q}\,\frac{d\xi}{\eta} \qquad (q = \text{const.}). \tag{3.13} \]

As above, it will be sufficient to demonstrate that our problem is solved by
supposing p = q; which is no longer a consequence of Abel’s theorem. By
introducing a new parameter

\[ m = \frac{y}{2(x - \rho)} \]

we find that

\[ x = m^2 - 2\rho, \qquad y = 2m(m^2 - 3\rho), \tag{4.13} \]

are new parametric equations of our cubic. We also prove easily that three
points of the cubic, corresponding to the values m, $\mu$, M of the new parameter,
are collinear if and only if

\[ Mm + M\mu + m\mu = -3\rho. \tag{5.13} \]

From (2.13), (4.13) it follows that

\[ u = \frac{1}{p}\int \frac{dm}{m^2 - 3\rho}, \qquad \text{or} \qquad m = -\sqrt{3\rho}\;\frac{e^{U} + 1}{e^{U} - 1}, \qquad [U = 2p\sqrt{3\rho}\,u + h]. \tag{6.13} \]

[The constant h is arbitrary.] In the same manner we find that the value $\mu$
of the new parameter can be obtained from the corresponding value of v by
means of the equation

\[ \mu = -\sqrt{3\rho}\;\frac{e^{V} + 1}{e^{V} - 1}, \qquad [V = 2p\sqrt{3\rho}\,v + l], \qquad (l = \text{const.}). \tag{7.13} \]

The equations (5.13), (6.13), (7.13) give

\[ M = -\sqrt{3\rho}\;\frac{e^{W} + 1}{e^{W} - 1} \]

if W = -(U + V). We have found the condition for the parameters, which
was our starting point. Our theorem is proved also in this case; the three
curves c, $\gamma$, C are identical. The introduction of the parameters U, V, W or
u, v, w requires, in this case, only the exponential function.
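Both halves of this argument can be illustrated numerically. The Python sketch below assumes the parametrisation $x = m^2 - 2\rho$, $y = 2m(m^2 - 3\rho)$ and the collinearity condition (5.13) as read above, with $\rho$ denoting the abscissa of the double point; it checks collinearity exactly with rational parameters, and the exponential formulas in floating point. All function names are hypothetical.

```python
from fractions import Fraction as F
import math

def nodal_point(m, rho):
    """Point of y^2 = 4(x - rho)^2 (x + 2 rho) with parameter m (assumed form)."""
    return (m * m - 2 * rho, 2 * m * (m * m - 3 * rho))

def det3(p1, p2, p3):
    """Determinant of the rows (x, y, 1); zero exactly for collinear points."""
    (x1, y1), (x2, y2), (x3, y3) = p1, p2, p3
    return x1 * (y2 - y3) - y1 * (x2 - x3) + (x2 * y3 - x3 * y2)

def third_parameter(m, mu, rho):
    """Solve M m + M mu + m mu = -3 rho for M."""
    return (-3 * rho - m * mu) / (m + mu)

def parameter_from_exponent(U, rho):
    """m = -sqrt(3 rho) (e^U + 1)/(e^U - 1); needs rho > 0 and U != 0."""
    c = math.sqrt(3 * rho)
    return -c * (math.exp(U) + 1) / (math.exp(U) - 1)

# Exact check of collinearity for rational parameters (illustrative values).
rho = F(2)
m, mu = F(1, 2), F(3)
M = third_parameter(m, mu, rho)
exact_det = det3(nodal_point(m, rho), nodal_point(mu, rho), nodal_point(M, rho))

# Floating-point check that U + V + W = 0 reproduces the condition (5.13).
U, V = 0.7, -1.9
W = -(U + V)
a, b_, c_ = (parameter_from_exponent(t, 2.0) for t in (U, V, W))
residual = a * b_ + a * c_ + b_ * c_ + 3 * 2.0
print(exact_det, residual)
```

The second check rests on the addition theorem of the hyperbolic cotangent, since the formula for m is $-\sqrt{3\rho}\,\coth(U/2)$.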


14. The cubic is rational and possesses a cusp

In this case the abscissa $\rho$ of the double point is zero; the equation of the cubic is

\[ y^2 = 4x^3 \qquad (g = r = 0). \tag{1.14} \]

The equations (10.12) in a, k are identities. From the results of §12 and from
the remark that xy' - yx' must be, on the cubic c, equal to a polynomial of
second degree in x, y, we deduce that

\[ x' = \tfrac23 kx + py, \qquad y' = 6px^2 + ky. \tag{2.14} \]

We can write the reduced equation (7.4) by substituting x, y, $\xi$, $\eta$ for $f_1$, $f_2$,
$\varphi_1$, $\varphi_2$, respectively. From (2.14) we conclude (by taking into account that
the equation of $\gamma$ is $\eta^2 = 4\xi^3$ because $\gamma$ is identical with c) that the first member
of (7.4) is


f t 

Z\ Z 2 — #2 — 


X - $1/ 

t' v' 


v' r 


= J , {(* - £)( 6 px* + ky) - (y - ri)(%kx + py)} 
? V 


= ff— {Hv'z 2 ) s + 2WZ* - 

£ V 


+ fc p—[W y 1 Z1Z2 + vt 'zi — %£y'Z2} 

£ y 

' _ * - i 

21 " ~T 


( 



By comparing the coefficients of 2 zi, 2z 2 in the two members of (7.4), I obtain 


1 = V J, - W |, 

- 1 = -6 pi' + pi. 

V V 

But $\eta'/\xi' = 6\xi^2/\eta$ is a consequence of the equation of $\gamma$; therefore, by summing
we obtain k = 0, and consequently

\[ x' = py, \qquad y' = 6px^2. \]

We find again

\[ du = \frac{1}{p}\,\frac{dx}{y}, \]

which is completely analogous to the definition of u in the other cases.




I introduce a new parameter m = y/(2x) and find that

\[ x = m^2, \qquad y = 2m^3 \]

are the new parametric equations of c, and that

\[ u = -\frac{1}{pm} + h \qquad (h = \text{const.}). \]

The introduction of u requires in this case only rational functions. As above,
we have only to demonstrate that, by supposing

\[ v = -\frac{1}{p\mu} + l \qquad (l = \text{const.}), \]

our conditions are satisfied. Now the three points of c, corresponding to the
values m, $\mu$, M of the new parameter, are collinear if and only if

\[ \frac{1}{m} + \frac{1}{\mu} + \frac{1}{M} = 0, \]

or

\[ u + v + w = 0, \]

in which we have written

\[ w = -\frac{1}{pM} - (h + l). \]

Consequently our theorem is demonstrated in every case.
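The cuspidal case is simple enough to verify in exact arithmetic. The Python sketch below is illustrative only: it assumes the parametrisation $x = m^2$, $y = 2m^3$ and the parameters $u = -1/(pm) + h$, $v = -1/(p\mu) + l$, $w = -1/(pM) - (h + l)$ as read above, and the helper names and sample values are hypothetical.

```python
from fractions import Fraction as F

def cusp_point(m):
    """Point of the cuspidal cubic y^2 = 4x^3 with parameter m."""
    return (m * m, 2 * m ** 3)

def det3(p1, p2, p3):
    """Determinant of the rows (x, y, 1); zero exactly for collinear points."""
    (x1, y1), (x2, y2), (x3, y3) = p1, p2, p3
    return x1 * (y2 - y3) - y1 * (x2 - x3) + (x2 * y3 - x3 * y2)

def third_parameter(m, mu):
    """Solve 1/m + 1/mu + 1/M = 0 for M."""
    return -m * mu / (m + mu)

# Illustrative values (assumptions): parameters and the constants p, h, l.
m, mu = F(2), F(3)
M = third_parameter(m, mu)
p, h, l = F(5), F(1, 3), F(-7, 2)

u = -1 / (p * m) + h
v = -1 / (p * mu) + l
w = -1 / (p * M) - (h + l)

collinear_det = det3(cusp_point(m), cusp_point(mu), cusp_point(M))
print(M, collinear_det, u + v + w)
```

Here the three parameters are rational functions of the points, in agreement with the remark that no exponential function is needed in this case.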


Conclusions 

It is possible to state a theorem which may be considered as Abel’s converse
theorem for a cubic, but, besides the general case which leads to elliptic functions,
we have to consider many particular cases:

1. The cubic degenerates into three straight lines forming a triangle,

2. The cubic degenerates into three straight lines belonging to the same pencil,

3. The cubic degenerates into a conic and a straight line, which intersect at two distinct points,

4. The cubic degenerates into a conic and a straight line, which are tangent to each other,

5. The cubic is rational and possesses a double point,

6. The cubic is rational and possesses a cusp.

In cases 1, 3, 5, the introduction of the corresponding parameters requires the
exponential function, whereas rational functions are sufficient in cases 2, 4, 6.

Generalizations

It is obvious that the preceding results can be easily generalized to curves of
a degree higher than 1. For instance we can suppose we have six curves (or
small arcs of curves) $c_1$, $c_2$, ..., $c_6$; the coordinates of a point of $c_i$ are functions
of a parameter $u_i$. I choose on every curve $c_i$ a point $A_i$, and I suppose that
these points lie on a conic, if and only if the sum of the corresponding
parameters $u_i$ is equal to zero. To study this problem I have to consider the conics
which degenerate into two straight lines, and to choose three curves arbitrarily
among the six curves $c_i$, for instance $c_1$, $c_2$, $c_3$. I choose three collinear
points $A_1$ on $c_1$, $A_2$ on $c_2$, $A_3$ on $c_3$, and consider the points of $c_4$, $c_5$, $c_6$ and the
corresponding parameters $u_4$, $u_5$, $u_6$ as variable. Our problem becomes: When
does it happen that three points $A_4$ of $c_4$, $A_5$ of $c_5$, $A_6$ of $c_6$ are collinear, if

\[ u_4 + u_5 + u_6 = -(u_1 + u_2 + u_3) = \text{const.?} \]

From our preceding results we deduce that $c_4$, $c_5$, $c_6$ form together a cubic.
Therefore three curves, chosen arbitrarily among the six given curves, form a cubic;
and it is very easy to discuss completely this new problem. We remark also
that the problem is no more general if we suppose $\sum \Phi_i(u_i) = 0$ ($\Phi_i$ a function
of $u_i$ only). It may be reduced to the preceding problem by means of a change
of parameters.


Another Generalization 

Let us suppose we have four arcs of curves $c_1$, $c_2$, $c_3$, $c_4$. We shall consider
four functions $u_1$, $u_2$, $u_3$, $u_4$ of the points $A_1$, $A_2$, $A_3$, $A_4$ of the curves $c_1$, ..., $c_4$
respectively. Is it possible to choose these parameters in such a way that,
when the points $A_i$ are collinear, then

\[ u_1 + u_2 + u_3 + u_4 = 0? \]

Let us suppose that the choice of these parameters is possible. We can state
the question: Is this choice completely determined (up to non-essential
constants)? If not, in how many ways can we choose linearly independent systems
of such parameters u? When we have three (linearly independent) systems of
parameters u, our question is identical with Lie's problem on the surfaces of
translation. But if one disregards for a moment these surfaces, it seems to
me that the above-stated general problems and these generalizations also are
very interesting from the point of view of one who studies Abel's theorem.

Institute for Advanced Study 



Annals of Mathematics
Vol. 43, No. 3, July, 1942


HAUSDORFF INTEGRAL TRANSFORMATIONS 

By H. L. Garabedian 
(Received February 6, 1942) 

1. Introduction 

It is the object of this paper to study the integral transformation

\[ v(x) = \int_0^x u(y)\,d\varphi(y/x), \tag{1.1} \]

where u(x) is bounded and continuous, $x \ge 0$, where $\varphi(x)$ (called the mass
function of the transformation) is a Hausdorff mass function (vide infra), and
where the integration is in the sense of Riemann-Stieltjes. The transformation
is said to be regular if the existence of $\lim_{x\to\infty} u(x)$ implies the existence of
$\lim_{x\to\infty} v(x)$ and the equality of the limits.

A mass function $\varphi(x)$ is said to be a Hausdorff mass function when

      (i) $\varphi(x)$ is of bounded variation on the interval $0 \le x \le 1$,

      (ii) $\varphi(x)$ is continuous at x = 0, and $\varphi(0) = 0$,

(1.2) (iii) $\varphi(1) = 1$,

      (iv) $\varphi(x) = \tfrac12[\varphi(x - 0) + \varphi(x + 0)]$ if $0 < x < 1$.

In what follows we shall use the symbols $(H, \varphi(x))$ and $[H, \varphi(x)]$ to designate
integral and matrix transformations respectively involving the mass function
$\varphi(x)$.¹

The transformation (1.1) may be written in the form

\[ v(x) = [1 - \varphi(1 - 0)]\,u(x) + \int_0^{x-0} u(y)\,d\varphi(y/x), \tag{1.3} \]

where a possible discontinuity of $\varphi(x)$ at x = 1 has been removed from the
integral in (1.1). In 1924 Silverman [1] studied the transformation (1.3) with
the following restrictions on $\varphi(x)$:

      (i) $\varphi(x)$ is continuous, $0 \le x \le 1$,

      (ii) $\varphi'(x)$ is continuous, $0 < x \le 1$,

(1.4) (iii) $\varphi(0) = 0$,

      (iv) $\int_0^1 |\varphi'(x)|\,dx \le M$.

1 A discussion of the relationship of mass functions to matrix transformations may be
found in [5].




In connection with these conditions it is understood that $\phi(1 - 0) = \phi(1)$.
Silverman termed a function $\phi(x)$ satisfying the conditions (1.4) absolutely
regular. Silverman proved that the transformation (1.3) is regular when $\phi(x)$
is absolutely regular, 2 and also established a condition in the form of an integral
equation in order that $(H, \phi_a(x)) \supset (H, \phi_b(x))$, where $\phi_a(x)$ and $\phi_b(x)$ are
absolutely regular mass functions.

In a recent paper [2] this writer clarified and interpreted the results of Silver- 
man in the light of certain recent developments ([3] and [4]) in the field of 
Hausdorff matrix transformations, and thus made significant extensions of 
Silverman’s results on inclusion and equivalence relations among Hausdorff 
integral transformations. 

In this paper we extend Silverman’s results to include a much wider class of 
mass functions than the absolutely regular mass functions just defined. 

2. Regularity of $(H, \phi(x))$ when $\phi(x)$ is a Hausdorff mass function

This section is devoted primarily to a proof of the theorem which follows. 

Theorem 1. A necessary and sufficient condition that the integral transformation
$(H, \phi(x))$ be a regular transformation is that $\phi(x)$ satisfy the conditions (1.2).

Let us first prove that if $\phi(x)$ satisfies the conditions (1.2) then the transformation
(1.1) is a regular transformation. Assuming that $\lim_{x\to\infty} u(x) = l$, we
wish to prove that $\lim_{x\to\infty} v(x) = l$.

We observe that

$\int_0^x d\phi(y/x) = \int_0^1 d\phi(s) = \phi(1) - \phi(0) = 1.$

Then we may write

$v(x) - l = \int_0^x [u(y) - l]\, d\phi(y/x).$

Now, choose $p$ so large that $|u(x) - l| < \epsilon$, $x \ge p$. Hold $p$ fixed and denote
by $M$ a number greater than $|u(x) - l|$ in $0 \le x \le p$. Then, for $x > p$:

$|v(x) - l| \le \int_0^p |u(y) - l|\, |d\phi(y/x)| + \int_p^x |u(y) - l|\, |d\phi(y/x)|$

$\le M \int_0^p |d\phi(y/x)| + \epsilon \int_p^x |d\phi(y/x)|.$

We observe that

$\int_p^x |d\phi(y/x)| \le \int_0^x |d\phi(y/x)| = \int_0^1 |d\phi(s)| = V,$

2 We note also that Silverman required only that $u(x)$ be bounded and integrable,
$0 \le x \le x_1$. In this paper we have to require the boundedness and continuity of $u(x)$,
$x \ge 0$, owing to difficulties arising from the non-existence of the Stieltjes integral
$\int f(x)\, dg(x)$, in the case of common discontinuities of $f(x)$ and $g(x)$.





where $V$ is the total variation of $\phi(x)$ in the interval (0, 1). Moreover, since
$\phi(x)$ is continuous at $x = 0$ we can find a number $X$ so large that

$\int_0^p |d\phi(y/x)| = \int_0^{p/x} |d\phi(s)| < \epsilon, \qquad x > X.$

Now, let $X'$ be the greater of the numbers $p$ and $X$. Then, we have

$|v(x) - l| < \eta, \qquad x > X',$

where $\eta = (M + V)\epsilon$, whence

$\lim_{x\to\infty} v(x) = l.$
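The sufficiency argument can be illustrated numerically. The following is only a sketch under stated assumptions: the substitution $s = y/x$ turns (1.1) into $v(x) = \int_0^1 u(xs)\, d\phi(s)$, which is approximated by a midpoint Riemann-Stieltjes sum. The function name `hausdorff_transform`, the test function $u(y) = y/(1+y)$ (limit $l = 1$) and the mass function $\phi(s) = s^2$ are illustrative choices, not taken from the paper.

```python
def hausdorff_transform(u, phi, x, n=20000):
    """Midpoint Riemann-Stieltjes sum for v(x) = int_0^1 u(x*s) d phi(s).

    This is (1.1) after the substitution s = y/x.
    """
    total = 0.0
    for i in range(n):
        s0 = i / n
        s1 = (i + 1) / n
        total += u(x * 0.5 * (s0 + s1)) * (phi(s1) - phi(s0))
    return total


def u(y):
    # Assumed test function: bounded, continuous for y >= 0, limit l = 1.
    return y / (1.0 + y)


def phi(s):
    # Assumed mass function: continuous, phi(0) = 0, phi(1) = 1.
    return s * s


for x in (10.0, 100.0, 1000.0):
    # v(x) should move toward l = 1 as x grows.
    print(x, hausdorff_transform(u, phi, x))
```

For this choice the printed values approach 1 as $x$ increases, in line with the estimate $|v(x) - l| < (M + V)\epsilon$.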

Finally, we show that the conditions (1.2) are necessary for the regularity
of the transformation (1.1). Here we assume that $\lim_{x\to\infty} v(x) = l$ and obtain
the conditions (1.2).

We observe first of all that the existence of the integral in (1.1) implies that
$\phi(x)$ be a function of bounded variation on the interval $0 \le x \le 1$.

If in (1.1) we set $u(y) = 1$, we get

$v(x) = \int_0^x d\phi(y/x) = \phi(1) - \phi(0).$

For regularity we must have $\phi(1) - \phi(0) = 1$. If we take $\phi(0) = 0$, then
$\phi(1) = 1$.

Suppose now that $\phi(x)$ has a discontinuity at $x = 0$, that is, $\phi(+0) = A$,
$\phi(0) = 0$. Then, let us define the function

$\psi(x) = \begin{cases} 0, & x = 0, \\ A, & x > 0. \end{cases}$

If we put $\phi^* = \phi - \psi$, then $\phi^*(0) = 0$ and $\phi^*(x)$ is continuous at $x = 0$. Note
that $\phi^*(1) = 1 - A$. Now we write

(2.1)    $v(x) = (1 - A) \int_0^x u(y)\, d\theta(y/x) + \int_0^x u(y)\, d\psi(y/x),$

where we set $\theta(x) = \phi^*(x)/(1 - A)$, so that $\theta(1) = 1$. We suppose now that
$u(x)$ is any continuous function, $x \ge 0$, such that $\lim_{x\to\infty} u(x) = l$. Since $\theta(x)$
satisfies the conditions (1.2), the first integral in (2.1) defines a regular
transformation and we have

$\lim_{x\to\infty} v(x) = l(1 - A) + A\, u(0).$

Now, we choose $u(0) = 0$ and $l \ne 0$; regularity then requires $l(1 - A) = l$.
Hence, a necessary condition for regularity is the requirement $A = 0$; in other
words, $\phi(x)$ is continuous at $x = 0$.

We note that the regularity condition (1.2, iv) is in a sense superfluous since
it serves merely to determine $\phi(x)$ uniquely at every point of the interval (0, 1).
With this remark the proof of our theorem is complete.
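The role of continuity at $x = 0$ can be seen in a small numerical sketch. It assumes the simplest continuous part $\theta(s) = s$ in (2.1), so that the first integral is the ordinary mean $\int_0^1 u(xs)\, ds$, and it uses the fact that the step function at 0 contributes $A\, u(0)$. The names `integral_mean` and `transform_with_jump` and the test function $u(y) = y/(1+y)$ are illustrative assumptions.

```python
def integral_mean(u, x, n=20000):
    """Midpoint sum for int_0^1 u(x*s) ds (the transform with theta(s) = s)."""
    return sum(u(x * (i + 0.5) / n) for i in range(n)) / n


def transform_with_jump(u, x, A):
    """Right-hand side of (2.1), assuming theta(s) = s and a jump A at 0."""
    return (1.0 - A) * integral_mean(u, x) + A * u(0.0)


def u(y):
    # Assumed test function with u(0) = 0 and limit l = 1.
    return y / (1.0 + y)


for A in (0.0, 0.25):
    # With A = 0 the values tend to l = 1; with A = 0.25 they tend to 1 - A.
    print(A, transform_with_jump(u, 1.0e4, A))
```

With $A = 0.25$ the transform tends to $l(1 - A) = 0.75$ rather than to $l = 1$, which is the failure of regularity used in the proof.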





Now, we find it convenient to define $\theta_a(x) = \phi_a(1 - 0)$, $\theta_b(x) = \phi_b(1 - 0)$, $x \ge 1$.
Then, we may write

$w(x) = \alpha\beta\, u(x) + \alpha \int_0^1 u(xs)\, d\theta_b(s) + \beta \int_0^1 u(xs)\, d\theta_a(s)$

$\qquad + \int_0^1 \int_0^1 u(xy)\, d_y\theta_a(y/t)\, d\theta_b(t).$

The last integral is of a type studied by Bray [8, Th. 5, p. 183] relative to a
change in the order of integration. In the present situation the hypotheses of
Bray's theorem are fulfilled with a comfortable margin of safety. Accordingly,
we have

(3.4)    $w(x) = \alpha\beta\, u(x) + \alpha \int_0^1 u(xs)\, d\theta_b(s) + \beta \int_0^1 u(xs)\, d\theta_a(s)$

$\qquad + \int_0^1 u(xy)\, d_y \int_0^1 \theta_a(y/t)\, d\theta_b(t).$


Comparing (3.4) with (3.3) we have

(3.5)    $\theta_c(s) = \alpha\theta_b(s) + \beta\theta_a(s) + \int_0^1 \theta_a(s/t)\, d\theta_b(t),$

or

$\theta_c(s) = \alpha\theta_b(s) + \beta\theta_a(s) + \theta_a(1)\theta_b(s) + \int_s^1 \theta_a(s/t)\, d\theta_b(t),$

or

(3.6)    $\theta_c(s) = \theta_b(s) + \beta\theta_a(s) + \int_s^1 \theta_a(s/t)\, d\theta_b(t).$

Observe also that from (3.5) we can write

(3.7)    $\phi_c(s) = \int_0^1 \phi_a(s/t)\, d\phi_b(t) = \phi_b(s) + \int_s^1 \phi_a(s/t)\, d\phi_b(t).$

We now proceed to show that $\phi_c(x)$ satisfies the conditions (3.1). From (3.7)
we have $\phi_c(1) = 1$. Now, in (3.6) we set $s = ty$ to obtain

$\theta_c(s) = \theta_b(s) + \beta\theta_a(s) - \int_s^1 \theta_a(y)\, d\theta_b(s/y).$


It follows now, from a theorem of Bray [8, Th. 3, p. 180], that $\theta_c(x)$ is continuous
on the interval $0 \le x \le 1$ and hence that $\theta_c(0) = 0$. It results a fortiori that
the conditions (ii) and (iii) of (3.1) are fulfilled. Finally, if we consider (3.6)
and use another result of Bray [8, Th. 4, p. 181], it follows that $\theta_c(s)$ and hence
$\phi_c(s)$ is of bounded variation on the interval (0, 1).





Using (3.7) and integrating by parts we obtain

$\phi_c(s) = \phi_a(s) - \int_s^1 \phi_b(t)\, d\phi_a(s/t).$

If we set $s = ty$ we get

(3.8)    $\phi_c(s) = \phi_a(s) + \int_s^1 \phi_b(s/y)\, d\phi_a(y).$


This equation is of the same form as (3.7) except that $\phi_a$ and $\phi_b$ have been
interchanged. Hence, AB = BA, whence the transformations A and B are
permutable. This completes the proof of Theorem 2.
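The symmetry between (3.7) and (3.8) can be checked numerically on a simple pair of mass functions. The sketch below assumes the continuous choices $\phi_a(x) = x$ and $\phi_b(x) = x^2$, for which (3.7) can be evaluated by hand as $\phi_c(s) = 2s - s^2$. The helper name `compose` and the midpoint Stieltjes sum are illustrative, not part of the paper.

```python
def compose(phi_outer, phi_inner, s, n=20000):
    """phi_inner(s) + int_s^1 phi_outer(s/t) d phi_inner(t), as in (3.7).

    Assumes 0 < s <= 1; the integral is a midpoint Riemann-Stieltjes sum.
    """
    total = phi_inner(s)
    h = (1.0 - s) / n
    for i in range(n):
        t0 = s + i * h
        t1 = t0 + h
        mid = 0.5 * (t0 + t1)
        total += phi_outer(s / mid) * (phi_inner(t1) - phi_inner(t0))
    return total


def phi_a(x):
    # Assumed example mass function.
    return x


def phi_b(x):
    # Assumed example mass function.
    return x * x


s = 0.5
# Both orders of composition should agree with 2*s - s**2.
print(compose(phi_a, phi_b, s), compose(phi_b, phi_a, s), 2 * s - s * s)
```

Calling `compose(phi_a, phi_b, s)` evaluates the right-hand side of (3.7), and `compose(phi_b, phi_a, s)` evaluates that of (3.8); their agreement is the numerical counterpart of AB = BA for this pair.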

We are now in a position to state and prove the main theorem of this paper. 

Theorem 3. Let $\phi_a(x)$ and $\phi_b(x)$ satisfy the conditions (3.1). If there exists
a solution $\phi_c(x)$ of the type (3.1) which satisfies either of the Silverman-Schmidt
equations:

(3.9)    $\phi_a(x) = \int_0^1 \phi_c(x/t)\, d\phi_b(t),$

$\qquad\ \ \phi_a(x) = \int_0^1 \phi_b(x/t)\, d\phi_c(t),$

then $(H, \phi_a(x)) \supset (H, \phi_b(x))$.

We wish to find sufficient conditions on the mass functions $\phi_a(x)$ and $\phi_b(x)$
in order that $(H, \phi_a(x)) \supset (H, \phi_b(x))$. Suppose that $\phi_a(x)$ and $\phi_b(x)$ satisfy
the conditions (3.1). Let $u(x)$ be any function transformed by $(H, \phi_a(x))$
and $(H, \phi_b(x))$ into $v(x)$ and $w(x)$ respectively. We seek conditions under
which $\lim_{x\to\infty} w(x) = l$ implies $\lim_{x\to\infty} v(x) = l$. Symbolically we have

$w(x) = B\{u(x)\} \to l,$
$v(x) = A\{u(x)\}.$

Suppose that C is any transformation with an associated mass function of the
type (3.1). Since C is regular we have

$C\{w(x)\} = CB\{u(x)\} \to l.$

If there exists a transformation C such that A = CB, then

$l = \lim_{x\to\infty} C\{w(x)\} = \lim_{x\to\infty} CB\{u(x)\} = \lim_{x\to\infty} A\{u(x)\} = \lim_{x\to\infty} v(x).$

In other words, if there exists a solution $\phi_c(x)$ of the type (3.1) satisfying either
of the Silverman-Schmidt equations, we have $(H, \phi_a(x)) \supset (H, \phi_b(x))$.


4. Implications of Theorem 3 

In the field of Hausdorff matrix transformations it is known that $[H, \phi_a(x)] \supset
[H, \phi_b(x)]$, $\phi_a(x)$ and $\phi_b(x)$ being mass functions of the type (1.2), if and only if
there exists a mass function $\phi_c(x)$ of the type (1.2) satisfying the Silverman-
Schmidt equations (3.9), for all except at most a countable set of values of x in
the interval $0 < x < 1$ [4]. Thus, the Silverman-Schmidt equations serve as
a connecting link between the theories of Hausdorff matrix and integral trans¬ 
formations. This relationship has already been discussed at considerable 
length by this writer in a paper already referred to [2]. It will suffice to note 
here that all of the inclusion relationships among Hausdorff matrix transforma¬ 
tions involving mass functions of the type (1.4) were shown to be valid in the 
field of Hausdorff integral transformations. Since mass functions of the type 
(3.1) embrace a far wider class of Hausdorff mass functions than the absolutely 
regular mass functions of Silverman, Theorem 3 extends considerably the class 
of known inclusion relationships among Hausdorff integral transformations. 
We provide an example in illustration of the last statement. 

Let us consider briefly the mass functions

$\phi_1(x) = \begin{cases} \ldots, & 0 \le x \le 1/2, \\ \ldots, & 1/2 < x \le 1, \end{cases}$

$\phi_2(x) = x, \qquad 0 \le x \le 1.$

We observe that $\phi_1(x)$ and $\phi_2(x)$ are mass functions of the type (3.1). In the
field of Hausdorff matrix transformations it is easily proved that $[H, \phi_1(x)] \supset
[H, \phi_2(x)]$. It follows that there exists a solution of the type (3.1) of the
corresponding Silverman-Schmidt integral equations. We conclude finally that
$(H, \phi_1(x)) \supset (H, \phi_2(x))$, and thus obtain a hitherto unknown inclusion
relationship between Hausdorff integral transformations.

In this writer's opinion there is little hope of obtaining Theorem 3 for a much
more general class of mass functions than those included in (3.1). There is
some evidence available in support of this statement. Let us first observe that
it is necessary to allow for a possible discontinuity of $\phi(x)$ at $x = 1$, since it is
known that if $(H, \phi_a(x)) \sim (H, \phi_b(x))$ or $[H, \phi_a(x)] \sim [H, \phi_b(x)]$ then a solution
$\phi_c(x)$ of the Silverman-Schmidt equations will have a discontinuity at $x = 1$ [2].
If we allow for additional discontinuities of $\phi(x)$ in the interval $0 < x \le 1$, in
special cases, at least, we should be led to contradictions. For example, let
$\phi_a(x)$ and $\phi_b(x)$ be the mass functions associated with Euler methods of
summation whose associated moment sequences [6] are $\{\delta_a^n\}$ and $\{\delta_b^n\}$, $0 < \delta_a < \delta_b < 1$.
We have $[H, \phi_a(x)] \supset [H, \phi_b(x)]$, while (vide supra) $(H, \phi_a(x)) \sim (H, \phi_b(x))$,
both methods of summation being equivalent to convergence. Consequently,
the Silverman-Schmidt equations would lead to contradictory results in this
case. Evidently, all mass functions of step-function character are associated
with integral transformations equivalent to each other and to convergence.
Whatever improvements in generality it would be possible to make in Theorem
2 would involve the tedious and difficult matter of reproving the theorems of
Bray used in this paper for mass functions of a very special character.


Northwestern University 





Bibliography 

1. Silverman, L. L., The equivalence of certain regular transformations, Trans. Am. Math. Soc., vol. 26 (1924), pp. 101-112.

2. Garabedian, H. L., A class of linear integral transformations, Am. Jour. of Math., vol. 64 (1942), pp. 208-214.

3. Hille, E. and Tamarkin, J. D., Questions of relative inclusion in the domain of Hausdorff means, Proc. Nat. Acad. Sci., vol. 19 (1933), pp. 573-577.

4. Garabedian, H. L., Hille, E., and Wall, H. S., Formulations of the Hausdorff inclusion problem, Duke Math. Jour., vol. 8 (1941), pp. 193-213.

5. Hausdorff, F., Summationsmethoden und Momentfolgen, I and II, Math. Zeit., vol. 9 (1921), pp. 74-109, 280-299.

6. Garabedian, H. L., Hausdorff matrices, Am. Math. Monthly, vol. 46 (1939), pp. 390-410.

7. Schmidt, R., Über divergente Folgen und lineare Mittelbildungen, Math. Zeit., vol. 22 (1925), pp. 89-152.

8. Bray, H. E., Elementary properties of the Stieltjes integral, these Annals, vol. 20 (1919), pp. 177-186.



Annals of Mathematics
Vol. 43, No. 3, July, 1942 


THE PROBLEM OF DIFFUSION OF WAVES 

By J. Hadamard 
(Received March 26, 1942) 

To the memory of Myron Mathisson, whose premature death is a cruel
loss to Science, I dedicate this treatment of the problem which he has 
solved so beautifully. 


1 

The various forms of Huygens’ principle concern (as studied heretofore) 
phenomena governed by linear partial differential equations of the second 
order 1 


(E)    $F(u) = A^{kl} \dfrac{\partial^2 u}{\partial x^k\, \partial x^l} + \cdots = \begin{cases} 0 & \text{(homogeneous equation)} \\ f(x) & \text{(inhomogeneous equation)}, \end{cases}$


where the terms replaced by dots contain the unknown $u$ itself or its first
derivatives, the $A$'s, the other coefficients, and $f$ being given functions of the
independent variables $x$. I have previously considered three of them, 2 two of which
—the “major premise” and what can be called the “conclusion,” if Huygens’ 
argument is considered as a syllogism—are general properties of such a class 
of phenomena. On the contrary, the “minor premise” is only true for quite 
special equations of the type (E): it means that when a disturbance originally 
located within a determinate finite region of space, propagates by waves and 
reaches any given point outside that region after a certain time, no effect per¬ 
sists after the passing of the wave. This is what Mathisson described by 
saying that these waves are pure. 

Now, waves are not pure for any equation in an odd number of independent 
variables, (i.e. for wave motions in spaces with an even number of dimensions), 
nor are they pure when the number of variables is two. On the other hand, 
as is well known, the classical equation of spherical waves 


($E_0$)    $\dfrac{\partial^2 u}{\partial t^2} = \dfrac{\partial^2 u}{\partial x^2} + \dfrac{\partial^2 u}{\partial y^2} + \dfrac{\partial^2 u}{\partial z^2}$

gives rise to pure waves, and this is also the case for its analogues in 6, 8, ...,
variables. The question is whether these cases are the only ones. Of course,


1 We shall not use, at least in the present paper, the methods of the absolute differential 
calculus. However, we shall speak of contravariant and covariant vectors, the compo¬ 
nents being denoted by superscripts in the first case, subscripts in the second. 

As usual in the absolute differential calculus, we omit, when no confusion can arise, the 
summation signs referring to repeated superscripts and subscripts. 

2 See my lecture on the subject in the Bulletin de la Société Mathématique de France,
vol. LII, 1925.




two equations of the type (E) are to be considered as not essentially different 
if they can be deduced from each other by: 1) any point transformation 

(a) x h = (x 1 , x 2 , • • • , x m ) (h = 1, 2, • • •, m) 


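on the $x$'s; 2) the multiplication of both sides of (E) by any (non-vanishing)
factor, say

(b)    $\dfrac{1}{\lambda} F(u) = \begin{cases} 0, \\ f/\lambda, \end{cases} \qquad \lambda = \lambda(x) \ne 0;$

3) multiplying the unknown by any non-vanishing factor, say

(c)    $F(\lambda u) = \begin{cases} 0, \\ f. \end{cases}$

The combination of the last two, viz. replacing (E) by

(bc)    $\bar F(u) = \dfrac{1}{\lambda} F(\lambda u) = e^{-\mu} F(u e^{\mu}) = \begin{cases} 0, \\ e^{-\mu} f(x), \end{cases}$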

permits an easier treatment, on account of the fact that it does not change the
terms of the second order and, therefore, does not alter the "characteristic
form"

$A(p_1, p_2, \ldots, p_m;\ x^1, x^2, \ldots, x^m) = A^{kl} p_k p_l$

nor the corresponding "metric form"

$\mathbf{H}(dx^1, dx^2, \ldots, dx^m;\ x^1, x^2, \ldots, x^m) = A_{kl}\, dx^k\, dx^l.$


This combination is the only one which we shall deal with in this first paper. 

The relations between the above two forms, which are reciprocal ones, are
well known. To obtain $A_{kl}$, we have to divide by $\Delta$ ($\Delta = 1/D$ being the
discriminant of $A$; $D$, the discriminant of $\mathbf{H}$) the coefficient of $A^{kl}$ in $\Delta$. We have

$A^{kh} A_{kj} = \begin{cases} 0 & (h \ne j), \\ 1 & (h = j). \end{cases}$

The differentials $dx$, or, to deal with finite quantities, the $m$ derivatives $\dot x$
are to be taken as the components of a contravariant vector, the covariant
components of which are the $p$'s if we have

$\dot x^h = A^{hk} p_k,$

or the equivalent equations

$p_k = A_{hk} \dot x^h.$

This also implies

$A(p) = \mathbf{H}(\dot x)$





so that the left and right members, one being the characteristic form and the 
other the metric form, represent one and the same form expressed once in 
terms of the covariant components, then in terms of the contravariant ones. 

2 

Now Mathisson has succeeded in giving the above question an affirmative 
answer, i.e. in proving that no other simply hyperbolic 3 linear partial equation
in four independent variables generates pure waves, except (E 0 ) and those 
equations which are not essentially different from it. Mathisson divides his 
proof into two stages: 

1) In the first place, he considers the case in which the coefficients A of the 
terms of the second order are constants; 

2) In a further discussion he treats the general case. 

The first part alone has been published 4 and it will be the only one we shall 
deal with at present. 

3 

In the special case of constant $A$'s, we shall not have to use transformation
(a), or (b) or (c) alone, but only (bc).

Furthermore, if the $A$'s are constant, it will always be allowed (the equation
being of the simply hyperbolic type) to suppose that they have the same values
as in ($E_0$), viz.

(3.1)    $A^{hk} = 0 \ (h \ne k), \qquad A^{11} = A^{22} = A^{33} = -1, \qquad A^{44} = 1.$

However, we shall begin by not even using the first assumption; let us study
the effect of (bc) on the general equation (E). As for the second assumption
(3.1), it will be useful not to use it until the last step of the calculation. 5

In the case of variable $A$'s, it is convenient to write (E) or, rather, the adjoint
equation 6

$G(v) = 0$

in the form 7

($\mathcal{E}$)    $\Delta_2 v + B^h \dfrac{\partial v}{\partial x^h} + C v = 0,$

$\Delta_2 v = \dfrac{1}{\sqrt{|D|}} \dfrac{\partial}{\partial x^h}\Big(\sqrt{|D|}\, A^{hk} \dfrac{\partial v}{\partial x^k}\Big)$ (second differential parameter of Lamé-Beltrami).

3 This means that the characteristic form $A$ or the equivalent metric form $\mathbf{H}$ consists of
one positive and three negative squares: what we previously called the normal hyperbolic
case. We adopt Hilbert's and Courant's terminology.

4 Acta Mathematica, Vol. LXXI, 1939, pp. 249-282.

5 The disadvantage of immediately assuming (3.1) lies in the asymmetry of the equations
between the suffixes 1, 2, 3 on one hand; 4 on the other.

6 In passing from (E) to its adjoint, (b) and (c) are permuted, so that (bc) remains
unchanged but for the change of sign of $\mu$.

7 $\Sigma$'s are omitted, as remarked in the beginning.

8 See, e.g., Darboux, Leçons sur la théorie des surfaces, vol. III.





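It is known 8 that

$\Delta_2(\lambda v) = \lambda \Delta_2 v + 2\Delta_1(\lambda, v) + v \Delta_2 \lambda,$

$\Delta_1(\lambda, v) = A^{hk} (\partial v/\partial x^h)(\partial \lambda/\partial x^k)$ (first parameter of Lamé-Beltrami for two
functions), and that

$e^{-\mu} \Delta_2(e^{\mu}) = \Delta_2 \mu + \Delta_1 \mu,$

$\Delta_1 \mu = \Delta_1(\mu, \mu) = A^{hk} (\partial \mu/\partial x^h)(\partial \mu/\partial x^k)$ (first parameter of Lamé-Beltrami
for one function), so that

(3.2)    $e^{-\mu} G(v e^{\mu}) = \Delta_2 v + \bar B^h \dfrac{\partial v}{\partial x^h} + \bar C v,$

(3.3)    $\bar B^h = B^h + 2 A^{hk} \dfrac{\partial \mu}{\partial x^k},$

(3.4)    $\bar C = e^{-\mu} G(e^{\mu}) = \Delta_2 \mu + \Delta_1 \mu + B^h \dfrac{\partial \mu}{\partial x^h} + C.$

The transformation formula (3.3) on the $B$'s suggests substituting, in place
of the quantities $B^h$ considered as contravariant components of a vector, the
corresponding covariant components

$B_k = A_{hk} B^h$

with respect to which the transformation formulae are simply

(3.3')    $\bar B_k = B_k + 2 \dfrac{\partial \mu}{\partial x^k}.$

We see that the differences $\bar B_k - B_k$ are the partial derivatives of one and
the same function $2\mu$: therefore, the expressions

(3.5)    $H_{hk} = \dfrac{\partial B_h}{\partial x^k} - \dfrac{\partial B_k}{\partial x^h}$

are invariants with respect to (bc).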


4 

The answer to our question rests, of course, on the integration of (E) itself: 
more precisely, on the solution of the corresponding Cauchy problem. This 
integration is obtained by the use of a proper solution v of the adjoint equation, 
by means of which the value of the unknown u in Cauchy's problem concerning 
(E) is directly given by (in our case, quadruple, triple and double) integrals 
extended over parts of the region where the right-hand member / is defined and 
of the variety to which Cauchy's data refer. 9

Instead of using the exact elementary solution, one can also get to a result

9 See our Lectures on Cauchy's problem, Cambridge-New Haven, 1923 or the French
edition, Paris 1932, especially Liv. IV.





by introducing an approximate solution—what Hilbert calls a “parametrix”: 
then the answer to Cauchy’s problem is not directly obtained, but is reduced to 
an integral equation of the Fredholm (or rather Volterra) type. This is what 
Mathisson does. A first difficulty which he has to overcome consists, then, 
in showing that the necessary and sufficient condition in order that waves be 
pure is that the parametrix thus introduced be an exact solution, after which he 
has to find the cases where this condition is satisfied. 

Now, it seemed to me that this roundabout use of an integral equation by 
introduction of a special parametrix could be avoided, since the exact ele¬ 
mentary solution can actually be constructed; and indeed, it seems that the 
proof becomes simpler this way. Let us recall how the elementary solution 
can be calculated. 10

$a(a^1, a^2, \ldots, a^m)$ being an arbitrarily given point, chosen as the singular point
or "pole" of the solution in question, let us consider the various geodesics issuing
from $a$ relative to the metric form $\mathbf{H}$ and denote by $\Gamma = \Gamma(x^1, \ldots, x^m, a^1, \ldots, a^m)$
the square of the geodesic distance between $a$ and any point $x$: a first term
(the number of independent variables still being assumed to be 4) will be $V_0/\Gamma$,
with

(4.1)    $V_0 = \exp\Big\{-\int_0^s \Big(\dfrac{M}{4} - \dfrac{m}{2}\Big) \dfrac{ds}{s}\Big\},$

where

(4.1')    $M = G(\Gamma) - C\Gamma = \Delta_2 \Gamma + B^h \dfrac{\partial \Gamma}{\partial x^h},$

the integral being taken along the geodesic $ax$ and $s$ denoting the independent
variable in Hamilton's equations

($\mathcal{H}$)    $\dot x^i = \dfrac{dx^i}{ds} = \dfrac{1}{2} \dfrac{\partial A}{\partial p_i}, \qquad \dfrac{dp_i}{ds} = -\dfrac{1}{2} \dfrac{\partial A}{\partial x^i}$

for that geodesic ($A$ is precisely the characteristic form, considered in Section 1).

In general, $G(V_0)$ will be different from zero; then, in order to obtain the
elementary solution, the first term $V_0/\Gamma$ must be completed by a term in $\log \Gamma$.
The latter vanishes if, and only if

(4.2)    $G(V_0) = 0$

whenever the point $x$ belongs to the characteristic conoid which has its vertex
at $a$. Now, the general theory shows 11 that this is the necessary and sufficient
condition in order that the waves be pure.

5 

Following the same general principle as Mathisson—and we shall be able to 
apply it even more thoroughly than he himself has done—, we shall write our 
equation in a simplified form with respect to the group of the transformations 


10 Lectures on Cauchy's problem, Liv. II, chap. III.

11 Loc. cit., Liv. IV, chap. I.





(bc). Now, this is obtained at once by the consideration of $V_0$: for, by an
arbitrary transformation (bc), $V_0$ is changed into $V_0/\lambda$, as is seen by the formulae
(4.1), (4.1') and (5.2) (see below): therefore, by taking $\lambda = V_0$, we get, in one
and only one way, an equation for which

(5.1)    $V_0 = 1,$

a condition which, it must be remembered, depends on the choice of the pole $a$.

The above reduction condition means that the integrand in (4.1) must be
identically zero in $x$. In the special case of constant $A$'s, treated in the present
memoir, $\Gamma$ is simply the quadratic form $\Gamma(\xi) = A_{hk} \xi^h \xi^k$; the symbol $\Delta_2$ reduces
to $A^{hk}\, \partial^2/\partial x^h \partial x^k$, so that, remembering (1), $\Delta_2 \Gamma - 2m$ vanishes identically.
As for $B^h\, \partial\Gamma/\partial x^h$, the derivatives of $\Gamma$ being $\partial\Gamma/\partial x^h = 2P_h = 2s p_h$, it can be
written, in any case, on account of ($\mathcal{H}$),

(5.2)    $B^h \dfrac{\partial \Gamma}{\partial x^h} = 2s A^{hk} B_k p_h = 2s B_k \dot x^k,$

$\dot x^h$ standing for a derivative taken along the geodesic $ax$. When the coefficients
of the terms of second order are constant, the geodesics are straight lines along
which each $x$ varies linearly, so that $s \dot x^h = \xi^h = x^h - a^h$. In this case, therefore,
$V_0 = 1$ gives us

(5.3)    $B_h \xi^h = 0.$


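We shall assume that the $B$'s are regular in the neighborhood of $a$, and
therefore have a Taylor's expansion

$B_h = (B_h)_a + \xi^k \Big(\dfrac{\partial B_h}{\partial x^k}\Big)_a + \cdots.$

Substituting in (5.3), we see that, in the reduced form of the equation, we must
have, 12 at $a$,

(5.4)$_a$    $B_h = 0,$

(5.4')$_a$    $\dfrac{\partial B_h}{\partial x^k} + \dfrac{\partial B_k}{\partial x^h} = 0,$

(5.4'')$_a$    $\dfrac{\partial^2 B_h}{\partial x^k \partial x^l} + \dfrac{\partial^2 B_k}{\partial x^h \partial x^l} + \dfrac{\partial^2 B_l}{\partial x^h \partial x^k} = 0,$

$\cdots \qquad (h, k, l, \ldots = 1, 2, 3, 4).$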


If, instead of taking $\lambda$ equal to $V_0$, we had only chosen it tangent to $V_0$ at $a$
up to a certain order, we should not have all the relations (5.4)$_a$, but only a
certain (arbitrarily large) number of them. It is remarkable that Mathisson
seems to have been led to write these conditions a priori, while we deduce them
from (5.1), which seems to be their true origin.

12 Throughout the following, we shall add the suffix $a$ after the number of every relation
which is proved only for $x = a$, in order to distinguish them from those which are identities
in the $x$'s.






6 

In this section, we consider the general equation ($\mathcal{E}$), not assuming the $A$'s
to be constant.

The successive relations (5.4)$_a$ yield the successive derivatives of $\mu$ or, more
precisely, the numerical values which they assume at $a$, which we shall denote
by $\mu_h = (\partial\mu/\partial x^h)_a$, $\mu_{hk} = (\partial^2\mu/\partial x^h \partial x^k)_a$, .... The transformation formulae
will be, at the point $a$,

(6.1)$_a$    $\bar B_h = B_h + 2\mu_h,$

(6.1')$_a$    $\dfrac{\partial \bar B_h}{\partial x^k} = \dfrac{\partial B_h}{\partial x^k} + 2\mu_{hk},$

together with the corresponding formula for $C$ obtained from (3.4), viz.

(6.2)$_a$    $\bar C = C + B^h \mu_h + A^{hk} \mu_h \mu_k + \dfrac{1}{\sqrt{|D|}} \dfrac{\partial(\sqrt{|D|}\, A^{hk})}{\partial x^h} \mu_k + A^{hk} \mu_{hk}.$

While the set of transformations (bc) constitutes an infinite group, formulae
such as the above define finite ones, since $\mu_h$, $\mu_{hk}$, ... are nothing but numerical
parameters: (6.1)$_a$ are the equations of a group in $m$ parameters; (6.1)$_a$, (6.1')$_a$
and (6.2)$_a$, the equations of a group in $m + \tfrac{1}{2} m(m + 1)$ parameters; and so on.
In order to obtain the reduced values, we must take 


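(6.3)$_a$    $2\mu_h = -B_h,$

$\qquad 2\mu_{hk} = -\dfrac{1}{2}\Big(\dfrac{\partial B_h}{\partial x^k} + \dfrac{\partial B_k}{\partial x^h}\Big),$

$\qquad 2\mu_{hkl} = -\dfrac{1}{3}\Big(\dfrac{\partial^2 B_h}{\partial x^k \partial x^l} + \dfrac{\partial^2 B_k}{\partial x^l \partial x^h} + \dfrac{\partial^2 B_l}{\partial x^h \partial x^k}\Big),$

$\qquad \cdots$

Now, having these values of $\mu_h$, $\mu_{hk}$, ..., we see, in the first place, that the
reduction is made in one and only one way, so that every equation of the class
under consideration has one determinate reduced homologue with respect to the
group (bc).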

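The values assumed at $a$ by the coefficients of this reduced homologue and of
their derivatives at $a$ can be calculated in terms of the original coefficients and
their derivatives, by means of (6.1), (6.1'), ... and (6.2); we find, still at $a$,

(6.4)$_a$    $\bar B_h = 0,$

$\qquad \dfrac{\partial \bar B_h}{\partial x^k} = \dfrac{1}{2}\Big(\dfrac{\partial B_h}{\partial x^k} - \dfrac{\partial B_k}{\partial x^h}\Big) = \dfrac{1}{2} H_{hk},$

$\qquad \dfrac{\partial^2 \bar B_h}{\partial x^k \partial x^l} = \dfrac{1}{3}\Big(\dfrac{\partial H_{hk}}{\partial x^l} + \dfrac{\partial H_{hl}}{\partial x^k}\Big),$

$\qquad \cdots$

then, remembering (6.2)$_a$, where, on account of (6.3)$_a$, there is a reduction between
the second and the third term, we have

(6.5)    $\bar C = C - \dfrac{1}{4} B^h B_h - \dfrac{1}{4} \dfrac{\partial \log |D|}{\partial x^h} B^h - \dfrac{1}{2} B_k \dfrac{\partial A^{hk}}{\partial x^h} - \dfrac{1}{4} A^{hk} \Big(\dfrac{\partial B_h}{\partial x^k} + \dfrac{\partial B_k}{\partial x^h}\Big),$

where 13 the last two terms can be replaced by $-\tfrac{1}{2}\, \partial B^h/\partial x^h$.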

All the quantities in the right-hand members are invariants 14 of $G(v)$ with respect
to the group (bc). We already knew those of the first line in (6.4)$_a$, and the
other ones which contain only the $B$'s can be deduced from them by differentiation,
so that they do not bring us anything new. But such is not the case as
concerns the last one (6.5).

The calculations in the present section being independent 14 of the hypothesis
that the $A$'s be constant, the quantities (3.5), (6.5) are invariants, with respect
to (bc), for every equation such as ($\mathcal{E}$).

From now on, we come back to the case of constant $A$'s in which the invariant
(6.5) is simplified by the vanishing of the terms in $\partial A^{hk}/\partial x^h$ or in $\partial \log |D|/\partial x^h$.

7 

If we write ($\mathcal{E}$) in its reduced form with respect to the pole $a$, so that
equations (5.4)$_a$ are satisfied, the condition $G(V_0) = 0$ is nothing else than $\bar C = 0$.
This must happen not only at the point $x = a$, but along the whole surface of
the characteristic conoid, which, in the present case, is not distinct from the
characteristic cone of vertex $a$. In the first place we must have, at $a$,

(7.1)    $\bar C = 0$

and this property belongs to the original (i.e. not reduced) equation, a
consequence of the fact that (6.5) is an invariant. Since this must be true whatever
the point $a$ may be, we see that (7.1) must be an identity: we can replace $C$
everywhere, by its value obtained from it and (6.5).

Furthermore, at $a$, we must have

(7.2)$_a$    $\dfrac{\partial (\bar C)}{\partial x^h} = 0,$

(7.3)$_a$    $\dfrac{\partial^2 (\bar C)}{\partial x^h \partial x^k} = q A_{hk},$

where $q$ is a factor common to all the terms (7.3)$_a$; and further successive
relations expressing the fact that the function $(\bar C)$ (assumed to be regular) is
divisible by $\Gamma$.


13 This time, we do not write the suffix $a$, as explained in the next section.

14 For variable $A$'s the equations (6.3)$_a$, (6.4)$_a$, ... no longer agree with (5.1) (with the
exception of the first line in (6.4)); but this is not necessary in the present section, it being
essential only that the conditions define one determinate reduced homologue. We could
have replaced the right hand members in (5.4) by arbitrarily given quantities $Q_h$, $Q_{hk}$, ...,
modifying the $\mu_h$, $\mu_{hk}$, ... accordingly. It is easy to verify that we should have obtained,
in that way, the same invariants with only the addition of terms containing exclusively
the $A$'s and the $Q$'s.





Since, in (7.1), $\bar C$ is obtained by identifying $x$ with $a$, and, thus (7.1) becomes
an identity, $\bar C$ and $(\bar C)$ must be distinguished from each other: in (7.1), $\bar C$ is the
value of (6.2) obtained by replacing, at any point $a$ (or $x$), $\mu$ and its derivatives
by their numerical values corresponding to reduction at the same point, while,
in (7.2)$_a$, (7.3)$_a$, ..., $(\bar C)$, at any point $x$, means the value of (3.4) when $\mu$ is
constructed corresponding to reduction at $a$, viz. $\mu = -\log V_0(x; a)$, so that its
successive derivatives, at $a$, are given by the formulae (6.3)$_a$.

Both quantities $\bar C$ and $(\bar C)$ are derived from (6.2) by substituting $-\tfrac{1}{2} B_h$ for
$\mu_h$ and $-\tfrac{1}{4}(\partial B_h/\partial x^k + \partial B_k/\partial x^h)$ for $\mu_{hk}$, so that, if not differentiated, their
values are equal; another treatment is needed for their derivatives. (Cf. Table.)

Note that the corresponding values of one and the same derivative disagree
only by some permutations of suffixes, so that their difference can always be
expressed as a combination of the invariants $H$ defined by (3.5).


8 

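Let us apply this, in the first place, to $\partial \bar C/\partial x^h$ and $\partial (\bar C)/\partial x^h$, both of which
we must equate to zero. We have

(8.1)    $\dfrac{\partial \bar C}{\partial x^h} = \dfrac{\partial C}{\partial x^h} + \dfrac{\partial B^k}{\partial x^h} \mu_k + (B^k + 2A^{kl} \mu_l) \dfrac{\partial \mu_k}{\partial x^h} + A^{kl} \dfrac{\partial \mu_{kl}}{\partial x^h}.$

The coefficient of $\partial \mu_k/\partial x^h$ vanishes, as $\mu_l$ is defined by the first formula (6.3)$_a$.
Moreover, $\mu_k$ is the same in both expressions which we have to consider.
Therefore, subtracting them from each other in order to eliminate the derivative
of $C$, the result, simplified by interchanges of $k$ with $l$ in the coefficients of $A^{kl}$,
reduces to

$A^{kl}\Big[\mu_{klh} - \dfrac{\partial \mu_{kl}}{\partial x^h}\Big] = \dfrac{1}{2} A^{kl} \Big[\dfrac{\partial^2 B_k}{\partial x^h \partial x^l} - \dfrac{1}{3}\Big(2 \dfrac{\partial^2 B_k}{\partial x^h \partial x^l} + \dfrac{\partial^2 B_h}{\partial x^k \partial x^l}\Big)\Big] = \dfrac{1}{6} A^{kl} \dfrac{\partial H_{kh}}{\partial x^l} = 0,$

which, for the same reason as (7.1), are identities.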


9 

These conditions are always satisfied if the $H_{hk}$ are constant: especially,
therefore, if the $B$'s are linear functions of the $x$'s. We shall now show a remarkable
consequence of this, that if complex coefficients were admitted, 15 waves could
be pure for equations essentially distinct from ($E_0$). If we do not mind
introducing imaginaries, nothing prevents us from replacing, in the terms of the
second order, the assumption (1) by

(9.1)    $A^{hk} = 0 \ (h \ne k), \qquad A^{hh} = 1 \quad (h = 1, 2, 3, 4).$


15 Complex values could not be considered for the A’s, as the distinction between elliptic 
and hyperbolic cases would lose its meaning; but, theoretically, a similar objection does 
not hold with regard to the coefficients B. 





Let us take, for each $B_h$, a linear polynomial

$B_h(x) = b_h(x) + \beta_h = b_h(\xi) + B_h(a)$,

$b_h(\xi) = \alpha_{hj}\,\xi^j$

being the homogeneous part. On account of (3.2'), we can alter these expressions
by the elements of any exact differential. Therefore, decomposing the
matrix $\|\alpha_{hj}\|$ into a symmetric one plus an antisymmetric one, the former can
be cancelled and we can assume

(9.2) $\alpha_{hj} = -\alpha_{jh}$,
and in particular

(9.2') $\alpha_{hh} = 0$.

Then J] £*&*(£) vanishes and we have 

(9.3) Vo = exp {-#»*(«)} - e w . 

On the other hand, C is given by (6.5), in which the last term disappears on 
account of (9.30. Thus, substituting Vo in G, we get 

Y t G(Fo) = e- w G(e“) = i{£ [S A («)] a - 2 £ B k (a)B k (x) + C} 

= [B*(a)] 2 - 2 E B k (a)B k (x) + £ [5*(x)] 1 } 

= l£[Wf)] 2 . 

This must be zero on the conoid which has its vertex at $a$, so that the above
quadratic form must be proportional to $\sum (\xi^h)^2$, and we are led to the problem
of expressing the sum $\sum (\xi^h)^2$ as a sum of squares of linear forms, the coefficients
of which form an antisymmetric matrix.

As is easily seen, the most general way of obtaining this is to complete (9.2)
into

(9.4) $\alpha_{hj} = -\alpha_{jh} = \epsilon\,\alpha_{kl} = -\epsilon\,\alpha_{lk}$, $\epsilon = \pm 1$,

where $h, j, k, l$ is any even (or alternate) permutation of the suffixes 1, 2, 3, 4.

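In other words, we have only to take $\alpha = 0$ in the well-known identity

$(\alpha^2 + \beta^2 + \gamma^2 + \delta^2)(x^2 + y^2 + z^2 + t^2)$

$= (\alpha x + \beta y + \gamma z + \delta t)^2 + (-\beta x + \alpha y + \delta z - \gamma t)^2$

$+ (-\gamma x - \delta y + \alpha z + \beta t)^2 + (-\delta x + \gamma y - \beta z + \alpha t)^2$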

which gives the transformation of the product of two sums of four squares into
a sum of four squares. A simple form of the result, to which the most general
one can be reduced by an orthogonal change of the variables (for instance, an
orthogonal change of x and z combined with an orthogonal change on x and t)
is the equation

F(u) = Au + 2^2/^ — + — 2 \ + u(x s + 2/* + z* + t a ) = 0 

\ dx dy dz dt / 

which, for any given a, b, c, d, admits of the solution 


(x - a y + (y- by + (z- c y + (t- dy 

ay—bx+ct—dt 

V 

= (x - ~dy + (2/ - by + (z - c) 2 + (i - dy ~ w ’ 

w being any holomorphic solution of F(u) = Vo . 

Changing t into it , this would give a hyperbolic equation corresponding to 
pure waves. But we have reached this only by introduction of imaginaries. 
On the contrary, we shall now see that no such solution can exist in the real 
domain. 
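The four-square identity used in this section can be checked mechanically. The following is a minimal sketch, not part of the original paper: it assumes the sign pattern of Euler's identity in which the coefficient matrix of the four linear forms becomes antisymmetric when the first parameter is zero, and the function names are hypothetical.

```python
def four_square_forms(al, be, ga, de, x, y, z, t):
    """Return the four linear forms appearing on the right of the identity."""
    return (
        al * x + be * y + ga * z + de * t,
        -be * x + al * y + de * z - ga * t,
        -ga * x - de * y + al * z + be * t,
        -de * x + ga * y - be * z + al * t,
    )


def coefficient_matrix(al, be, ga, de):
    """Rows are the coefficients of the four forms with respect to (x, y, z, t)."""
    return [
        [al, be, ga, de],
        [-be, al, de, -ga],
        [-ga, -de, al, be],
        [-de, ga, -be, al],
    ]


def identity_holds(al, be, ga, de, x, y, z, t):
    """Compare the product of two sums of four squares with the sum of squared forms."""
    lhs = (al**2 + be**2 + ga**2 + de**2) * (x**2 + y**2 + z**2 + t**2)
    rhs = sum(f * f for f in four_square_forms(al, be, ga, de, x, y, z, t))
    return lhs == rhs


def is_antisymmetric(m):
    """True when m[i][j] == -m[j][i] for all i, j (so the diagonal is zero)."""
    n = len(m)
    return all(m[i][j] == -m[j][i] for i in range(n) for j in range(n))
```

With the first parameter set to zero, `coefficient_matrix(0, b, c, d)` is the antisymmetric matrix whose squared linear forms sum to a multiple of the sum of the four squared variables, which is the situation described by (9.4).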


10 


Let us treat (7.3) 0 as we did for (7.2) a . We have 


d 2 C 

dx h dx* dx h dx 1 ' dx h dx 1 1 


d 2 C , d*B l 

+ -T 7 . MA: + 


(dj? djXk dj? duA 
V&C* dx’ dx’ dx h ) 


+ 2 A" P k p. + (B k + 2 A kl w ) + A kl 

dx h dx 1 dx h dx’ dx h dx’ 


As in the treatment of (7.2) 0 , we remark that the coefficient of d 2 n k /dx h dx 3 
vanishes and the second term is the same in d 2 C/dx h dx 3 and in d 2 (C)/dx h dx } . 
As for the terms in d 2 n k i/dx h dx 1 , they give, in d 2 (C)/dx h dx J — d 2 C/dx h dx } , the 
result 


A ki [1. a 8 a* _l( 2 a 3 g* , a 3 g,- <yih X] 

l_2 dx h dx’dx l 8\ dx h dx’dx l dx h dx k dx 1 dx’ dx k dx 1 ) J 

= \ A kl (* + d llM \ = I ( d J& + d Ij\ 

8 \dx’ dx 1 dx h dx 1 ) 8 \ dx’ dx h ) 
which vanishes on account of (8.2). Moreover, we have 




d/Xk 

dx h 



so that we can write for the difference 


d 2 (C) 

dx h dx’ 


dx k dx’ ’ 


( 10 . 1 ) 


d 2 (C) 

dx h dx’ 


+ 2 


( 


V_ 

A dx’ 


dx h dx’ 
dBi 


dBk TJ I dBi IT 






dBi dBk 
dx h dx’ 







the same as that which Mathisson obtains by his method . 16 As he shows, this 
immediately gives the desired solution: using, this time, the special choice (3.1) 
of the coefficients A , we only need to take, in (10.1), h = j = 4, then h = j = 1 , 
thus obtaining 

q - -Hi - Hi - Hi, 

-q = Hi - Hi - Hi, 

and by adding these two equations we see that we must have #24 = Hu = 
#21 == #8i = 0; and similarly for the other quantities #. Then, since every # 
vanishes, B h dx h must be an exact differential —dfi and there exists a trans¬ 
formation (be) which reduces all the coefficients B to zero, after which C must 
also be zero, on account of (7.1) and (6.5). 

New York 

16 It is remarkable that Mathisson's deductions and ours follow a quite analogous order,
successively obtaining (8.2) and then (10.1), although his parametrix is infinite along a
parallel to the $x^0$-axis, while our $V_0$ is a regular function.



Annals of Mathematics
Vol. 43, No. 3, July, 1942


A PROOF OF THE FUNDAMENTAL THEOREM ON THE DENSITY 
OF SUMS OF SETS OF POSITIVE INTEGERS* 


By Henry B. Mann 

(Received January 7, 1942; revised March 16, 1942) 


Let $A$, $B$, $C$ be sets of positive integers. Form $A^0$, $B^0$ by adjoining 0 to $A$
and $B$ respectively. Let $A(n)$ be the number of positive integers in $A$ that are
$\le n$. The greatest lower bound of the quotients $A(n)/n$ is called the density
of $A$. Let $C^0$ consist of all integers of the form $a + b$ $(a \in A^0,\ b \in B^0)$.

Let $\alpha$ be the density of $A$, $\beta$ the density of $B$, $\gamma$ the density of $C$; then we shall
prove

(1) $\gamma \ge \alpha + \beta$ or $\gamma = 1$.

This inequality has been conjectured by E. Landau, I. Schur and A. Khintchine.

Approximations to (1) have been obtained by the following authors: 

E. Landau:¹ $\gamma \ge \alpha + \beta - \alpha\beta$.

A. Besicovitch:² If $\alpha^*$ is the lower bound of the quotients $A(n)/(n + 1)$ then
$\gamma \ge \alpha^* + \beta$ or $= 1$.

I. Schur:³ $\gamma \ge \alpha/(1 - \beta)$ or $= 1$.

I. Schur: 4 y ^ + \{<x + 4/3 2 )* or = 1. 

A. Brauer:⁵ $\gamma \ge \tfrac{9}{10}(\alpha + \beta)$ or $= 1$.

An important partial result was obtained by A. Khintchine:⁶ If $\alpha_i$ is the density
of $A_i$ and $\alpha_1 \le \alpha_2 \le \cdots \le \alpha_n$ and $C^0 = \{a_1 + a_2 + \cdots + a_n\}$ $(a_i \in A_i^0)$ then
$\gamma \ge n\alpha_1$ or $= 1$. In this paper (1) will be proved completely.

Stripped of its transcendental content (1) states that

$\dfrac{C(n)}{n} = 1$ or $\dfrac{C(n)}{n} \ge \min\left(\dfrac{A(l)}{l} + \dfrac{B(m)}{m}\right)$, $\quad 1 \le m \le n$, $1 \le l \le n$.


We propose to prove the following sharper theorem. 

Fundamental Theorem: Let $A$, $B$, $C$ be sets of positive integers. Let
$A(n)$, $B(n)$, $C(n)$ be the numbers of the integers $1, 2, \cdots, n$ that are in $A$, $B$, $C$


* Presented to the American Mathematical Society Feb. 28, 1942. The author’s in¬ 
terest in this problem was aroused through Dr. A. T. Brauer’s lectures on additive theory 
of numbers at New York University. 

1 Die Goldbachsche Vermutung und der Schnirelmannsche Satz. Göttinger Nachrichten,
Math. Phys. Klasse (1930), pp. 255-276.

2 On the density of the sum of two sequences of integers. Jour. London Math. Soc., Vol.
10 (1935), pp. 246-248.

3 Über den Begriff der Dichte in der additiven Zahlentheorie. Sitzungsber. der preuss.
Akad. der Wiss., Math. Phys. Klasse, (1936), pp. 269-297.

4 L.c. footnote 3.

5 These Annals, Vol. 42, (1941), pp. 959-988.

6 Zur additiven Zahlentheorie. Matematiceski Sbornik, Vol. 39, (1932) pp. 27-34.



respectively. Let $C^0 = \{a + b\}$ $(a \in A^0,\ b \in B^0)$ where $A^0$, $B^0$ consist of the
numbers of $A$ and $B$ respectively and the number 0. Then

(2) $\dfrac{C(n)}{n} = 1$ or $\dfrac{C(n)}{n} \ge \min \dfrac{A(m) + B(m)}{m}$, $\quad 1 \le m \le n$, $m \notin C$.
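The statement (2) lends itself to a brute-force check on small finite sets. The sketch below is only an illustration under the assumption that $A$ and $B$ are given as finite Python sets (treated as the complete sets); the helper names `sumset`, `count` and `theorem_holds` are hypothetical and do not come from the paper.

```python
from fractions import Fraction


def sumset(A, B, n):
    """Positive integers <= n of the form a + b, a in A with 0 adjoined, b in B with 0 adjoined."""
    A0, B0 = set(A) | {0}, set(B) | {0}
    return {a + b for a in A0 for b in B0 if 0 < a + b <= n}


def count(S, m):
    """S(m): the number of elements of S among 1, 2, ..., m."""
    return sum(1 for s in S if 1 <= s <= m)


def theorem_holds(A, B, n):
    """Check inequality (2) for one value of n, using exact rational arithmetic."""
    C = sumset(A, B, n)
    gaps = [m for m in range(1, n + 1) if m not in C]
    if not gaps:
        return True  # C(n)/n = 1
    bound = min(Fraction(count(A, m) + count(B, m), m) for m in gaps)
    return Fraction(len(C), n) >= bound
```

Exact fractions are used so that the comparison is not affected by rounding; a call such as `theorem_holds({1, 4}, {2, 3}, 12)` checks a single value of $n$.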


Let $n_1 < n_2 < \cdots$ be the numbers of $\bar{C}$ (i.e. numbers not in $C$); then $C(n_r) =
n_r - r$. (2) is an immediate consequence of the inequality

(3) $\dfrac{C(n_r)}{n_r} \ge \min \dfrac{A(n_q) + B(n_q)}{n_q}$, $\quad q \le r$.

We divide the numbers $\le n_r$ into three sets: numbers of $B$, numbers of the
form $n_r - a$ $(a \in A^0)$, numbers of $L_r$ where $L_r$ denotes the set of all positive
numbers $\le n_r$ that are neither in $B$ nor of the form $n_r - a$ $(a \in A^0)$. Denote
by $l_r$ the number of integers in $L_r$. These three sets are disjoint. Hence

(4) $n_r = B(n_r) + A(n_r) + 1 + l_r$.
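The decomposition behind (4) can be reproduced numerically. This is a hedged sketch assuming finite input sets; `check_equation_4` and its return format are hypothetical names chosen for illustration.

```python
def check_equation_4(A, B, limit):
    """For every gap n_r <= limit of the sumset, verify n_r = B(n_r) + A(n_r) + 1 + l_r."""
    A0, B0 = set(A) | {0}, set(B) | {0}
    C0 = {a + b for a in A0 for b in B0 if a + b <= limit}
    gaps = [m for m in range(1, limit + 1) if m not in C0]
    rows = []
    for n_r in gaps:
        in_B = {b for b in B if 1 <= b <= n_r}
        # numbers of the form n_r - a with a in A0; a = 0 contributes n_r itself
        shifted = {n_r - a for a in A0 if a < n_r}
        L_r = set(range(1, n_r + 1)) - in_B - shifted
        # the sets are disjoint because n_r is not a sum a + b
        assert not (in_B & shifted)
        A_count = sum(1 for a in A if 1 <= a <= n_r)
        assert len(shifted) == A_count + 1
        assert n_r == len(in_B) + A_count + 1 + len(L_r)
        rows.append((n_r, len(in_B), A_count, len(L_r)))
    return rows
```

Each returned tuple is `(n_r, B(n_r), A(n_r), l_r)`, so the relation (4) can be read off row by row.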

(3) follows from (4) for $r = 1$. In proving (3) by induction we may assume

$\dfrac{C(n_s)}{n_s} > \dfrac{C(n_r)}{n_r}$ or $r n_s > s n_r$ for $s = 1, \cdots, r - 1$.

Our statement then is that

(5) $\dfrac{A(n_s) + B(n_s)}{n_s} > \dfrac{C(n_r)}{n_r}$ for $s = 1, \cdots, r - 1$

implies

(6) $C(n_r) \ge A(n_r) + B(n_r)$,

or using (4)

$\dfrac{1 + l_s}{n_s} < \dfrac{r}{n_r}$ for $s = 1, \cdots, r - 1$ implies $1 + l_r \ge r$.

We have to prove the following lemma: If $r n_s > s n_r$, $r n_s > (1 + l_s) n_r$ for
$s = 1, 2, \cdots, r - 1$, then $l_r \ge r - 1$.

From now on $r$ is fixed. We therefore write $n_r = N$, $L_r = L$, $l_r = l$. $s$, $t$
range from 1 to $r - 1$. Put $d_s = N - n_s$; then $n_s - d_t = n_t - d_s$.

Construction of numbers of L. To prove the lemma we have to construct
$r - 1$ numbers $b'$ in $L$, i.e. numbers not in $B$ and not of the form $N - a$ $(a \in A^0)$.
To satisfy both conditions we construct numbers $b'$ satisfying the equations

$b' = n_s - a = N - d$ $\quad (a \in A^0,\ d \notin A^0)$.

The condition $d \notin A^0$ is satisfied by putting $d = n_t - b$ $(b \in B^0)$, or

$b' = b + d_t = n_s - a$.

Thus we arrive at the following recursive definition of numbers $e_1, e_2, \cdots$
and sets $B^0, B^1, B^2, \cdots$, and $T_1, T_2, \cdots$, starting with $B^0$.





Definition: a) We choose $e_{i+1}$ as an element in $B^0 \cup B^1 \cup \cdots \cup B^i$.

b) $t$ lies in $T_{i+1}$ if there is an $a \in A^0$ such that

(7) $a + e_{i+1} + d_t = n_s$
and neither $s$ nor $t$ in $T_1 \cup T_2 \cup \cdots \cup T_i$.

c) $B^{i+1} = \{e_{i+1} + d_t\}_{t \in T_{i+1}}$.

Equation (7) is equivalent to

(7') $a + e_{i+1} + d_s = n_t$,

hence $s$ as well as $t$ lies in $T_{i+1}$. We may therefore write

(8) $b' = e_{i+1} + d_t = n_s - a \in B^{i+1}$.

The following propositions follow easily from the definition:

1. $e_j \in B^0 \cup B^1 \cup \cdots \cup B^{j-1}$ $(j \ge 1)$.

2. No element of $T_j$ lies in $T_1 \cup T_2 \cup \cdots \cup T_{j-1}$.

3. $B^j = \{e_j + d_t\}_{t \in T_j}$.

Corollary: $B^j$ contains as many elements as $T_j$. All elements of $B^j$ are
greater than $e_j$.

4. An element $b'$ of $B^j$ may be written as $n_s - a$ $(s \in T_j,\ a \in A^0)$. Also $n_s$ can be
decomposed into $a + b'$ $(a \in A^0,\ b' \in B^j)$.

5. If $a + b' = n_s$ $(a \in A^0,\ b' \in B^0 \cup B^1 \cup \cdots \cup B^j)$ then $s \in T_1 \cup \cdots \cup T_j$.

Proof: Proposition 5 is true for $j = 0$. We assume that it is true for
$j = i$ and prove it for $j = i + 1$. If $b' \in B^{i+1}$ then $b' = e_{i+1} + d_t$, $t \in T_{i+1}$,
hence $t \notin T_1 \cup T_2 \cup \cdots \cup T_i$, therefore $a + e_{i+1} + d_t = n_s$, $a + e_{i+1} + d_s = n_t$.
If $s \notin T_1 \cup T_2 \cup \cdots \cup T_i$, then $s \in T_{i+1}$ according to the definition of $T_{i+1}$.

6. No element of $B^j$ $(j \ge 1)$ lies in $B^0 \cup B^1 \cup \cdots \cup B^{j-1}$ nor is of the form $N - a$
$(a \in A^0)$.

Proof: An element $b_j$ of $B^j$ is of the form $n_s - a$ $(s \in T_j,\ a \in A^0)$. But the
relation

$n_s = a + b_j$, $\quad s \notin T_1 \cup T_2 \cup \cdots \cup T_{j-1}$, $\quad b_j \in B^0 \cup \cdots \cup B^{j-1}$

contradicts 5. Each element $b_j$ of $B^j$ is of the form $e_j + d_s$ $(s \in T_j)$: the equation
$e_j + d_s = N - a$ $(a \in A^0)$ or $a + e_j = n_s$ contradicts 5 because of

$s \notin T_1 \cup T_2 \cup \cdots \cup T_{j-1}$, $\quad e_j \in B^0 \cup B^1 \cup \cdots \cup B^{j-1}$.

In order to arrive at a uniquely determined construction we shall choose
$e_{i+1}$ as the least number for which $T_{i+1}$ is not empty. The final construction
is then defined as follows:

a) $t$ lies in $T_{i+1}\{e\}$ if there is an $a \in A^0$ such that

$a + e + d_t = n_s$

and neither $t$ nor $s$ in $T_1 \cup \cdots \cup T_i$.

b) $e_{i+1}$ is the smallest $e$ in $B^0 \cup B^1 \cup \cdots \cup B^i$ for which $T_{i+1}\{e\}$ is not empty.

c) $T_{i+1} = T_{i+1}\{e_{i+1}\}$, $B^{i+1} = \{e_{i+1} + d_t\}_{t \in T_{i+1}}$.

The process is to be continued until $T_{J+1}\{e\}$ is empty for every $e \in B^0 \cup B^1 \cup
\cdots \cup B^J$.
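The final construction a), b), c) above can be sketched in code. The following is an illustrative implementation under stated assumptions: $A$ and $B$ are finite Python sets, `limit` is a hypothetical search bound that must be large enough to contain the first $r$ gaps, and all function names are invented for this example.

```python
def gaps_of_sumset(A, B, limit):
    """Positive integers <= limit that are not of the form a + b (0 adjoined to A and B)."""
    A0, B0 = set(A) | {0}, set(B) | {0}
    C0 = {a + b for a in A0 for b in B0 if a + b <= limit}
    return [m for m in range(1, limit + 1) if m not in C0]


def L_set(A, B, N):
    """Positive integers <= N neither in B nor of the form N - a with a in A0."""
    A0 = set(A) | {0}
    return {m for m in range(1, N + 1) if m not in B and (N - m) not in A0}


def mann_construction(A, B, r, limit):
    """Run steps a), b), c) for N = n_r; return N and the stages (e_i, T_i, B^i)."""
    gaps = gaps_of_sumset(A, B, limit)
    if len(gaps) < r:
        raise ValueError("limit too small to contain r gaps")
    n = {s: gaps[s - 1] for s in range(1, r + 1)}
    N = n[r]
    d = {s: N - n[s] for s in range(1, r)}
    A0 = {a for a in A if a <= N} | {0}
    pool = {b for b in B if b <= N} | {0}  # B^0 together with the B^i found so far
    used = set()                           # indices already placed in some T_i
    stages = []
    while True:
        free = [t for t in range(1, r) if t not in used]
        chosen = None
        for e in sorted(pool):             # b) take the smallest admissible e
            T = {t for t in free
                 if any((n[s] - e - d[t]) in A0 for s in free)}  # a)
            if T:
                chosen = (e, T)
                break
        if chosen is None:                 # T_{J+1}{e} is empty for every e
            break
        e, T = chosen
        B_new = {e + d[t] for t in T}      # c)
        stages.append((e, sorted(T), sorted(B_new)))
        used |= T
        pool |= B_new
    return N, stages
```

By Propositions 3 and 6 every number produced in a stage should lie in $L$ and no number should be produced twice, so comparing the stages against `L_set(A, B, N)` gives a quick consistency check of an implementation like this one.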





We are then able to prove the following propositions:

7. $T_{J+1}\{e\}$ is empty for every $e \in B^0 \cup B^1 \cup \cdots \cup B^J$.

8. If $e \in B^0 \cup B^1 \cup \cdots \cup B^{j-1}$ and $T_j\{e\}$ is not vacuous then $e \ge e_j$.

9. $e_1 < e_2 < \cdots < e_J$.

Proof: $e_j = e_{j+1}$ is impossible because $T_{j+1}\{e_j\}$ is empty. $e_j < e_{j+1}$ means
therefore that there is no $e < e_j$ for which $e \in B^0 \cup \cdots \cup B^j$ and $T_{j+1}\{e\}$ is not
empty. If there were, then $T_j\{e\}$ would not be empty. Either $e \in B^0 \cup \cdots \cup
B^{j-1}$, and then $e \ge e_j$ according to 8; or $e \in B^j$, and then $e > e_j$ according
to the corollary to 3.

10. All elements of $B^j \cup \cdots \cup B^J$ are greater than $e_j$ $(1 \le j \le J)$. This
follows from the corollary to 3 and from 9.

11. Let $s$ be the least index not in $T_1 \cup T_2 \cup \cdots \cup T_J$. Let there be $q$ indices
$t$ $(r - 1, r - 2, \cdots, r - q)$ such that $d_t \le n_s$. Then

$l_s \ge q$.

Proof: $n_s - d_t$ $(d_t \le n_s)$ is either in $C^0$ and then $= a + b'$ $(a \in A^0,\ b' \in B^0)$
or $= n_u$ $(u \in T_1 \cup \cdots \cup T_J)$ and hence $= a + b'$ $(a \in A^0,\ b' \in B^0 \cup \cdots \cup B^J)$.

Thus in either case

$n_s - d_t = a + b'$ $\quad (a \in A^0,\ b' \in B^0 \cup \cdots \cup B^J)$.

If $t$ is not in $T_1 \cup \cdots \cup T_J$ then $T_{J+1}\{b'\}$ is not empty, which contradicts 7.

Let $t \in T_j$, $e_j + d_t = b_j \in B^j$. We assert $e_j + d_t \le n_s$. If $e_j + d_t > n_s$,
then

$e_j > n_s - d_t = a + b' \ge b'$.

Thus $b'$ cannot lie in $B^j \cup \cdots \cup B^J$ (see Prop. 10). Hence $b' \in B^0 \cup \cdots \cup
B^{j-1}$. The equation

$a + b' + d_t = n_s$

with $t \notin T_1 \cup \cdots \cup T_{j-1}$, $s \notin T_1 \cup \cdots \cup T_{j-1}$ shows that $T_j\{b'\}$ is not vacuous,
and $b' < e_j$ is impossible by Prop. 8.

$b_j$ is not in $B$ nor of the form $n_s - a$ $(a \in A^0)$ (see 5). Hence it is in $L_s$.
Because of 6 we obtain $q$ distinct such numbers $b_j$ corresponding to the $q$ values
of $t$. Therefore

$l_s \ge q$.

We can prove now our lemma. Suppose that

$r n_t > tN$, $\quad r n_t > (1 + l_t)N$ for $t = 1, 2, \cdots, r - 1$.

We shall show that $T_1 \cup \cdots \cup T_J$ contain all indices $t \le r - 1$. Because of
6 and the corollary to 3 our lemma is an immediate consequence of this fact.
Suppose $s$ is the least index not in $T_1 \cup \cdots \cup T_J$. Let $q$ be defined as in
Proposition 11. Then

$r n_s > (1 + l_s)N \ge (q + 1)N$, $\quad r n_{(r-q-1)} > (r - q - 1)N$.

Adding these two inequalities we obtain
$n_s + n_{(r-q-1)} > N$ or

$n_s \ge d_{(r-q-1)}$.





But then we would have $q + 1$ indices $t = (r - 1, r - 2, \cdots, r - q - 1)$ for
which $d_t \le n_s$, contrary to the significance of $q$. This proves our lemma.

Another result of a different nature can be obtained by the construction
method used in the proof of the fundamental theorem. We propose to prove
Theorem II. If $C^0 = \{a + b\}$ $(a \in A^0,\ b \in B^0)$ and if $\alpha^*$ is the G.L.B. of the
quotients $A(m)/(m + 1)$ and if $n$ is not in $C$, then

(9) $C(n) \ge \alpha^* n + B(n)$.

From equation (4) we have for $r = 1$

$C(n_1) \ge \alpha^* n_1 + B(n_1) + (\alpha - \alpha^*) n_1$.

We propose to prove by induction

(9') $C(n_i) \ge \alpha^* n_i + B(n_i) + (\alpha - \alpha^*) n_1$, $\quad i = 1, 2, \cdots$

We distinguish two cases: 1) $n_r - n_{(r-1)} > 1$; 2) $n_r - n_{(r-1)} = 1$.

Proof for Case 1: The integers $> n_{(r-1)}$ and $\le n_r$ are either in $B$ or of the
form $n_r - a$ $(a \in A^0,\ 0 \le a < n_r - n_{(r-1)})$ or in neither of these two sets. Hence

$n_r - n_{(r-1)} \ge A(n_r - n_{(r-1)} - 1) + B(n_r) - B(n_{(r-1)}) + 1$

$\ge \alpha^*(n_r - n_{(r-1)}) + B(n_r) - B(n_{(r-1)}) + 1$.

By induction we have

$C(n_{(r-1)}) = n_{(r-1)} - (r - 1) \ge \alpha^* n_{(r-1)} + B(n_{(r-1)}) + (\alpha - \alpha^*) n_1$.

Adding both inequalities we obtain (9') for $i = r$.

Proof for Case 2: In this case we have $d_{(r-1)} = 1$, $n_1 - d_{(r-1)} = a + b$
$(a \in A^0,\ b \in B^0)$. We form $T_1\{b\}$ and $B^1$. Let $B^*$ contain the numbers of $B^0$
and $B^1$. Put $C^* = \{a + b'\}$ $(a \in A^0,\ b' \in B^*)$. $n_r$ will then be the $j$th gap,
$j < r$, in $C^*$ and we have by induction

$C^*(n_r) \ge \alpha^* n_r + B^*(n_r) + (\alpha - \alpha^*) n_1$.

But by proposition 5 we have

$C^*(n_r) - C(n_r) = B^*(n_r) - B(n_r)$.

Subtracting this equation we obtain (9') for $i = r$.

It is not possible to substitute $\alpha$ for $\alpha^*$ in (9) as shown by the following example

$A = B = \{1, 2, 6, 7, 8, 12, \cdots\}$,

$C = \{1, 2, 3, 4, 6, 7, 8, 9, 10, 12, \cdots\}$,

$\alpha = \tfrac{2}{5}$, $\quad B(11) = 5$, $\quad C(11) = 9$.

If $C^0 = \{a + b\}$ $(a \in A^0,\ b \in B)$ and if $1 \in B$ then it can be shown exactly by
the same method that $C(n) \ge \alpha^* n + B(n)$ if $n$ is not in $C$.
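The counts quoted in the example can be recomputed directly. The sketch below makes one loud assumption that is not in the paper: the elided tail of $A = B$ is taken to continue with period 6 (all $n \ge 1$ with $n \bmod 6 \in \{0, 1, 2\}$). The values $B(11)$ and $C(11)$ do not depend on that assumption, but the two densities do. All names are hypothetical.

```python
from fractions import Fraction

LIMIT = 600  # hypothetical cut-off for approximating the greatest lower bounds

# ASSUMPTION: the tail of A = B continues periodically with period 6.
A = {n for n in range(1, LIMIT + 1) if n % 6 in (0, 1, 2)}
B = set(A)


def count(S, m):
    """S(m): the number of elements of S among 1, 2, ..., m."""
    return sum(1 for s in S if 1 <= s <= m)


def in_sumset(m):
    """True if m = a + b with a in A0 and b in B0 (0 adjoined to both sets)."""
    return any(a == 0 or a in A for a in (m - b for b in B | {0}) if a >= 0)


B_11 = count(B, 11)
C_11 = sum(1 for m in range(1, 12) if in_sumset(m))
alpha = min(Fraction(count(A, n), n) for n in range(1, LIMIT + 1))
alpha_star = min(Fraction(count(A, n), n + 1) for n in range(1, LIMIT + 1))

# (9) holds with alpha_star at n = 11 but fails if alpha is substituted.
holds_with_alpha_star = C_11 >= alpha_star * 11 + B_11
holds_with_alpha = C_11 >= alpha * 11 + B_11
```

Under the stated periodicity assumption the two minima are attained at $n = 5$, which is why a finite cut-off suffices for this illustration.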


Columbia University 



Annals of Mathematics
Vol. 43, No. 3, July, 1942


A REMARK ON S. KAKUTANI’S CHARACTERIZATION OF (L)-SPACES 1 

By M. F. Smiley 
(Received March 9, 1942) 

The purpose of this note is to show that we may rephrase the condition
(IX) of Kakutani in a way which indicates a close relation between the norm
in an (AL)-space and the metric of a metric lattice.² This will also permit
a slight weakening of assumptions in §§3-5.

We show first that the condition

(IX') $\|x - y\| = \|x \vee y - x \wedge y\|$

is equivalent to (IX) under (I)-(IV), (VI)-(VII).

Proof. (IX) implies (IX'): By Lemma 1.5, $x = x^+ - x^-$ and $x^+ \wedge x^- = 0$;
consequently $\|x\| = \|x^+ - x^-\| = \|x^+ + x^-\|$ by (IX). Since $-x \vee 0 =
-(x \wedge 0)$, we have the equation

$\|x\| = \|x \vee 0 + (-x \vee 0)\| = \|x \vee 0 - x \wedge 0\|$.

Replacing $x$ by $x - y$ and applying Lemma 1.2 yields (IX').

(IX') implies (IX): If $x \wedge y = 0$, then $x \vee y = x + y$ by Lemma 1.3. Hence,
if $x \wedge y = 0$, the condition (IX') yields $\|x - y\| = \|x \vee y\| = \|x + y\|$.
We may now prove that (I)-(IV), (VI)-(IX) imply the following condition.

(1) $\|a \vee x - a \vee y\| + \|a \wedge x - a \wedge y\| = \|x - y\|$.

Proof.³ By (IX') and Lemma 1.6 the left side of (1) is $\|a \vee x \vee y -
a \vee (x \wedge y)\| + \|a \wedge (x \vee y) - a \wedge x \wedge y\|$. Condition (VIII) reduces
this to the expression

$\|a \vee x \vee y - a \vee (x \wedge y) + a \wedge (x \vee y) - a \wedge x \wedge y\|$.

Using Lemma 1.3 we obtain the identity (1) at once.
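Condition (IX') and the identity (1) can be tried out in a concrete model. The sketch below is only an illustration and assumes the familiar example of finite integer vectors with the componentwise order and the norm given by the sum of absolute values; it is not taken from the note, and the function names are hypothetical.

```python
def norm(x):
    """Sum of absolute values of the coordinates."""
    return sum(abs(c) for c in x)


def join(x, y):
    """Componentwise maximum, the lattice join in this model."""
    return tuple(max(u, v) for u, v in zip(x, y))


def meet(x, y):
    """Componentwise minimum, the lattice meet in this model."""
    return tuple(min(u, v) for u, v in zip(x, y))


def sub(x, y):
    """Componentwise difference x - y."""
    return tuple(u - v for u, v in zip(x, y))


def condition_ix_prime(x, y):
    """||x - y|| == ||x v y - x ^ y||."""
    return norm(sub(x, y)) == norm(sub(join(x, y), meet(x, y)))


def identity_1(a, x, y):
    """||a v x - a v y|| + ||a ^ x - a ^ y|| == ||x - y||."""
    left = norm(sub(join(a, x), join(a, y))) + norm(sub(meet(a, x), meet(a, y)))
    return left == norm(sub(x, y))
```

Integer coordinates keep the comparisons exact; for example `identity_1((0, 3), (2, -1), (-4, 5))` evaluates both sides of (1) for one triple.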

The identity (1) shows that the assumption (V) is not needed in §§3-5. We 
may even show that the triangle inequality for the norm is a consequence of 
(I)-(IV), (VI)-(IX) by a simple paraphrase of the proof of Theorem 3.10 of 
G. Birkhoff. 4 


1 Presented to the American Mathematical Society, April 3, 1942. Our title refers to
Kakutani's paper Concrete representation of abstract (L)-spaces and the mean ergodic theorem,
these Annals, vol. 42 (1941), pp. 523-537. All references are to this paper unless the
contrary is explicitly stated.

2 Cf. G. Birkhoff, Lattice Theory, Amer. Math. Soc. Col. Pub., vol. 25 (1940), p. 41.
3 Ibid., p. 42.

4 Ibid., p. 42.




Note added in proof. G. Birkhoff has remarked to the author that $m[a]
= \|a^+\| - \|a^-\|$ is a sharply positive modular functional in an (L)-space.
This result depends only on conditions (I)-(IV), (VI)-(VIII). For, if $a \ge b$,
then $m[a] - m[b] = \|a - b\|$ by (VIII) and Lemma 1.2. Since $\|a\| > 0$
unless $a = 0$, this shows that $m[a]$ is sharply positive. Applying this result
again we find that $m[a] - m[a \wedge b] = \|a - a \wedge b\| = \|a \vee b - b\| =
m[a \vee b] - m[b]$. Thus $m[a]$ is also a modular functional. Our condition
(IX') then merely requires that the metric $\delta(a, b)$ which arises from $m[a]$
should coincide with $\|a - b\|$. When this is true, the identity (1) is a
consequence of (in fact, equivalent to) the distributive law.

Lehigh University.



Annals of Mathematics
Vol. 43, No. 3, July, 1942


GENERALIZED SURFACES IN THE CALCULUS OF VARIATIONS. II

By L. C. Young
(Received December 21, 1941)

Mean Surfaces and the Theory of the Problem $\iint f(x, y, p, q)\,dx\,dy = \text{Min.}$

1. Introduction 

In this second note, machinery is produced for studying extensions of the
classical necessary conditions applicable both in the ordinary and in the
generalized minimum problem for surface integrals. We employ it to derive
necessary and sufficient conditions in the case of an integrand $f(x, y, p, q)$ in which $z$
does not occur explicitly. Our methods follow naturally from the idea of
generalized surface developed in the first note,¹ but are no longer restricted to
continuous integrands $f(x, y, z, p, q)$. We show, in fact, that for a much wider
class of these integrands the minimum in the problem considered is unaltered
if we admit, in addition to ordinary surfaces, only a special kind of generalized
surface which we term "mean surface." The ideas connected with the notion
of a mean surface are closely akin to a method developed by Haar² in the special
case of a regular problem. They lead to a generalization of an inequality, due
to Steiner,³ valid when the integrand has the form $f(x, y, p, q)$; and thence to
equations characterizing a Lipschitzian surface for which our minimum is
attained.

2. Mean surfaces

Let $z_1$ and $z_2$ denote the tracks, and $M_1(g)$ and $M_2(g)$ the averages at $x, y$, of
two generalized surfaces $S_1$ and $S_2$ or, in particular, of two ordinary surfaces.
We define the mean of $S_1$ and $S_2$ as the generalized surface $S$ with the track
$z = \tfrac12 z_1 + \tfrac12 z_2$, and with the average $M(g) = \tfrac12 M_1(g) + \tfrac12 M_2(g)$ at the point $x, y$.
A generalized surface which is the mean of two ordinary surfaces will be termed
shortly a mean surface.

(2.1) Any mean surface is expressible as the mean of two ordinary surfaces whose
tracks differ by at most $\epsilon$.

To prove this, it is clearly sufficient to show that if $S$ is the mean of two
ordinary surfaces $S_1$ and $S_2$ whose tracks $z_1$ and $z_2$ differ by at most $2a$, then
$S$ is also expressible as the mean of a second pair of ordinary surfaces whose
tracks differ by at most $a$. Let $E$, $E'$, $E''$ denote the sets of $x, y$ in which,
respectively,

$|z_1 - z_2| < a$, $\quad z_1 - z_2 \le -a$, $\quad z_1 - z_2 \ge a$;


1 Young [12].

2 Haar [3].

3 Steiner [10], p. 298.



and let $\bar{z}_1$, $\bar{z}_2$ denote the pair of functions which coincide with

$z_1, z_2$ in $E$, $\quad z_2 - a,\ z_1 + a$ in $E'$, $\quad z_2 + a,\ z_1 - a$ in $E''$.

We verify at once that these functions coincide with $z_1$, $z_2$ at boundary points
of $E$, since we then have $z_2 - z_1 = \pm a$; it follows easily that they satisfy a
Lipschitz condition with the same constant as the functions $z_1$ and $z_2$: for any
segment can be decomposed into at most three, each of which has both ends
belonging to the closure of the same set $E$, $E'$, or $E''$, and in these partial
segments the Lipschitz condition for the new functions is identical with the original
one for $z_1$, $z_2$ or their permutation $z_2$, $z_1$.

Finally it is clear that the new functions have the same gradient as the original
ones or their permutation, wherever the four gradients exist, and so almost
everywhere; and since we have also

$\tfrac12 \bar{z}_1 + \tfrac12 \bar{z}_2 = \tfrac12 z_1 + \tfrac12 z_2$ and $|\bar{z}_2 - \bar{z}_1| \le a$,
this completes the proof.
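The halving step in the proof of (2.1) acts pointwise on the two tracks, so it can be illustrated on sampled values. This is a minimal sketch under the assumption that the tracks are given as equal-length lists of exact numbers whose differences are at most $2a$ in absolute value; it checks only the pointwise properties (same mean, gap at most $a$), not the Lipschitz condition, and `halve_gap` is a hypothetical name.

```python
from fractions import Fraction


def halve_gap(z1, z2, a):
    """Replace sampled tracks (z1, z2) by a pair with the same mean and gap at most a."""
    new1, new2 = [], []
    for u, v in zip(z1, z2):
        if abs(u - v) < a:        # the set E: keep the pair unchanged
            p, q = u, v
        elif u - v <= -a:         # the set E': use z2 - a and z1 + a
            p, q = v - a, u + a
        else:                     # the set E'': u - v >= a, use z2 + a and z1 - a
            p, q = v + a, u - a
        new1.append(p)
        new2.append(q)
    return new1, new2


def shrink_until(z1, z2, a, eps):
    """Repeat the halving step, starting from a gap bound 2a, until the bound is <= eps."""
    bound = 2 * a
    while bound > eps:
        bound = bound / 2
        z1, z2 = halve_gap(z1, z2, bound)
    return z1, z2
```

Passing `Fraction` values keeps the repeated halving exact; `shrink_until` mirrors the remark that it is sufficient to pass from a gap of at most $2a$ to a gap of at most $a$.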

3. Integrals over mean surfaces. Comparison of minima 

We shall be dealing with surface integrals of functions $f(x, y, z, p, q)$ measurable
B but not necessarily continuous. The average $M(g)$ at $x, y$ which constitutes
part of the definition of generalized surface, must now be suitably extended
to discontinuous functions $g(p, q)$. In the cases that we shall be concerned with,
this extension is either trivial, or else sufficiently treated by well-known classical
writers.

As in the first note, all surfaces which occur will be supposed Lipschitzian.

In the case of a mean surface $S$, the average $M(g)$ is of the form
$\tfrac12 g(p', q') + \tfrac12 g(p'', q'')$, where $p', q'$ and $p'', q''$ denote positions of $p, q$ dependent
on $x, y$, which will be termed the two possible positions of $p, q$ at $x, y$. In
accordance with our definition of mean surface, these possible positions are further
restricted to be at almost every $x, y$ of the domain $A$ a permutation of the
gradients of at least one fixed pair of ordinary surfaces. The integral $F(S)$ of
$f(x, y, z, p, q)$ over the mean surface $S$ is obtained by writing

$\iint_S f(x, y, z, p, q)\,dx\,dy = \iint_A \tfrac12\{f(x, y, z, p', q') + f(x, y, z, p'', q'')\}\,dx\,dy$,

where on the right $z$ denotes the track of $S$.

The notions of ordinary surface, mean surface, generalized surface give rise
to three different problems of minimum, obtained by interpreting correspondingly
the class of admissible surfaces with a given boundary for which we seek
to make $F(S)$ a minimum. If the corresponding minimal values of $F(S)$ are
denoted by $m_0$, $m_m$, $m_g$, it is evident that

(3.1) $m_0 \ge m_m \ge m_g$,

since every ordinary surface is a mean surface (the mean of two coincident, or
parallel, ordinary surfaces), while every mean surface is a generalized surface.





The results of the first note⁴ show that the inequalities (3.1) reduce to
equalities when $f$ is continuous, since the extreme terms then have the same value.
On the other hand, it is easily seen that at least one of the inequalities is strict
for certain discontinuous functions $f$: thus if we write $f = -1$ when $p, q$ equals
either $-y, x$ or $y, -x$, and $f = 0$ otherwise, we obtain a function $f(x, y, p, q)$
which vanishes almost everywhere on any ordinary surface, but which has the
average $M(f) = -1$ at all points of the generalized surface with the track $z = 0$
and the average $M(g)$ at $x, y$ given by the expression $\tfrac12 g(-y, x) + \tfrac12 g(y, -x)$;
so that $m_0$ vanishes while $m_g$ is negative in this case.

We shall show, in the next paragraph, that for a class of functions including
among others all those of the form $f(x, y, p, q)$ which are bounded in bounded
sets, and in particular the function just considered, the first of the inequalities
(3.1) is necessarily an equality, i.e. we have

(3.2) $m_0 = m_m$.

4. Equality of the minima m 0 and m m 

We now come to the first main theorem of this note, which will enable us to
treat the ordinary problem on the same footing as the very much simpler
generalized problem from the point of view of necessary conditions. This theorem
asserts that (3.2) is true under general conditions. Since this equality has
already been established for continuous integrands $f(x, y, z, p, q)$, the reader
whose interest is limited to classical problems may feel inclined to omit its proof.
It should be noted, however, that it is frequently desirable to transform a
classical integrand into a discontinuous one by moving the origin of $z, p, q$ to a
position which depends on $x, y$. Such a transformation can be used, for instance,
to render certain results of the present note applicable, not only to
Lipschitzian, but also to more general surfaces. If we do not insist on these
extensions, it is on account of the greater simplicity of the statements for the
Lipschitzian case and because we have no evidence that conditions derived from
these by means of the above transformation would be essential ones. It is also,
perhaps, interesting and instructive to see how the necessary conditions of Euler
and Weierstrass, once they have been freed of the formal differential apparatus,
can be applied to integrands which are not even continuous.

(4.1) Suppose firstly that $f(x, y, z, p, q)$ is a function measurable B which is
upper-semicontinuous in $z$ when the other variables are fixed,⁵ and secondly that this
function is bounded above⁶ for any relevant bounded system of $x, y, z, p, q$. Then,
given any mean surface $S$ and real numbers $k$ and $\epsilon$, where $k > F(S)$ and $\epsilon > 0$,
there is an ordinary surface $S'$ with the same boundary as $S$, such that


4 Young [12], (14.1), p. 102.

5 This is in particular the case when $f$ is independent of $z$.

6 This second condition may be relaxed: it is enough to suppose that in $A$ the function $f$
is less than an integrable function of $x, y$ only. Moreover, when $f$ is independent of $z$,
we require this only in the neighbourhood of the boundary of $A$.





(4.2) $F(S') < k$,

and such that, moreover, its track differs from that of $S$ by at most $\epsilon$, and its Lipschitz
constant exceeds that of $S$ by at most $\epsilon$.

Denoting by $n$ an arbitrary positive integer and by $E$ the set of the points
$x, y$ of $A$ whose distance from the boundary of $A$ is not less than $1/n$, we can,
by (2.1) and McShane's lemma,⁷ construct two ordinary surfaces $S'$ and $S''$
having $S$ for their mean in the set $E$, in such a manner that the tracks of $S'$
and $S''$ differ by at most $1/n^2$ in $E$ and coincide with that of $S$ on the boundary
of $A$, while at the same time their Lipschitz constants exceed by at most $1/n$
that of $S$. We shall suppose that $F(S') \le F(S'')$.

Now the tracks $z'$ and $z''$ of our two surfaces tend to the track $z$ of $S$ as $n$
tends to infinity. Moreover, any point $x, y$ of $A$ becomes a point of $E$ if $n$ is
sufficiently large, provided that $x, y$ is not a boundary point; therefore at almost
every point of $A$ the gradients $p', q'$ and $p'', q''$ of our two surfaces are a
permutation of the possible positions of $p, q$ for the mean surface $S$, provided that $n$
is large enough. Hence the average $M(f)$ for $S$ is almost nowhere less than the
upper limit as $n$ tends to infinity of the expression

$\tfrac12\{f(x, y, z', p', q') + f(x, y, z'', p'', q'')\}$,
by semicontinuity of $f$. It follows from a suitable form of Fatou's theorem⁸ that

$\iint_A M(f)\,dx\,dy \ge \limsup \iint_A \tfrac12\{f(x, y, z', p', q') + f(x, y, z'', p'', q'')\}\,dx\,dy$

and so, that $k > \tfrac12 F(S') + \tfrac12 F(S'')$, if $n$ is sufficiently large; clearly this requires
$k > F(S')$. The remaining assertions are satisfied if we choose a value of $n$
exceeding $\epsilon^{-2}$. This completes the proof.

5. A convexity property of the minimum 

From now on, it will be assumed that the integrand of the problem is a
function $f(x, y, p, q)$ which is independent of the variable $z$. This function will be
supposed moreover measurable B and, for the present, bounded above when $x, y$
lies in $A$ and $p, q$ in any bounded set.

Denoting by $U$ a function defined on the boundary of $A$ in such a manner that
there exists a Lipschitzian surface whose boundary is given by $U$, we introduce
the functionals $\varphi_0(U)$, $\varphi_g(U)$, $\varphi_0(U, K)$ and $\varphi_g(U, K)$, any one of which we
designate shortly $\varphi(U)$. We define these to be absolute minima of our
surface-integral for the surfaces with boundary $U$ which belong respectively to the
following four classes:

ordinary Lipschitzian surfaces;

generalized Lipschitzian surfaces;

ordinary surfaces with Lipschitz constants less than $K$;

generalized surfaces with Lipschitz constants less than $K$.

7 Young [12], (9.1), p. 93.

8 Saks [8], (12.10) p. 29, in the form obtained by taking a sequence $a - f_n$ instead of a
non-negative sequence $f_n$.





(5.1) Each of the minima $\varphi(U)$ is convex in $U$.

Before proving this, let us state for completeness, that we term convex a
functional $Q(U)$ in an arbitrary functional space, if given any two points $U'$ and $U''$
of this space at which this functional is defined and finite above, and given
further any real number $\theta$ where $0 \le \theta \le 1$, the functional is defined at the
point $\theta U' + (1 - \theta)U''$ and satisfies

(5.2) $Q[\theta U' + (1 - \theta)U''] \le \theta Q(U') + (1 - \theta)Q(U'')$.

It is known that this definition is equivalent to the following conditions:

the left-hand side of (5.2) is a function of $\theta$ bounded above on the unit segment;

the inequality (5.2) holds for $\theta = \tfrac12$.⁹
We shall use this equivalent form of the definition in proving (5.1).

We observe in the first place that if $U'$ and $U''$ are boundaries of Lipschitzian
surfaces belonging to the relevant class, then the same is the case of
$\theta U' + (1 - \theta)U''$ when $\theta$ lies on the unit segment and indeed when $\theta$ lies on a
segment including the unit segment in its interior. The left-hand side of (5.2),
when we choose $Q(U) = \varphi(U)$, is majorized by the values of $F(S)$ for certain
surfaces with bounded Lipschitz constants, and so is certainly bounded above as
function of $\theta$ on the segment in question.

It remains to verify the second condition. Let $k$ be any number exceeding
$\tfrac12\varphi(U') + \tfrac12\varphi(U'')$ and therefore exceeding $\tfrac12 F(S') + \tfrac12 F(S'')$ where $S'$, $S''$ are certain
suitably chosen admissible surfaces with boundaries $U'$ and $U''$ respectively.
The number $k$ then exceeds $F(S)$, where $S$ denotes the mean of $S'$ and $S''$. But
$S$ has the boundary $\tfrac12 U' + \tfrac12 U''$ and $S$ is either an admissible surface, or can be
added to the admissible surfaces without altering our minimum, and so
$\varphi(\tfrac12 U' + \tfrac12 U'') \le F(S) < k$. This completes the proof of (5.1).

In the case of the generalized problem, the theorem just proved has a variant
which reduces in the regular case to a well-known inequality of Haar-Steiner.¹⁰
We denote by $I(z)$ the minimum of $F(S)$ for surfaces $S$ with the track $z$, and
we note that in the regular case $I(z)$ reduces to the value of $F(S)$ for the ordinary
surface with this track.

(5.3) The functional $I(z)$ is convex.¹¹

To prove this, we select an arbitrary real number $\theta$ on the unit segment and
two Lipschitzian tracks $z'$ and $z''$. We denote by $S'$ and $S''$ two admissible
generalized surfaces with these tracks, and by $M'(g)$, $M''(g)$ the corresponding
averages at $x, y$; and we write $S$ for the generalized surface with the track
$\theta z' + (1 - \theta)z''$ and the average $\theta M' + (1 - \theta)M''$ at $x, y$. Then

$I(\theta z' + [1 - \theta]z'') \le F(S) = \theta F(S') + [1 - \theta]F(S'')$,

9 The equivalence of the definitions easily reduces to the case of real functions treated
by Hardy, Littlewood and Pólya [6].

10 Haar [3], p. 227, Inequality (3). 

11 We assert this only for the generalized problem. The same is then evidently true of 
the ordinary problem only in the regular case. 



GENERALIZED SURFACES IN CALCULUS OF VARIATIONS 


635 


from which it follows, passing to the lower bound for all S' and S'', that

$$I(\theta z' + [1 - \theta]z'') \le \theta I(z') + [1 - \theta]I(z'').$$

This completes the proof.

For our purposes, a second variant of (5.1), valid this time for ordinary
admissible surfaces as well as for generalized ones, is required. This variant
constitutes a minor extension of the original theorem.

6. Prescribed discontinuities 12 

The classical minimum problems for continuous surfaces may be regarded as
special cases of corresponding problems for more general surfaces whose
discontinuities are specified. We shall require the extension of (5.1) to a problem
of this type, in which we admit piecewise Lipschitzian surfaces with prescribed
discontinuities along given parallels to the coordinate axes.

By a piecewise Lipschitzian surface S (ordinary or generalized) we shall mean
the aggregate of a finite number of Lipschitzian portions of surface $S_{ik}$ (ordinary
or generalized) defined on partial domains $A_{ik}$ of a subdivision of A by a finite
number of parallels to the axes. It will be understood that S is regarded as
unaltered by subsequent subdivision of its constituent Lipschitzian portions.
By the surface integral F(S), we shall mean similarly the sum of the surface
integrals $F(S_{ik})$ over these portions, and this definition is again invariant under
further subdivision.

By the discontinuity of S, we shall mean a function V(x, y) defined only at
interior points of A and possibly undefined at a finite number of these, such
that V vanishes at interior points of each $A_{ik}$ while on the line separating two
adjacent partial domains $A_{i+1,k}$ and $A_{ik}$, or else $A_{i,k+1}$ and $A_{ik}$, the function V
is the difference of the tracks of the corresponding Lipschitzian portions.
Besides this discontinuity V, we have a boundary-function U just as in the case
of a continuous surface.

It is convenient to combine the two functions, U on the boundary of A and V
in the interior of A, into a single function W defined throughout A (except
possibly at a finite number of points). The function W then specifies the
boundary and discontinuity of S.

(6.1) The functions W which correspond in this way to at least one such
piecewise Lipschitzian surface S constitute a vector-space.

For if S' and S'' are two piecewise Lipschitzian surfaces and W' and W''
specify their boundaries and discontinuities, we can divide A into portions $A_{ik}$
corresponding to continuous portions of both surfaces simultaneously.
Denoting by $z'_{ik}$ and $z''_{ik}$ the tracks of these portions we define a piecewise
Lipschitzian surface S consisting of the continuous portions of ordinary surfaces

12 We do not study this problem for its own sake, but in order to obtain machinery for
the special case in which there are no discontinuities. In the literature only problems with
unspecified discontinuities appear to have been treated.



536 


L. C. YOUNG 


given by the track $a z'_{ik} + b z''_{ik}$ on $A_{ik}$. This surface S then has the boundary
and discontinuity specified by the function W = aW' + bW'', and this proves
(6.1). Moreover, we see that S has in each of its continuous portions a Lipschitz
constant not exceeding |a| + |b| times the greatest of those of S' and S'', so
that we have the following result:

(6.2) If W' and W'' correspond to piecewise Lipschitzian surfaces whose
continuous portions have Lipschitz constants less than K, then aW' + bW'' corresponds
to at least one piecewise Lipschitzian surface whose continuous portions have
Lipschitz constants less than (|a| + |b|)K.

This being so, we define by analogy with §5 the functionals of W consisting
of the absolute minima of F(S) for the piecewise Lipschitzian surfaces S with
boundary and discontinuity specified by W which can be subdivided into
continuous portions belonging to the same four classes as previously. As these
functionals reduce to those of §5 when the variable W reduces to U, i.e. when
there is no discontinuity, we shall designate them by the symbols $\varphi_o(W)$, $\varphi_g(W)$,
$\varphi_o(W, K)$ and $\varphi_g(W, K)$, and any one by the symbol $\varphi(W)$.

As an immediate consequence of (6.2) we have: 

(6.3) Any two points of the W-space for which $\varphi(W) \ne +\infty$ are inner points of a
segment on which $\varphi(W)$ is bounded above.

Moreover, by exactly the same argument as that used to prove the corre¬ 
sponding inequality in §5, we see that 

(6.4) $$\varphi(\tfrac12 W' + \tfrac12 W'') \le \tfrac12\varphi(W') + \tfrac12\varphi(W'').$$

Combining (6.3) and (6.4), we deduce: 

(6.5) Each of the minima $\varphi(W)$ is convex in W.

7. Existence of a “flat support” 

The property of convexity of the functionals $\varphi(U)$, I(z), $\varphi(W)$ owes its
importance to the fact that it places at our disposal the powerful theorem of
Minkowski asserting the existence of a “flat support.” We shall follow Banach 13
in proving a form of this theorem valid in general vector spaces which covers
our needs.

We consider a space whose elements are abstract vectors X. As is customary,
a functional L(X) is termed linear if L(cX) = cL(X) for any constant c, and
L(X + Y) = L(X) + L(Y); and since this definition implies that L(0) = 0,
it is frequently necessary to introduce an additive constant when the analogy
with Euclidean space is to be preserved.

The expression a + L(X), where a is a constant and L(X) a linear functional,
is termed “flat support” at $X_0$ of a given functional Q(X), if it never exceeds
Q(X) and if it takes the value $Q(X_0)$ at $X_0$.
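The definition can be checked numerically in the simplest setting. The following sketch is an illustration only and is not part of the original argument: it assumes the one-dimensional space of real numbers and the convex functional Q(x) = x^2, and the names `Q` and `flat_support` are hypothetical. The tangent line at x0 then plays the role of a + L(X).

```python
def Q(x):
    """A convex functional on the real line (illustrative choice, not from the paper)."""
    return x * x


def flat_support(x0):
    """Return (a, L) such that a + L(x) is the tangent line of Q at x0.

    L is linear (so L(0) = 0) and a is the additive constant of the definition.
    """
    slope = 2.0 * x0      # derivative of x^2 at x0
    a = -x0 * x0          # chosen so that a + L(x0) = Q(x0)

    def L(x):
        return slope * x

    return a, L


a, L = flat_support(1.5)
samples = [i / 10.0 for i in range(-50, 51)]

# The support never exceeds Q on the sample points and touches Q at x0 = 1.5.
never_exceeds = all(a + L(x) <= Q(x) + 1e-12 for x in samples)
touches = abs(a + L(1.5) - Q(1.5)) < 1e-12
```

The additive constant is needed precisely because L(0) = 0: the tangent line at x0 = 1.5 does not pass through the origin.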


13 Banach [1], p. 27, Theorem 1.






One more definition: if G is a vector subspace, we denote by $\{X_0, G\}$ the
vector space or subspace which consists of all vectors Y of the form $X + tX_0$
where X lies in G and t is real. It is easy to see that if $X_0$ does not lie in G,
this expression of a vector Y of $\{X_0, G\}$ is unique. We shall denote further by
$\{X_0, G\}_+$ and $\{X_0, G\}_-$ the systems of vectors Y of the above form with t
positive, and with t negative, respectively.

(7.1) Let Q(X) be a convex functional defined in a vector-space, and let L(X) be a
linear functional defined in a vector subspace G and such that throughout G we
have $L(X) \le Q(X)$; suppose further that $X_0$ is any point of the vector-space such
that Q(X) is defined and finite above for at least one X of each of the sets $\{X_0, G\}_+$
and $\{X_0, G\}_-$. Then we can extend the definition of L(X) in such a manner that
this functional be defined, linear, and not greater than Q(X) throughout $\{X_0, G\}$.

In both the hypothesis and the conclusion of the theorem, the inequality 
connecting L(X) and Q(X) is regarded as automatically true at points X for 
which Q(X) is either not defined or not finite above. 

In proving (7.1), we may clearly suppose that $X_0$ does not lie in G. Denoting
by $k_0$ a number to be fixed later, we continue L(X) by writing, for real t and for
arbitrary X in G,

$$L(X + tX_0) = L(X) + tk_0,$$

and this continuation is linear in $\{X_0, G\}$, so that we have only to show that it
does not exceed $Q(X + tX_0)$. Since this last condition holds for t = 0, we
need only verify it for t = a and t = -a', where a, a' are positive; it becomes

(7.2) $$\{L(X) - Q(X - a'X_0)\}/a' \le k_0 \le \{Q(X' + aX_0) - L(X')\}/a$$

where X and X' are arbitrary vectors of G.

It only remains to establish the existence of a finite $k_0$ which satisfies (7.2)
for all a, a', X, X'. Moreover, since Q is defined and finite above somewhere
in each of the sets $\{X_0, G\}_+$ and $\{X_0, G\}_-$, we see that the upper bound of the
expression on the extreme left of (7.2) is not $-\infty$, and the lower bound of the
expression on the extreme right is not $+\infty$. The existence of $k_0$ is thus ensured
if we have an inequality between these bounds, which is equivalent to the
statement that every value of the extreme left of (7.2) is less than or equal to every
value on the extreme right.

Now this last statement, after multiplying by aa'/(a + a'), reduces to the
inequality

$$\frac{aL(X) + a'L(X')}{a + a'} \le \frac{aQ(X - a'X_0) + a'Q(X' + aX_0)}{a + a'}$$

and the latter is satisfied since, if X'' is the vector (aX + a'X')/(a + a') of G
(which may also be written $[a(X - a'X_0) + a'(X' + aX_0)]/(a + a')$), the ratio
on the left equals L(X'') by linearity, while that on the right is not less than
Q(X'') by convexity and so, by hypothesis, not less than L(X''). This
completes the proof, which follows closely that given in a special case by Banach.
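The one-step extension in (7.1) can be tried out on a small example. The sketch below is an illustration under stated assumptions, not the paper's construction in general: the vector space is the plane, G is the x-axis, $X_0$ = (0, 1), the convex functional is Q(x, t) = sqrt(x^2 + t^2) + t/2, and L(x) = 0.6x on G. All function names are hypothetical, and the two bounds of (7.2) are only approximated on a finite grid.

```python
import math


def Q(x, t):
    """Convex functional on the plane (illustrative choice): a norm plus a linear term."""
    return math.hypot(x, t) + 0.5 * t


def L_on_G(x):
    """Linear functional on the subspace G = {(x, 0)}; it satisfies L <= Q on G."""
    return 0.6 * x


def extension_bounds(xs, steps):
    """Approximate both sides of (7.2) over a grid of X, X' in G and of a, a' > 0."""
    lower = max((L_on_G(x) - Q(x, -a)) / a for x in xs for a in steps)
    upper = min((Q(x, a) - L_on_G(x)) / a for x in xs for a in steps)
    return lower, upper


xs = [i / 20.0 for i in range(-200, 201)]
steps = [0.25, 0.5, 1.0, 2.0, 4.0]
lower, upper = extension_bounds(xs, steps)

# Any k0 between the two bounds is admissible; the midpoint is one choice.
k0 = 0.5 * (lower + upper)


def L_ext(x, t):
    """Extension of L to the whole plane: L(X + t X0) = L(X) + t k0."""
    return L_on_G(x) + t * k0
```

On this example the left bound is close to -0.3 and the right bound close to 1.3, so the admissible $k_0$ fill an interval rather than a single point; the extension is therefore not unique in general.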





As a consequence of the theorem just proved, we deduce for our convex
functionals $\varphi(W)$, still using only ideas set out by Banach, 14 the following corollary:

(7.3) Each of the functionals $\varphi(W)$ of §6 has a flat support at any $W_0$ for which
its value $\varphi(W_0)$ is finite.

To prove this, we write Q(X) for the difference $\varphi(W_0 + X) - \varphi(W_0)$ and,
since the latter vanishes for X = 0, we need only obtain a linear functional
L(X) nowhere exceeding Q(X). We shall construct such an L(X) by
transfinite induction, first observing that it clearly exists in the space consisting of
the single point 0. Let the points at which Q is defined and finite above be
well-ordered, the origin 0 being first. Let $X_0$ be a subsequent point and let G
be the vector subspace consisting of the linear combinations of vectors
previous to $X_0$.

We make the inductive hypothesis that L(X) exists in G with the properties
required. By (6.3) the functional Q is bounded above, and so finite above, on a
segment containing 0 and $X_0$ in its interior, and therefore at some point of each
of the sets $\{X_0, G\}_+$ and $\{X_0, G\}_-$. By (7.1) we can therefore continue L(X)
into $\{X_0, G\}$, and so, by transfinite induction, into the whole of the vector
subspace consisting of points which are linear combinations of those at which
the functional Q is defined and finite above. This subspace is precisely the
whole of our vector space by (6.2).

8. Necessary conditions for an attained minimum

Denoting by $U_0$ a particular boundary-function, we now propose to examine
some consequences of the hypothesis that one of the minima $\varphi(U_0)$ is attained,
i.e. that for some surface $S_0$ having the boundary $U_0$ and belonging to the
appropriate one of the four classes introduced in §5, we have $\varphi(U_0) = F(S_0)$.

We shall modify our original assumption concerning the function f(x, y, p, q),
and suppose that it is not this function but the difference

$$f(x, y, p, q) - f_0(x, y)$$

which is bounded above when x, y lies in A and p, q lies in a bounded set, where
$f_0(x, y)$ denotes the average of f at x, y for the surface $S_0$ or, in particular, in the
case of an ordinary surface, the value of f when we substitute for p, q the gradient
of $S_0$. In addition, we shall suppose, as is really implied in the fact that $F(S_0)$
exists, that $f_0(x, y)$ is integrable in some sense on every portion of A. It is then
possible to write $f_0(x, y) = 0$ without effective loss of generality, and we shall do
this when convenient in any proof.

We shall denote by $z_0(x, y)$, or simply $z_0$, the track of $S_0$, and by $W_0$ the
abstract vector which specifies its boundary $U_0$ together with the discontinuity
zero. We denote further by z(x, y), or simply by z, the track of an arbitrary
admissible Lipschitzian surface S, and by f(x, y) the average at x, y of the
function f(x, y, p, q) for this surface. Finally we denote by $\Delta$ an arbitrary interval
of the x, y plane.

14 Banach [1], p. 29, Corollary.

(8.1) There exists a linear functional $L(z; \Delta)$ of z, additive in $\Delta$, dependent only
on the values of z on the boundary of $A\cdot\Delta$, such that

(8.2) $$\iint_{A\cdot\Delta} f(x, y)\,dx\,dy \ge \iint_{A\cdot\Delta} f_0(x, y)\,dx\,dy + L(z; \Delta) - L(z_0; \Delta)$$

for every admissible surface (i.e. for every Lipschitzian surface of the appropriate
class), whatever its boundary U may be.

(8.3) In particular, taking for $\Delta$ an interval including A, there is a linear
functional L(z) depending only on the boundary-values U of the track z of the surface S,
such that

(8.4) $$F(S) \ge F(S_0) + L(z) - L(z_0).$$

It will be observed that this last statement includes as a special case the
minimizing hypothesis from which we started, namely that

$$F(S) \ge F(S_0) \quad \text{when } U = U_0,$$

since we then have $L(z) = L(z_0)$. So that (8.3), and a fortiori (8.1), are theorems
in which the hypotheses are exploited to the full.

We may suppose $f_0(x, y) = 0$, so that f(x, y, p, q) now satisfies the hypotheses
of §5. We have then $\varphi(W_0) = 0$, so that by (7.3) there is a linear functional of
$W - W_0$ nowhere exceeding $\varphi(W)$, and a fortiori not exceeding the values of $F(\bar S)$
for a discontinuous $\bar S$ of the appropriate type whose discontinuity and boundary
are specified by W. We select for $\bar S$ a surface which coincides in $\Delta$ with an
admissible continuous surface S and outside $\Delta$ with the minimizing surface $S_0$.
The linear functional of $W - W_0$ obtained becomes a functional of the track
of S and of the interval $\Delta$, clearly dependent only on the boundary-values of
this track z in $A\cdot\Delta$, and which we denote by $L(z - z_0, \Delta)$. The functional
$L(z, \Delta)$ is moreover linear in z and additive in $\Delta$, by construction; furthermore
it satisfies (8.2) since the left-hand side is $F(\bar S)$. This completes the proof.

We shall now proceed to derive from (8.2) further information regarding the
functional $L(z, \Delta)$ whose existence is asserted there.

(8.5) There exist bounded measurable functions of x, y, the functions P(x, y) and
Q(x, y) say, such that

(8.6) $$L(z, \Delta) = \iint_{A\cdot\Delta} (P z_y - Q z_x)\,dx\,dy.$$

In proving this, we again suppose $f_0(x, y) = 0$. Moreover we may restrict
ourselves, by homogeneity, to tracks z of surfaces whose Lipschitz constants
are sufficiently small for $z + z_0$ to be the track of an admissible surface. The
function f(x, y, p, q) being now bounded above for the relevant surfaces, it





follows by applying (8.2) to surfaces with the track $z + z_0$, that $L(z, \Delta)$ cannot
exceed $N\cdot|\Delta|$, where N is a constant and $|\Delta|$ is the area of $\Delta$. We can
therefore express $L(z, \Delta)$ as the integral of its derivative in A, the latter existing
almost everywhere for fixed z.

In particular, when z is a linear function ax + by + c, the functional L,
which is clearly unaltered by passing to a parallel surface, will be of the form
aA + bB where A and B are functions of $\Delta$ possessing bounded derivatives
almost everywhere in the domain. Denoting by -Q and +P these derivatives at x, y, and
writing for instance P = Q = 0 when they do not exist, the formula to be proved
clearly holds for linear tracks z.

In order to establish the formula generally, we denote its right-hand side by
$L_1(z, \Delta)$. We denote further by $z_1$ a track with the same boundary as z on $A\cdot\Delta$
and with a Lipschitz constant increased by as little as we please, whose gradient
in a subset of $\Delta$ of measure as close as we please to that of $\Delta$ differs arbitrarily little
from that of z and is constant in a neighborhood of each point of this subset. 15
Now the difference of $L_1(z_1, \Delta)$ and $L_1(z, \Delta)$ is arbitrarily small since it is less
than a constant multiple of the integral of the absolute difference of the gradients
of z and $z_1$. Moreover $L(z_1, \Delta)$ and $L_1(z_1, \Delta)$ have the same derivative outside
a set of small measure, and their derivatives elsewhere are bounded; therefore their
difference also is as small as we please. Finally $L(z, \Delta)$ and $L(z_1, \Delta)$ coincide
since z and $z_1$ have the same boundary on $A\cdot\Delta$. Combining these results, we
see that $L(z, \Delta)$ and $L_1(z, \Delta)$ differ arbitrarily little, and so coincide. This
completes the proof of (8.5).

These results can now be sharpened still further by applying the following
lemma due to Haar 16 and Schauder: 17

(8.7) In order that the integral $\iint_A (P z_y - Q z_x)\,dx\,dy$, where P, Q are bounded
measurable functions of x, y, shall take the value 0 whenever z is a Lipschitzian
function vanishing on the boundary of A, it is both necessary and sufficient that
there exist a Lipschitzian function Z whose gradient is almost everywhere P, Q.

It is assumed in the above lemma that A is a bounded simply-connected 
domain. 18 

As a consequence of these results, our main theorem becomes: 

(8.8) In order that $F(S_0) = \varphi(U_0)$, where $S_0$ satisfies the remaining hypotheses of
the beginning of this paragraph, it is necessary and sufficient that there exist a
Lipschitzian function Z such that the following inequality be verified for every
admissible surface (whose track z has an arbitrary boundary):

(8.9) $$\iint_{A\cdot\Delta} \{f(x, y) - f_0(x, y) - Z_x(z_y - z_{0y}) + Z_y(z_x - z_{0x})\}\,dx\,dy \ge 0.$$

15 The existence of such a track is ensured by (12.1) p. 98 of the first note, Young [12].

16 Haar [2], [4].

17 Schauder [9].

18 Our results have been mainly stated for convex domains but hold generally. At this 
stage however they extend only to simply-connected domains. 





9. Localized form of the conditions. Connection with Weierstrass’ condition
and with the Euler-Haar equations

Following Bonnet and Haar, we term adjoint 19 of the minimizing surface $S_0$,
the ordinary surface with the track Z(x, y) whose existence is established in (8.8).

Our object is to reduce (8.9) to the inequality

(9.1) $$f(x, y, p, q) - f_0(x, y) - P(q - q_0) + Q(p - p_0) \ge 0,$$

where $p_0$, $q_0$ and P, Q are the gradients at x, y of the tracks $z_0$ and Z. The
validity of (9.1) for all p, q will be termed the condition (W, E) at x, y; similarly,
its validity for all p, q interior to the circle of radius K and center at the origin,
will be termed the condition $(W, E)_K$. These terms are similar to those adopted
by the writer in the case of curves 20 and the most elementary considerations
show that for a function f(x, y, p, q) possessing partial derivatives in p and q
at x, y (and in particular for the integrands of all classical problems), the
condition (W, E) is wholly equivalent to the simultaneous validity of

$$f_{p_0} = -Q, \qquad f_{q_0} = P,$$

$$f(x, y, p, q) - f_0(x, y) - f_{p_0}\cdot(p - p_0) - f_{q_0}\cdot(q - q_0) \ge 0.$$

The first two relations are Haar's form of the Euler equations, while the third
is Weierstrass’ condition.
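As a worked illustration, not taken from the paper, consider the Dirichlet integrand $f = p^2 + q^2$ with an ordinary surface $S_0$ whose track $z_0$ is assumed smooth, so that $f_0 = p_0^2 + q_0^2$. Under these assumptions the three relations above take the following explicit form.

```latex
% Assumption: f(x,y,p,q) = p^2 + q^2, z_0 twice continuously differentiable.
\begin{aligned}
  f_{p_0} &= 2p_0 = -Q, \qquad f_{q_0} = 2q_0 = P,
  &&\text{so } Z_x = 2\,z_{0y},\quad Z_y = -2\,z_{0x}; \\
  Z_{xy} &= Z_{yx}
  &&\Longrightarrow\quad z_{0xx} + z_{0yy} = 0; \\
  f - f_0 &- f_{p_0}(p - p_0) - f_{q_0}(q - q_0)
   = (p - p_0)^2 + (q - q_0)^2 \;\ge\; 0 .
\end{aligned}
```

In this example the existence of the adjoint Z forces $z_0$ to be harmonic, which is the classical Euler equation of the Dirichlet problem, and the Weierstrass condition holds automatically because the integrand is convex in p, q.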

A set E of values of x, y, p, q will be termed superficially negligible, if given
any ordinary Lipschitzian surface S with the track z(x, y), the set of the points
x, y, p, q of E for which p, q is the gradient of this track has a projection in the
x, y plane of measure zero. In view of this definition, it is clear that an
alteration of f(x, y, p, q) in such a set of values of x, y, p, q cannot affect the value
of F(S) for ordinary Lipschitzian surfaces. On the other hand, the example
considered in §3 shows that such an alteration may affect the values of F(S)
for generalized S. This difference between the ordinary and the generalized
problems will cause slight differences between the corresponding localized forms
of the conditions for an attained minimum, which we state in (9.2) and (9.3)
below.

We begin with the ordinary problem. 

(9.2) With the hypotheses of (8.8), where we now suppose $S_0$ an ordinary surface,
a necessary and sufficient condition for an ordinary minimum is that the inequality
(9.1) be verified for a corresponding adjoint surface with the gradient P, Q except
at most in a superficially negligible set of x, y, p, q.

This is clearly equivalent to the condition that the integrand of the left-hand
side of (8.9) be non-negative for every admissible surface except at most in a
set of x, y of plane measure 0; and this in its turn is clearly equivalent to (8.9)
itself. A similar result holds for the ordinary minimum in the class of surfaces
with Lipschitz constants less than K.


19 Haar [5].

20 Young [11].





The corresponding results for the generalized problem are more complete and 
also not so easy to deduce from (8.8). 

(9.3) With the hypotheses of (8.8), a necessary and sufficient condition for a
minimum among generalized surfaces whose Lipschitz constants are either
unrestricted or less than a fixed K, is that at almost all points x, y of A the minimizing
surface $S_0$ satisfy either the condition (W, E), or else the condition $(W, E)_K$.

The main difference between this statement and the one concerning ordinary
minima is that more information is given here about the exceptional set of
x, y, p, q for which (9.1) need not hold, its projection in the x, y plane being this
time a null set. Incidentally the combination of the two results will show that
if an ordinary minimum is attained, then by altering f for at most a superficially
negligible set of x, y, p, q we ensure that the same minimizing surface provides a
generalized minimum.

In proving (9.3), we may suppose as usual $f_0(x, y) = 0$. We shall denote
by $K_0$ any number exceeding the Lipschitz constant of $S_0$ and less than the
upper bound of the Lipschitz constants of all admissible surfaces. We shall
denote further by $f^*(x, y, p, q)$ the lower bound of the expression

(9.4) $$\sum_{k=1}^{N} a_k f(x, y, p_k, q_k)$$

as function of the integer N and of the $a_k$, $p_k$, $q_k$ where k = 1, ..., N when
these variables are restricted by the conditions

(9.5) $$a_k \ge 0, \quad p_k^2 + q_k^2 \le K_0^2, \quad \sum_{k=1}^{N} a_k = 1, \quad \sum_{k=1}^{N} a_k p_k = p, \quad \sum_{k=1}^{N} a_k q_k = q.$$

It is easy to see that the lower bound of (9.4) is unaltered if we impose the
additional restriction $N \le 3$. For if $N \ge 4$, there exists a system of numbers $b_k$,
such that not all $b_k$ vanish, which fulfil the equations

$$\sum_{k=1}^{N} b_k = \sum_{k=1}^{N} b_k p_k = \sum_{k=1}^{N} b_k q_k = 0;$$

we can arrange, by changing the signs if necessary throughout, that

$$\sum_{k=1}^{N} b_k f(x, y, p_k, q_k) \le 0,$$

and, by multiplying through by a suitable positive constant (remembering that
at least one $b_k$ must be negative), that the least of the numbers

$$a'_k = a_k + b_k$$

is equal to zero. It follows that we can reduce the value of N without
increasing (9.4), by taking as new system of $a_k$, $p_k$, $q_k$ those of the $a'_k$, $p_k$, $q_k$
for which $a'_k$ does not vanish, and so, by induction we can make $N \le 3$.
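The reduction step just described can be sketched in code. The following is a minimal illustration in pure Python, assuming floating-point arithmetic with a small tolerance; the helper names `null_vector` and `reduce_to_three`, and the sample integrand used afterwards, are hypothetical and not taken from the paper.

```python
def null_vector(m, tol=1e-12):
    """Return a nonzero b with m * b = 0, for a matrix m with more columns than rows."""
    rows = [list(r) for r in m]
    n_cols = len(rows[0])
    pivots = []                       # list of (row index, column index)
    r = 0
    for c in range(n_cols):
        if r == len(rows):
            break
        best = max(range(r, len(rows)), key=lambda i: abs(rows[i][c]))
        if abs(rows[best][c]) <= tol:
            continue                  # no pivot in this column
        rows[r], rows[best] = rows[best], rows[r]
        piv = rows[r][c]
        rows[r] = [v / piv for v in rows[r]]
        for i in range(len(rows)):
            if i != r:
                factor = rows[i][c]
                rows[i] = [vi - factor * vr for vi, vr in zip(rows[i], rows[r])]
        pivots.append((r, c))
        r += 1
    pivot_cols = {c for _, c in pivots}
    free = next(c for c in range(n_cols) if c not in pivot_cols)
    b = [0.0] * n_cols
    b[free] = 1.0                     # one free variable set to 1, the others to 0
    for row, col in pivots:
        b[col] = -rows[row][free]
    return b


def reduce_to_three(weights, points, f):
    """Reduce a convex combination to at most 3 points without raising sum a_k f(p_k, q_k)."""
    a, pts = list(weights), list(points)
    while len(pts) > 3:
        # Equations: sum b_k = sum b_k p_k = sum b_k q_k = 0.
        m = [[1.0] * len(pts), [p for p, _ in pts], [q for _, q in pts]]
        b = null_vector(m)
        if sum(bk * f(p, q) for bk, (p, q) in zip(b, pts)) > 0:
            b = [-bk for bk in b]     # change signs so that sum b_k f <= 0
        # Positive multiple of b making the least of a_k + t b_k equal to zero.
        t = min(-ak / bk for ak, bk in zip(a, b) if bk < 0)
        a = [ak + t * bk for ak, bk in zip(a, b)]
        keep = [i for i, ak in enumerate(a) if ak > 1e-12]
        a = [a[i] for i in keep]
        pts = [pts[i] for i in keep]
    return a, pts


def weighted(ws, pts, g):
    """Weighted sum of g over the points."""
    return sum(w * g(p, q) for w, (p, q) in zip(ws, pts))


def f(p, q):
    """A non-convex sample integrand in p, q (illustrative only)."""
    return (p * p - 1.0) ** 2 + q * q


points = [(-1.0, 0.0), (1.0, 0.0), (0.0, 1.0), (0.0, -1.0), (0.5, 0.5)]
weights = [0.2] * 5
new_weights, new_points = reduce_to_three(weights, points, f)
```

Each pass removes at least one point while keeping the three constraints of (9.5) that fix the total weight and the mean gradient; the constraint on the magnitudes of the $p_k$, $q_k$ is untouched because no new points are introduced.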





We shall require, for the purpose of proving (9.3), the following properties
of the function $f^*(x, y, p, q)$:

(9.6) (i) For constant x, y the function $f^*(x, y, p, q)$ is convex in p, q.

(ii) For constant p, q the function $f^*(x, y, p, q)$ is measurable in x, y.

The first of these is an immediate consequence of the definition. To prove
(ii), it is sufficient to prove that the set of x, y at which any given number c
is not greater than $f^*$ is measurable; this set is clearly the projection in the x, y
plane of a set of x, y, $p_k$, $q_k$, $a_k$ (k = 1, 2, 3) for which (9.5) holds and for which
the expression (9.4) is greater than or equal to c; as the projection of a set
measurable B, it is therefore itself 21 measurable (though not in general
measurable B). This completes the proof of (9.6) (ii).

We shall still require one further lemma: 

(9.7) Given any finite measurable function h(x, y) greater than $f^*(x, y, p, q)$,
there exists a generalized surface with constant gradient p, q and with a Lipschitz
constant not exceeding $K_0$, such that M(f) < h for almost all x, y of A.

To prove this, we consider the Borel set of x, y, $p_k$, $q_k$, $a_k$ (k = 1, 2, 3) in
space of eleven dimensions, for which (9.5) holds and for which the expression
(9.4) is less than h(x, y). The projection of this set in the x, y plane contains
every point of A, therefore 22 the set itself contains the graph of a system of nine
functions $p_k$, $q_k$, $a_k$ (k = 1, 2, 3) of x, y measurable in A. Substituting these
functions in (9.4), this expression becomes an average M(f) whose value is a
measurable function of x, y less than h(x, y).

We are now in a position to establish (9.3). For any p, q of magnitude not
exceeding $K_0$, it is clear that $f^*$ is a finite function of x, y not exceeding f,
provided that we exclude if necessary a null set of x, y where $f^* = -\infty$ (if this last
set were not a null set, it would still have positive measure when p, q is replaced
by the gradient of $S_0$). Moreover, by excluding a suitable null set, we may
suppose $f^*$, for the given constant p, q, to be measurable B in x, y. Choosing
h(x, y) to be $f^*(x, y, p, q) + \epsilon$ in this set, and say $f(x, y, p, q) + \epsilon$ in the
complementary null set, it clearly follows from (9.7) and (8.9) that for almost all
x, y of A,

$$f^* + \epsilon \ge f_0 + Z_x(q - z_{0y}) - Z_y(p - z_{0x}).$$

From this inequality, making $\epsilon$ describe a sequence with limit 0 and making p, q
describe all rational points of magnitude not exceeding $K_0$, we deduce that
outside a fixed null set, the inequality

$$f^* \ge f_0 + Z_x(q - z_{0y}) - Z_y(p - z_{0x})$$

holds for all such rational p, q. This same inequality, and a fortiori the
inequality (9.1), then holds for the same set of x, y and all p, q whose magnitude
does not exceed $K_0$, since $f^*$ is continuous in p, q on account of its finiteness and
convexity. Finally, making $K_0$ describe a sequence with the limit $\infty$ or K, we
see that (9.1) must still hold outside a null set of x, y for all relevant p, q, and
this is what we asserted. The converse being a trivial consequence of (8.8),
this completes the proof.

21 Lusin [7], p. 144 & p. 152; cf. also Saks [8], p. 47-50.

22 By the argument of Young [11], p. 237, lemma (5.2) & p. 256, 257.

Let us remark in conclusion that the converse part of these theorems can be
strengthened: A surface $S_0$ satisfying the condition (W, E) with a Lipschitzian
adjoint, provides a minimum in a wider class of surfaces than those originally
admitted, namely for all surfaces whose track z is such that the integral
$\iint (Z_x z_y - Z_y z_x)\,dx\,dy$ depends only on the boundary values of the variable
track z.

It has been proved in the case of a rectangle A, and the proof extends easily 
enough to the case in which A is a convex region, that the integral in question, 
where Z is a given Lipschitzian function, has this property for any z absolutely 
continuous in the sense of Tonelli. 23 


Capetown, South Africa. 


References 

[1] Banach, S. Théorie des opérations linéaires, Monografie Matematyczne, I (Warszawa 1932).

[2] Haar, A. Über die Variation der Doppelintegrale, Journal für die reine und angewandte Mathematik, 146 (1919), pp. 1-18.

[3] Haar, A. Über reguläre Variationsprobleme, Acta Litterarum ac Scientiarum (Szeged), 3 (1927), pp. 224-234.

[4] Haar, A. Über das Plateausche Problem, Mathematische Annalen, 97 (1927), pp. 124-158.

[5] Haar, A. Über adjungierte Variationsprobleme und adjungierte Extremalflächen, Mathematische Annalen, 100 (1928), pp. 481-502.

[6] Hardy, G. H., Littlewood, J. E., and Pólya, G. Inequalities (Cambridge 1934).

[7] Lusin, N. Leçons sur les ensembles analytiques et leurs applications, Collection de monographies Borel (Paris 1930).

[8] Saks, S. Theory of the integral, Monografie Matematyczne, VII (Warszawa 1937).

[9] Schauder, J. Über die Umkehrung eines Satzes aus der Variationsrechnung, Acta Litterarum ac Scientiarum (Szeged), 4 (1928), pp. 28-50.

[10] Steiner, J. Gesammelte Werke, Band II.

[11] Young, L. C. Necessary conditions in the calculus of variations, Acta Mathematica, 69 (1938), pp. 239-258.

[12] Young, L. C. Generalized surfaces in the calculus of variations, I. Generalized Lipschitzian surfaces, Annals of Mathematics, 43 (1942), pp. 84-103.

[13] Young, W. H. On a formula for an area, Proceedings of the London Mathematical Society, (2) 18 (1919), pp. 339-374.

[14] Young, W. H. On a new set of conditions for a formula for an area, Proceedings of the London Mathematical Society, (2) 21 (1923), pp. 75-94.

23 Young, W. H. [13], [14]. The case in which the derivative in one variable of one of the
functions Z, z and the derivative in the other variable of the other function have associated
summabilities is also treated.



Annals of Mathematics 
Vol. 43, No. 3, July, 1942 


THE GEOMETRY OF ISOTROPIC SURFACES 

By Shiing-shen Chern
(Received May 23, 1941; revised October 10, 1941) 

Introduction 

The geometry in a space in which there is given a family of varieties has
been the object of extensive researches. The geometry of paths 1 corresponds
to the case of a (2n - 2)-parameter family of curves in an n-dimensional space,
defined by a system of differential equations of the second order of a certain
type. It may be regarded as a natural generalization of projective geometry,
since Cartan has proved that of all the projective connections having the same
geodesics there exists one, and only one, which is characterized by intrinsic
properties and which he called normal. 2 Recently, M. Hachtroudi 3 proved
that in a three-dimensional space with a three-parameter family of surfaces
there can be defined, in an intrinsic way, one and only one projective
connection with the contact elements as the elements of the space. It is the aim
of the present paper to study the geometry of a space in which there is given
a two-parameter family of surfaces. Our main results may be summarized
in the following two theorems:

Theorem 1. Suppose there be given in a three-dimensional space a two-parameter
family of surfaces $\{\Sigma\}$ such that the tangent planes at a point of the surfaces
of the family through the point do not pass through a fixed direction. Then there
can be defined in the space one and only one four-dimensional Weyl geometry, which
has the contact elements of the surfaces as its “points” and possesses certain
intrinsic properties.

Theorem 2. There exists a point transformation carrying one family of surfaces
$\{\Sigma\}$ into another $\{\Sigma\}$ when and only when the two four-dimensional Weyl
geometries are equivalent.

The paper is divided into four sections. In §1 we give the definition of a
Weyl space of elements 4 and of some of its fundamental notions. The
determination of the Weyl space from the family of surfaces is then given in §2.
In §3 we show that the so-defined Weyl geometry naturally leads to a solution
of the problem of equivalence, i.e., the problem of deciding when two such
families are equivalent under point transformations. The important particular

1 Cf., for instance, O. Veblen and T. Y. Thomas, The geometry of paths, Trans. Amer.
Math. Soc., vol. 25 (1923), pp. 551-608.

2 E. Cartan, Sur les variétés à connexion projective, Bull. de la Soc. Math. de France, t. 52
(1924), pp. 205-241.

3 M. Hachtroudi, Les espaces d'éléments à connexion projective normale, Actualités
Scientifiques et Industrielles, no. 565, Paris, 1937.

4 Cf. E. Cartan, La méthode du repère mobile, la théorie des groupes continus, et les espaces
généralisés. Actualités Scientifiques et Industrielles, Paris, 1935.






case in which the Weyl space reduces to a three-dimensional point space is
discussed in §4.

The extension of these results to a space of n dimensions will be treated in
a subsequent paper.


1. Weyl Space of Elements 

Consider a three-dimensional space S with the coordinates x, y, z. To each
point M of the space we attach three vectors $I_1$, $I_2$, $I_3$, such that their scalar
products satisfy the conditions

(1) $$I_1 I_3 = -I_2^2 \ne 0, \qquad I_1^2 = I_3^2 = I_1 I_2 = I_2 I_3 = 0.$$

The totality of the point M and the vectors $I_1, I_2, I_3$ through M satisfying
(1) will be called a frame. We say that the space is a Weyl space or that a
Weyl connection is defined in the space, if a law of infinitesimal displacement
of the following form is given:

$$\begin{aligned}
dM &= \omega_1 I_1 + \omega_2 I_2 + \omega_3 I_3,\\
dI_1 &= \omega_{11} I_1 + \omega_{12} I_2 + \omega_{13} I_3,\\
dI_2 &= \omega_{21} I_1 + \omega_{22} I_2 + \omega_{23} I_3,\\
dI_3 &= \omega_{31} I_1 + \omega_{32} I_2 + \omega_{33} I_3,
\end{aligned}$$

where the $\omega$'s are Pfaffian forms in x, y, z. On differentiating (1) and making
use of the above equations, we find

$$\omega_{13} = 0, \quad \omega_{31} = 0, \quad \omega_{11} + \omega_{33} - 2\omega_{22} = 0, \quad \omega_{12} - \omega_{23} = 0, \quad \omega_{21} - \omega_{32} = 0.$$
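The five relations can be recovered term by term. The following is only a sketch of that computation, assuming the conditions (1) are read as $I_1 I_3 = -I_2^2 \neq 0$, $I_1^2 = I_3^2 = I_1 I_2 = I_2 I_3 = 0$ and that the displacement preserves them; each scalar product is differentiated and the displacement law is substituted:

```latex
\begin{aligned}
d(I_1^2) &= 2\, I_1\, dI_1 = 2\,\omega_{13}\,(I_1 I_3) = 0
  &&\Rightarrow\ \omega_{13} = 0,\\
d(I_3^2) &= 2\, I_3\, dI_3 = 2\,\omega_{31}\,(I_1 I_3) = 0
  &&\Rightarrow\ \omega_{31} = 0,\\
d(I_1 I_2) &= \omega_{12}\, I_2^2 + \omega_{23}\,(I_1 I_3)
  = (\omega_{12} - \omega_{23})\, I_2^2 = 0
  &&\Rightarrow\ \omega_{12} - \omega_{23} = 0,\\
d(I_2 I_3) &= \omega_{21}\,(I_1 I_3) + \omega_{32}\, I_2^2
  = (\omega_{32} - \omega_{21})\, I_2^2 = 0
  &&\Rightarrow\ \omega_{21} - \omega_{32} = 0,\\
d(I_1 I_3 + I_2^2) &= (\omega_{11} + \omega_{33})\,(I_1 I_3) + 2\,\omega_{22}\, I_2^2
  = (\omega_{11} + \omega_{33} - 2\omega_{22})\,(I_1 I_3) = 0
  &&\Rightarrow\ \omega_{11} + \omega_{33} - 2\omega_{22} = 0.
\end{aligned}
```

In the last line the terms in $I_1^2$ and $I_3^2$ drop out because those products vanish, and $I_2^2$ has been replaced by $-I_1 I_3$.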


Hence it turns out to be more convenient to put

$$\tau = \omega_{12}, \quad \pi_1 = \omega_{11}, \quad \pi_2 = \omega_{22}, \quad \pi_3 = \omega_{21},$$

and to write the equations of infinitesimal displacement in the form:

$$\begin{aligned}
dM &= \omega_1 I_1 + \omega_2 I_2 + \omega_3 I_3,\\
dI_1 &= \pi_1 I_1 + \tau I_2,\\
dI_2 &= \pi_3 I_1 + \pi_2 I_2 + \tau I_3,\\
dI_3 &= \pi_3 I_2 + (-\pi_1 + 2\pi_2) I_3.
\end{aligned} \tag{2}$$


To serve our later purpose we shall generalize the notion of a Weyl space to
that of a Weyl space of elements, by which we shall mean the following: We attach
to each point of the space a one-parameter family of contact elements with



GEOMETRY OF ISOTROPIC SURFACES 




the point as origin. A contact element is then defined by four coordinates
x, y, z, t, where x, y, z are the coordinates of the point and t fixes the plane.
By attaching a frame to each contact element and supposing the $\omega_1, \omega_2, \omega_3,
\tau, \pi_1, \pi_2, \pi_3$ in (2) to be Pfaffian forms in x, y, z, t, we shall get a Weyl space of
elements. To define a Weyl space of elements it is thus necessary to give the
one-parameter family of contact elements through each point and the seven
Pfaffian forms in (2).

It will be advantageous to interpret the four-parameter family of contact
elements to be the “points” of a four-dimensional auxiliary space S'. Given
a curve in S' defined by the equations

$$x = x(\lambda), \quad y = y(\lambda), \quad z = z(\lambda), \quad t = t(\lambda), \tag{3}$$

we can express the Pfaffian forms in (2) as Pfaffian forms in $\lambda$. Then equations
(2) become a system of ordinary differential equations. By a well-known
theorem in the theory of differential equations, we see that when the initial
frame $M^{(0)} I_1^{(0)} I_2^{(0)} I_3^{(0)}$ corresponding to an initial value $\lambda = \lambda_0$ is given, the
family of frames $M I_1 I_2 I_3$ satisfying (2) is uniquely determined (with respect to
$M^{(0)} I_1^{(0)} I_2^{(0)} I_3^{(0)}$). In this case, we say that the family of frames is developed,
along the curve (3), in the Euclidean space defined by the frame $M^{(0)} I_1^{(0)} I_2^{(0)} I_3^{(0)}$.
If we join a point $P_0$ of S' to a point P of S' by two different curves and develop,
in the Euclidean space of the frame attached to $P_0$, the two families of frames
along these two curves, the frames obtained corresponding to the frame at P
will in general be different.

Instead of studying the development of the frame at P on the frame at $P_0$
along two different curves, we shall consider an infinitesimal parallelogram
with $P_0$ as a vertex and with two directions d and $\delta$ through $P_0$ as its two sides.
Such a parallelogram is called a cycle. If we develop the family of frames first
along the side d and then along $\delta$ or vice versa, the frames obtained will be
different, so that there exists an infinitesimal transformation carrying one frame
to the other. This infinitesimal transformation is called the infinitesimal
transformation associated to the cycle. By a classical method,⁵ it is easy to show
that the infinitesimal transformation associated to a cycle formed by the directions
d and $\delta$ is given by equations of the form

$$\begin{aligned}
\nabla M &= d\delta M - \delta dM = \Omega_1 I_1 + \Omega_2 I_2 + \Omega_3 I_3,\\
\nabla I_1 &= d\delta I_1 - \delta dI_1 = \Pi_1 I_1 + T I_2,\\
\nabla I_2 &= d\delta I_2 - \delta dI_2 = \Pi_3 I_1 + \Pi_2 I_2 + T I_3,\\
\nabla I_3 &= d\delta I_3 - \delta dI_3 = \Pi_3 I_2 + (-\Pi_1 + 2\Pi_2) I_3,
\end{aligned} \tag{4}$$

5 Cf., for example, E. Cartan, Leçons sur la géométrie des espaces de Riemann, Paris,
1928, Chap. VII.





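where, with the notations of Cartan's exterior calculus,⁶ we have

$$\begin{aligned}
\Omega_1 &= \omega_1' + [\pi_1\omega_1] + [\pi_3\omega_2],\\
\Omega_2 &= \omega_2' + [\tau\omega_1] + [\pi_2\omega_2] + [\pi_3\omega_3],\\
\Omega_3 &= \omega_3' + [\tau\omega_2] - [\pi_1\omega_3] + 2[\pi_2\omega_3],\\
T &= \tau' - [\pi_1\tau] + [\pi_2\tau],\\
\Pi_1 &= \pi_1' + [\pi_3\tau],\\
\Pi_2 &= \pi_2',\\
\Pi_3 &= \pi_3' + [\pi_1\pi_3] - [\pi_2\pi_3],
\end{aligned} \tag{5}$$

so that $T$, $\Omega_i$, $\Pi_i$ (i = 1, 2, 3) are exterior quadratic differential forms in x, y,
z, t.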

Suppose the frame attached to each contact element be so chosen that M
coincides with the point of the element and $I_1, I_2$ belong to the plane of the
element. Then the contact element will be fixed, if and only if

$$\omega_1 = \omega_2 = \omega_3 = \tau = 0.$$

It follows that $\omega_1, \omega_2, \omega_3, \tau$ are linear combinations of dx, dy, dz, dt and are
themselves linearly independent. We can therefore put

$$\begin{aligned}
\Omega_i &= P_{i1}[\omega_2\omega_3] + P_{i2}[\omega_3\omega_1] + P_{i3}[\omega_1\omega_2] + \sum_{k=1}^{3} Q_{ik}[\omega_k\tau], \quad i = 1, 2, 3,\\
\Pi_i &= R_{i1}[\omega_2\omega_3] + R_{i2}[\omega_3\omega_1] + R_{i3}[\omega_1\omega_2] + \sum_{k=1}^{3} S_{ik}[\omega_k\tau], \quad i = 1, 2, 3,\\
T &= P_1[\omega_2\omega_3] + P_2[\omega_3\omega_1] + P_3[\omega_1\omega_2] + \sum_{k=1}^{3} Q_k[\omega_k\tau].
\end{aligned} \tag{6}$$

The 42 coefficients in (6) are the components of the “tensor of curvature and
torsion” of the space.

The Weyl space of elements as defined above may be studied from two different
points of view. We may either regard the space as a genuine four-dimensional
space (x, y, z, t) and study its properties invariant under the group
(or, more precisely, the pseudo-group) of transformations

$$\bar x = \bar x(x, y, z, t), \quad \bar y = \bar y(x, y, z, t), \quad \bar z = \bar z(x, y, z, t), \quad \bar t = \bar t(x, y, z, t), \tag{7}$$

or we may study the properties invariant under the group of transformations

$$\bar x = \bar x(x, y, z), \quad \bar y = \bar y(x, y, z), \quad \bar z = \bar z(x, y, z), \quad \bar t = \bar t(x, y, z, t). \tag{8}$$

6 An introduction to the notions of exterior multiplication and exterior differentiation
is given in: E. Cartan, Leçons sur les invariants intégraux, Paris, 1922.





For our purpose we shall restrict ourselves to the latter point of view and shall
call a property intrinsic, if it is invariant under transformations of the form
(8). In particular, it follows that the concept of a point in S is an intrinsic
property.

2. The Two-Parameter Family of Surfaces and Its Intrinsic Weyl Geometry 

Suppose there be given in the space S a family $\{\Sigma\}$ of surfaces depending on
two parameters. We shall study the properties of $\{\Sigma\}$, which are not affected
by an arbitrary transformation of coordinates

$$\bar x = \bar x(x, y, z), \quad \bar y = \bar y(x, y, z), \quad \bar z = \bar z(x, y, z). \tag{9}$$

To make this study it will be convenient to introduce the four-dimensional
auxiliary space S' formed by the contact elements of the surfaces of the family
$\{\Sigma\}$. By the use of a parameter t the contact elements of $\{\Sigma\}$ having the same
origin (x, y, z) can be defined by an equation of the form

$$\theta \equiv A(x, y, z, t)\,dx + B(x, y, z, t)\,dy + C(x, y, z, t)\,dz = 0, \tag{10}$$

where A, B, C are not all zero. The parameter t serves to distinguish the contact
elements through the point and is arbitrary to the degree that it may be
submitted to a transformation of the form

$$\bar t = \bar t(x, y, z, t). \tag{11}$$

In the auxiliary space S' of our contact elements with the coordinates x, y,
z, t, each surface $\Sigma$ of the family is represented by a two-dimensional variety.
All these two-dimensional varieties fill up the space S' in the sense that through
every point (in a certain neighborhood) of S' there passes one, and only one,
such variety. The family $\{\Sigma\}$ of surfaces can therefore be defined by a completely
integrable Pfaffian system of the form

$$\theta = 0, \qquad \theta_1 \equiv A_1\,dx + B_1\,dy + C_1\,dz + E\,dt = 0, \qquad E \neq 0, \tag{12}$$

in the four variables x, y, z, t. The complete integrability of the system is
expressed by the fact that the exterior derivatives $\theta'$, $\theta_1'$ are congruent to zero,
mod. $\theta$, $\theta_1$.

By this representation of the family $\{\Sigma\}$ of surfaces in S as a family of surfaces
in S' the properties of the original family unaffected by transformations of the
form (9) are represented by properties of the corresponding family in S', which
remain unaltered under the transformations (9), (11), and conversely. But
these are exactly the intrinsic properties in S' in the sense defined in §1.

Suppose, without loss of generality, that $C \neq 0$, so that we may simplify the
system (12) to the form

$$\theta \equiv dz - p\,dx - q\,dy = 0, \qquad \theta_1 \equiv dt - r\,dx - s\,dy = 0, \tag{13}$$





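where p, q, r, s are functions of x, y, z, t. We introduce the notation

$$\frac{d}{dx} = \frac{\partial}{\partial x} + p\,\frac{\partial}{\partial z} + r\,\frac{\partial}{\partial t}, \qquad
\frac{d}{dy} = \frac{\partial}{\partial y} + q\,\frac{\partial}{\partial z} + s\,\frac{\partial}{\partial t}. \tag{14}$$

With this notation we have, for any function F in the variables x, y, z, t, the
formula

$$dF = \frac{dF}{dx}\,dx + \frac{dF}{dy}\,dy + \frac{\partial F}{\partial z}\,\theta + \frac{\partial F}{\partial t}\,\theta_1, \tag{15}$$

and the integrability conditions of the system (13) can be written in the following
simple form:

$$\frac{dp}{dy} = \frac{dq}{dx}, \qquad \frac{dr}{dy} = \frac{ds}{dx}. \tag{16}$$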

We now proceed to prove that when a two-parameter family of surfaces (13)
is given in S such that

$$P \equiv p_t q_{tt} - q_t p_{tt} \neq 0, \tag{17}$$

there can be defined, in an intrinsic way, a Weyl geometry of elements in S.
As it is easy to see, the vanishing of P signifies that the contact elements through
a point of S pass through a fixed direction.

To each contact element formed by the point (x, y, z) and the plane $\theta = 0$
we attach a frame $M I_1 I_2 I_3$ adapted to it by supposing the following conditions
to be satisfied:

a) M coincides with the point (x, y, z).

b) $I_1$ is along the line of intersection of the plane and a neighboring plane
through M, i.e., the line defined by the equations

$$\theta = 0, \qquad p_t\,dx + q_t\,dy = 0. \tag{18}$$

We shall call this line the characteristic of the element.

c) $I_2$ lies on the plane $\theta = 0$.

If $M I_1 I_2 I_3$ is such a frame, the most general frame $M^* I_1^* I_2^* I_3^*$ satisfying the
same conditions is given by

$$M^* = M, \qquad I_1 = u I_1^*, \qquad I_2 = w I_2^* + \frac{uv}{w} I_1^*, \qquad
I_3 = \frac{w^2}{u} I_3^* + v I_2^* + \frac{1}{2}\frac{uv^2}{w^2} I_1^*, \tag{19}$$

where u, v, w are parameters, with $uw \neq 0$.

We now suppose our Weyl geometry to have the following two intrinsic 
properties: 

1) Points are developed into points, i.e., the contact elements having the 
same origin are developed into the contact elements having the same origin. 

2) The contact elements belonging to a fixed surface of the family are developed
into contact elements having the same plane.





Let us express the conditions for these two properties analytically and see
how far the Weyl geometry is thus defined. When the point M is fixed in
space, we have

$$dx = dy = dz = 0.$$

Hence condition 1 is expressed by the relation

$$dM \equiv 0 \pmod{dx,\ dy,\ dz},$$

i.e., $\omega_1, \omega_2, \omega_3$ are linear combinations of dx, dy, $\theta$. Similarly, condition 2 is
expressed by the fact that $\omega_3$, $\tau$ are linear combinations of $\theta$ and $\theta_1$. It follows
in particular that $\omega_3$ is a multiple of $\theta$. On the other hand, the condition b
for the choice of the frame means that $\omega_2$ is a linear combination of $\theta$ and
$p_t\,dx + q_t\,dy$.

With a definite system of coordinates x, y, z, t we may make use of the parameters
u, v, w in (19) to simplify the expressions for $\omega_1, \omega_2, \omega_3$. It is easy to
verify that we may choose the frame attached to each contact element such
that we have

$$\omega_2 = -(p_t\,dx + q_t\,dy) + \eta\,\theta, \qquad \omega_3 = \theta, \tag{20}$$

where $\eta$ remains undetermined. The most general change of frame leaving the
conditions a, b, c and the form of the equations (20) unaltered is given by

$$I_1 = I_1^*, \qquad I_2 = I_2^* + v I_1^*, \qquad I_3 = I_3^* + v I_2^* + \tfrac{1}{2} v^2 I_1^*, \tag{21}$$

v being a parameter.
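That a change of frame of the form (21) keeps the scalar-product conditions (1) intact can be checked mechanically. The sketch below is only an illustration under stated assumptions: the starred frame is taken as a basis with $I_1^* I_3^* = -1$, $I_2^{*2} = 1$ and all other products zero, (21) is read as $I_1 = I_1^*$, $I_2 = I_2^* + vI_1^*$, $I_3 = I_3^* + vI_2^* + \tfrac12 v^2 I_1^*$, and the helper names are hypothetical.

```python
from fractions import Fraction

# Scalar products of the starred frame (I1*, I2*, I3*), chosen to satisfy (1):
# I1*.I3* = -1, I2*.I2* = 1, every other product zero. (Assumed normalisation.)
GRAM = [[0, 0, -1],
        [0, 1, 0],
        [-1, 0, 0]]


def dot(a, b):
    """Scalar product of two vectors given by components in the starred frame."""
    return sum(a[i] * GRAM[i][j] * b[j] for i in range(3) for j in range(3))


def change_of_frame(v):
    """Frame (21), written in components relative to (I1*, I2*, I3*)."""
    i1 = (Fraction(1), Fraction(0), Fraction(0))
    i2 = (v, Fraction(1), Fraction(0))
    i3 = (v * v / 2, v, Fraction(1))
    return i1, i2, i3


def satisfies_conditions_1(frame):
    """Check I1.I3 = -I2.I2 != 0 and I1^2 = I3^2 = I1.I2 = I2.I3 = 0."""
    i1, i2, i3 = frame
    return (dot(i1, i3) == -dot(i2, i2) != 0
            and dot(i1, i1) == dot(i3, i3) == dot(i1, i2) == dot(i2, i3) == 0)


frame = change_of_frame(Fraction(3, 7))
print(satisfies_conditions_1(frame))
```

Dropping the term $\tfrac12 v^2 I_1^*$ from $I_3$ makes the check fail, since $I_3^2$ then equals $v^2$; this is why the quadratic term appears in (21).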

In order to impose further conditions on the Weyl space, we are led to consider
two particular kinds of cycles. The first kind is one whose directions
d and $\delta$ displace the point of the contact element in the same direction and is
analytically characterized by the conditions

$$\frac{\omega_1(d)}{\omega_1(\delta)} = \frac{\omega_2(d)}{\omega_2(\delta)} = \frac{\omega_3(d)}{\omega_3(\delta)},$$

or by

$$[\omega_1\omega_2] = [\omega_2\omega_3] = [\omega_3\omega_1] = 0. \tag{22}$$

The second may be described as one displacing the surface of the contact element
in the same direction and will be defined by

$$\frac{\omega_3(d)}{\omega_3(\delta)} = \frac{\tau(d)}{\tau(\delta)},$$

or by

$$[\omega_3\tau] = 0. \tag{23}$$

The property of a cycle to be either one of the two kinds is intrinsic.

We now suppose our Weyl space to possess the following three further properties:

3) The infinitesimal transformation associated to every cycle displaces the
point M in the direction of $I_1$.





4) The infinitesimal transformation associated to every cycle whose directions
d and $\delta$ displace the surface of the contact element in the same direction leaves
the point M invariant.

5) The infinitesimal transformation associated to every cycle whose directions
d and $\delta$ displace the point of the contact element in the same direction
leaves the plane $M I_1 I_2$ invariant.

We shall prove that the conditions 1 to 5 completely define the Weyl space. 

To prove this, it is sufficient to determine the seven Pfaffian forms in (2).
In the meantime we can still normalize the frame (with respect to the coordinates
x, y, z, t) by making use of the transformation (21). Before doing this, we
remark that the property 3 is expressed analytically by the fact that $\nabla M$ is
equal to a multiple of $I_1$, i.e., by the equations

$$\begin{aligned}
\omega_2' + [\tau\omega_1] + [\pi_2\omega_2] + [\pi_3\omega_3] &= 0,\\
\omega_3' + [\tau\omega_2] - [\pi_1\omega_3] + 2[\pi_2\omega_3] &= 0.
\end{aligned} \tag{24}$$

When these relations are satisfied, the property 4 is characterized by the condition
that $\Omega_1 = 0$ when (23) holds, i.e., by

$$\omega_1' + [\pi_1\omega_1] + [\pi_3\omega_2] + a[\tau\omega_3] = 0, \tag{25}$$

where a remains undetermined. Similarly, the property 5 is given by an
equation of the form

$$\tau' - [\pi_1\tau] + [\pi_2\tau] - b[\omega_1\omega_3] - c[\omega_2\omega_3] = 0. \tag{26}$$

From the second equation of (24) we find, by making use of (20),

$$[(\tau + \theta_1)(p_t\,dx + q_t\,dy)] \equiv 0, \pmod{\theta}.$$

Since $\tau$ is a linear combination of $\theta$ and $\theta_1$, it follows that $\tau$ is of the form

$$\tau = -\theta_1 + \zeta\theta, \tag{27}$$

where $\zeta$ is undetermined. We calculate next the first equation of (24), mod
$\omega_2, \omega_3$. We find then

$$[(\omega_1 + p_{tt}\,dx + q_{tt}\,dy)\,\theta_1] \equiv 0, \pmod{\omega_2,\ \omega_3}.$$

But $\omega_1$ is a linear combination of dx, dy, $\theta$ only, so that it must be of the form

$$\omega_1 = -(p_{tt}\,dx + q_{tt}\,dy) + \sigma(p_t\,dx + q_t\,dy) + \xi\theta.$$

By applying the change of frame (21), we find that $\omega_1$ is then changed into
$\omega_1 + v\omega_2 + \tfrac{1}{2}v^2\theta$. We may therefore assume the frame so chosen that

$$\omega_1 = -(p_{tt}\,dx + q_{tt}\,dy) + \xi\theta. \tag{28}$$

When $\omega_1, \omega_2, \omega_3$ are of the forms given by (20), (28), the frame attached to
each contact element is uniquely determined.

It now remains to determine the functions $\xi, \eta, \zeta$ and the Pfaffian forms
$\pi_1, \pi_2, \pi_3$ from the conditions (24), (25), (26) and to show that they are thus
completely determined. For this purpose we have to calculate the exterior
derivatives of $\omega_1, \omega_2, \omega_3, \tau$. We remark that the calculation of every such





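exterior derivative is equivalent to the calculation of the exterior derivative
of a Pfaffian form of the form

$$\omega = l\theta - m\,dx - n\,dy, \tag{29}$$

where l, m, n are functions of x, y, z, t. We find, by making use of (20), (27),
(28),

$$\begin{aligned}
P\omega' ={}& -\Big(\frac{dm}{dy} - \frac{dn}{dx}\Big)[\omega_1\omega_2]\\
&+ \Big\{\Big(\frac{dm}{dy} - \frac{dn}{dx}\Big)\eta + (m_t q_t - n_t p_t)\zeta
 + \Big(\frac{dl}{dx} q_t - \frac{dl}{dy} p_t\Big) + (m_z q_t - n_z p_t) + l(p_z q_t - q_z p_t)\Big\}[\omega_1\omega_3]\\
&+ \Big\{-\Big(\frac{dm}{dy} - \frac{dn}{dx}\Big)\xi + (-m_t q_{tt} + n_t p_{tt} - lP)\zeta
 + \Big(-\frac{dl}{dx} q_{tt} + \frac{dl}{dy} p_{tt}\Big)\\
&\qquad + (-m_z q_{tt} + n_z p_{tt}) + l(-p_z q_{tt} + q_z p_{tt})\Big\}[\omega_2\omega_3]\\
&+ (n_t p_t - m_t q_t)[\omega_1\tau] + (lP + m_t q_{tt} - n_t p_{tt})[\omega_2\tau]\\
&+ \{(m_t q_t - n_t p_t)\xi + (-m_t q_{tt} + n_t p_{tt} - lP)\eta + l_t P\}[\omega_3\tau].
\end{aligned} \tag{30}$$

On substituting into (24), (25), (26), for $\omega_1', \omega_2', \omega_3', \tau'$ their expressions in terms
of the exterior products of $\omega_1, \omega_2, \omega_3, \tau$, we shall get relations of the form

$$\begin{aligned}
-[\pi_1\omega_1] - [\pi_3\omega_2] &= \lambda_1[\omega_1\omega_2] + \lambda_2[\omega_1\omega_3] + \lambda_3[\omega_2\omega_3] + \lambda_4[\omega_1\tau] + \lambda_5[\omega_2\tau],\\
-[\pi_2\omega_2] - [\pi_3\omega_3] &= \mu_1[\omega_1\omega_2] + \mu_2[\omega_1\omega_3] + \mu_3[\omega_2\omega_3] + \mu_4[\omega_2\tau] + \mu_5[\omega_3\tau],\\
[\pi_1\omega_3] - 2[\pi_2\omega_3] &= \nu_1[\omega_1\omega_3] + \nu_2[\omega_2\omega_3] + \nu_3[\omega_3\tau],\\
[\pi_1\tau] - [\pi_2\tau] &= \rho_1[\omega_1\tau] + \rho_2[\omega_2\tau] + \rho_3[\omega_3\tau],
\end{aligned} \tag{31}$$

where we have, in particular,

$$\begin{aligned}
\lambda_1 &= \frac{1}{P}\Big(\frac{dq_{tt}}{dx} - \frac{dp_{tt}}{dy}\Big), \qquad
\lambda_4 = \frac{1}{P}(p_t q_{ttt} - q_t p_{ttt}), \qquad
\lambda_5 = \xi - \frac{1}{P}(p_{tt} q_{ttt} - q_{tt} p_{ttt}),\\
\mu_2 &= -\frac{1}{P}\Big(p_t\frac{d\eta}{dy} - q_t\frac{d\eta}{dx}\Big)
 + \frac{\eta}{P}\Big(\frac{dp_t}{dy} - \frac{dq_t}{dx} + p_z q_t - q_z p_t\Big) - \zeta
 + \frac{1}{P}(-p_t q_{tz} + q_t p_{tz}),\\
\nu_1 &= \frac{1}{P}(p_z q_t - q_z p_t), \qquad
\nu_2 = \frac{1}{P}(-p_z q_{tt} + q_z p_{tt}) - \zeta, \qquad
\nu_3 = -\eta,\\
\rho_1 &= \frac{1}{P}(r_t q_t - s_t p_t), \qquad
\rho_2 = \zeta + \frac{1}{P}(-r_t q_{tt} + s_t p_{tt}).
\end{aligned} \tag{32}$$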




In order that the system of equations (31) have a set of solutions for $\pi_1, \pi_2, \pi_3$
it is necessary and sufficient that the following relations hold:

$$\begin{aligned}
\lambda_1 - \mu_2 + \nu_2 - 2\rho_2 &= 0, & \lambda_4 + \nu_3 - 2\mu_4 &= 0,\\
\lambda_5 - \mu_5 &= 0, & \mu_1 - \nu_1 + \rho_1 &= 0.
\end{aligned} \tag{33}$$

The set of solutions is then given by

$$\begin{aligned}
\pi_1 &= (-\nu_1 + 2\rho_1)\omega_1 + (-\nu_2 + 2\rho_2)\omega_2 + \lambda_2\omega_3 + (-\nu_3 + 2\mu_4)\tau,\\
\pi_2 &= (-\nu_1 + \rho_1)\omega_1 + (-\nu_2 + \rho_2)\omega_2 + (\lambda_2 - \rho_3)\omega_3 + \mu_4\tau,\\
\pi_3 &= -\mu_2\omega_1 + (\lambda_2 - \mu_3 - \rho_3)\omega_2 + \lambda_3\omega_3 + \mu_5\tau,
\end{aligned} \tag{34}$$

and is uniquely determined. Of the relations (33) the last one is identically
satisfied, while the other three equations determine $\xi, \eta, \zeta$. The Weyl connection
is thus completely determined.
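The solvability statement is pure linear algebra in the coefficients of $\pi_1, \pi_2, \pi_3$ on the basis $\omega_1, \omega_2, \omega_3, \tau$, so it can be spot-checked numerically. The sketch below assumes the readings of (31), (33), (34) quoted in its comments; the function names and the sample values are hypothetical. It imposes the four relations (33) on otherwise arbitrary coefficients and confirms that the forms (34) then satisfy all four equations (31).

```python
from itertools import combinations

# A 1-form is a coefficient tuple over the basis (w1, w2, w3, tau).
W1, W2, W3, TAU = (1, 0, 0, 0), (0, 1, 0, 0), (0, 0, 1, 0), (0, 0, 0, 1)
PAIRS = list(combinations(range(4), 2))  # (0, 1) stands for [w1 w2], etc.


def wedge(a, b):
    """Exterior product [a b] as coefficients on the six basis 2-forms."""
    return {(i, j): a[i] * b[j] - a[j] * b[i] for i, j in PAIRS}


def combine(*terms):
    """Linear combination of 2-forms; each term is a (scalar, two_form) pair."""
    return {p: sum(c * f[p] for c, f in terms) for p in PAIRS}


def two_form(w1w2=0, w1w3=0, w2w3=0, w1t=0, w2t=0, w3t=0):
    """Build a 2-form from its six coefficients."""
    return {(0, 1): w1w2, (0, 2): w1w3, (1, 2): w2w3,
            (0, 3): w1t, (1, 3): w2t, (2, 3): w3t}


def system_31_holds(lam, mu, nu, rho):
    """True if pi1, pi2, pi3 taken from (34) satisfy the four equations (31)."""
    pi1 = (-nu[1] + 2 * rho[1], -nu[2] + 2 * rho[2], lam[2], -nu[3] + 2 * mu[4])
    pi2 = (-nu[1] + rho[1], -nu[2] + rho[2], lam[2] - rho[3], mu[4])
    pi3 = (-mu[2], lam[2] - mu[3] - rho[3], lam[3], mu[5])
    lhs = [
        combine((-1, wedge(pi1, W1)), (-1, wedge(pi3, W2))),
        combine((-1, wedge(pi2, W2)), (-1, wedge(pi3, W3))),
        combine((1, wedge(pi1, W3)), (-2, wedge(pi2, W3))),
        combine((1, wedge(pi1, TAU)), (-1, wedge(pi2, TAU))),
    ]
    rhs = [
        two_form(w1w2=lam[1], w1w3=lam[2], w2w3=lam[3], w1t=lam[4], w2t=lam[5]),
        two_form(w1w2=mu[1], w1w3=mu[2], w2w3=mu[3], w2t=mu[4], w3t=mu[5]),
        two_form(w1w3=nu[1], w2w3=nu[2], w3t=nu[3]),
        two_form(w1t=rho[1], w2t=rho[2], w3t=rho[3]),
    ]
    return lhs == rhs


# Arbitrary sample values for the coefficients that (33) leaves free.
rho = {1: 4, 2: 1, 3: -5}
nu = {1: 2, 2: -3}
mu = {2: 7, 3: -4, 4: 1, 5: 6}
lam = {2: 3, 3: -2, 4: 5}
# Impose the four relations (33).
lam[1] = mu[2] - nu[2] + 2 * rho[2]
nu[3] = 2 * mu[4] - lam[4]
lam[5] = mu[5]
mu[1] = nu[1] - rho[1]

print(system_31_holds(lam, mu, nu, rho))
```

Perturbing any one of the four constrained coefficients, for instance adding 1 to `lam[1]`, makes the check fail, which illustrates the necessity half of the statement for this sample.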

On summing up our results, we get the theorem: 

Suppose that there is given in the space (x, y, z) a two-parameter family of surfaces
such that the tangent planes to the surfaces through a point do not pass through
a fixed direction. Then there can be defined in the space, in an intrinsic way,
one and only one Weyl connection of elements having the following properties:

1. Points are developed into points.

2. Surfaces of the family are developed into planes.

3. The infinitesimal transformation associated to every cycle displaces the
point of the element in the direction of the characteristic of the element.

4. The infinitesimal transformation associated to every cycle by which the directions
d and $\delta$ displace the surface of the element in the same “direction” leaves
the element invariant.

5. The infinitesimal transformation associated to every cycle by which the directions
d and $\delta$ displace the origin of the element in the same direction leaves the plane
of the element invariant.


3. The Problem of Equivalence 

In the study of the geometry of a two-parameter family of surfaces in space
under the group of transformations (9) the fundamental problem is the following:
Given a two-parameter family $\{\Sigma\}$ of surfaces in S and another such family
$\{\bar\Sigma\}$ in a space $\bar S$ with the coordinates $\bar x, \bar y, \bar z$, when are they equivalent, i.e.,
when can the one be carried into the other by a transformation (9)? We shall
show in this section that the solution of this problem follows as a consequence
of the theorem of the last section, i.e., the possibility of defining an intrinsic
Weyl connection in the space.

Before discussing this so-called “problem of equivalence,” we shall give some
further properties of the Weyl geometry defined in the last section. Our choice
of the frames in §2 adapted to each contact element made use of the coordinates
of the space. If we suppose only that the point M of the frame $M I_1 I_2 I_3$ coincides
with the point of the contact element and that the plane $M I_1 I_2$ coincides with the
plane of the element, the frame attached to each contact element will depend
on three parameters and the relation between two frames attached to the same
element will be given by (19). By taking the parameters u, v, w in (19) as
independent variables, we get a family of frames depending on the seven
parameters x, y, z, t, u, v, w. Then the seven Pfaffian forms $\omega_1, \omega_2, \omega_3, \tau,
\pi_1, \pi_2, \pi_3$ are linearly independent and it is easy to verify that they satisfy
the system of equations (5), (6), which are called the equations of structure of the
Weyl space. For this general family of frames the conditions

$$\Omega_1 = a[\omega_3\tau], \qquad \Omega_2 = \Omega_3 = 0, \qquad T = b[\omega_1\omega_3] + c[\omega_2\omega_3] \tag{35}$$

are still satisfied, because they have an intrinsic geometric meaning.

On account of these forms of the expressions for $\Omega_1, \Omega_2, \Omega_3, T$ the expressions
for $\Pi_1, \Pi_2, \Pi_3$ can be simplified. In fact, by applying to the first four equations
of (5) the theorem that the exterior derivative of the exterior derivative
of a Pfaffian form is zero (the so-called “Poincaré's Theorem”) and noticing
that the resulting equations are identically satisfied if the expressions in (6) are
zero, we get

$$\begin{aligned}
\Omega_1' &= [\Pi_1\omega_1] + [\Pi_3\omega_2] - [\Omega_1\pi_1] - [\Omega_2\pi_3],\\
\Omega_2' &= [T\omega_1] + [\Pi_2\omega_2] + [\Pi_3\omega_3] - [\Omega_1\tau] - [\Omega_2\pi_2] - [\Omega_3\pi_3],\\
\Omega_3' &= [T\omega_2] - [\Pi_1\omega_3] + 2[\Pi_2\omega_3] - [\Omega_2\tau] + [\Omega_3\pi_1] - 2[\Omega_3\pi_2],\\
T' &= [T\pi_1] - [T\pi_2] - [\Pi_1\tau] + [\Pi_2\tau].
\end{aligned} \tag{36}$$

These are some of the “Bianchi Identities” of the Weyl space. When the
conditions (35) are satisfied, these equations can be simplified to the form

$$\begin{aligned}
&[\Pi_1\omega_1] + [\Pi_3\omega_2] - [(da + 3a\pi_1 - 3a\pi_2)\omega_3\tau] = 0,\\
&[\Pi_2\omega_2] + [\Pi_3\omega_3] + c[\omega_1\omega_2\omega_3] = 0,\\
&[\Pi_1\omega_3] - 2[\Pi_2\omega_3] + b[\omega_1\omega_2\omega_3] = 0,\\
&[\Pi_1\tau] - [\Pi_2\tau] - b[\omega_1\omega_2\tau] + [(b\omega_1 + c\omega_2)'\omega_3] + [(b\omega_1 + c\omega_2)\pi_2\omega_3] = 0.
\end{aligned} \tag{37}$$

From the third equation we see that every term of $\Pi_1 - 2\Pi_2 + b[\omega_1\omega_2]$ must
contain $\omega_3$. On the other hand, we get, by multiplying the fourth equation
by $\omega_3$,

$$[(\Pi_1 - \Pi_2 - b\,\omega_1\omega_2)\,\tau\omega_3] = 0.$$

Similarly, the second and the third equations give respectively

$$[\Pi_2\omega_2\omega_3] = 0, \qquad [(\Pi_1 - 2\Pi_2)\omega_2\omega_3] = 0,$$

from which it follows that

$$[(\Pi_1 - \Pi_2 - b\,\omega_1\omega_2)\,\omega_2\omega_3] = 0.$$







Again, by multiplying the first equation by $\omega_3$, the
second by $\omega_2$, the third by $-\omega_1$, and adding, we get

$$[(\Pi_1 - \Pi_2 - b\,\omega_1\omega_2)\,\omega_1\omega_3] = 0.$$

It follows from the three equations that every term of $\Pi_1 - \Pi_2 - b[\omega_1\omega_2]$ must
contain $\omega_3$. We can therefore put

$$\begin{aligned}
\Pi_1 &= 3b[\omega_1\omega_2] + e[\omega_1\omega_3] + f[\omega_2\omega_3] + g[\omega_3\tau],\\
\Pi_2 &= 2b[\omega_1\omega_2] + h[\omega_1\omega_3] + i[\omega_2\omega_3] + j[\omega_3\tau].
\end{aligned} \tag{38}$$

From (37) it then follows that $\Pi_3$ is of the form

$$\Pi_3 = (h - c)[\omega_1\omega_2] + f[\omega_1\omega_3] + j[\omega_2\tau] + k[\omega_2\omega_3] + l[\omega_3\tau]. \tag{39}$$

Consequently, for our Weyl space, the seven Pfaffian forms $\omega_1, \omega_2, \omega_3, \tau,
\pi_1, \pi_2, \pi_3$ satisfy the equations of structure (5), with $\Omega_1, \Omega_2, \Omega_3, T, \Pi_1, \Pi_2, \Pi_3$
given by (35), (38), (39). In these equations there are introduced 11 quantities
a, b, c, e, f, g, h, i, j, k, l, which are functions of x, y, z, t, u, v, w and which constitute
the tensor of curvature and torsion of the space.

When two such Weyl spaces of elements are given, with the coordinates
(x, y, z, t) and $(\bar x, \bar y, \bar z, \bar t)$, we say that they are equivalent, if to every family of
frames of the one (with one frame attached to each contact element), there is a
corresponding family of frames of the other, such that a transformation of the
form (7) exists, which satisfies the relations

$$\bar\omega_i = \omega_i, \qquad \bar\tau = \tau, \qquad \bar\pi_i = \pi_i, \qquad i = 1, 2, 3. \tag{40}$$

If we attach to each contact element (x, y, z, t) the most general family of
frames depending on three parameters u, v, w and to $(\bar x, \bar y, \bar z, \bar t)$ the family of
frames with the parameters $\bar u, \bar v, \bar w$, it is evident that the two Weyl spaces are
equivalent, when and only when there exists a transformation of the form

$$\begin{aligned}
\bar x &= \bar x(x, y, z, t, u, v, w), & \bar y &= \bar y(x, y, z, t, u, v, w), & \bar z &= \bar z(x, y, z, t, u, v, w),\\
\bar t &= \bar t(x, y, z, t, u, v, w), & \bar u &= \bar u(x, y, z, t, u, v, w), & \bar v &= \bar v(x, y, z, t, u, v, w),\\
\bar w &= \bar w(x, y, z, t, u, v, w),
\end{aligned} \tag{41}$$

satisfying the relations (40).

With these preparations we now establish the theorem:

There exists a point transformation of the form (9) carrying one family of surfaces
$\{\Sigma\}$ into another $\{\bar\Sigma\}$ when and only when the two corresponding Weyl spaces
of elements are equivalent.

The proof of this theorem is immediate. In fact, when the two families of
surfaces are equivalent, the corresponding Weyl spaces are equivalent, since
our definition of the Weyl space is intrinsic. Conversely, when the Weyl spaces
are equivalent, the transformation (41) will realize the equality of the sets of
Pfaffian forms in (40). From the first three equations in (40), we see that in





the transformation (41) the coordinates $\bar x, \bar y, \bar z$ are functions of x, y, z alone, so
that the transformation in question is a point transformation. From

$$\bar\omega_3 = \omega_3, \qquad \bar\tau = \tau$$

we conclude that the transformation carries $\{\Sigma\}$ to $\{\bar\Sigma\}$. Hence the theorem
is proved.

Analytically this result may be formulated as follows: To solve the problem
of equivalence for a two-parameter family of surfaces (whose tangent planes at a
point do not pass through a fixed direction) in a three-dimensional space (x, y, z),
we introduce four auxiliary variables t, u, v, w and seven linearly independent
Pfaffian forms $\omega_i, \tau, \pi_i$ (i = 1, 2, 3) in the coordinates and in these variables.
All these Pfaffian forms are invariant in the sense that the two families of surfaces
$\{\Sigma\}$ and $\{\bar\Sigma\}$ are equivalent, when and only when there exists a transformation of
the form (41) satisfying (40).

Since the process of exterior differentiation is invariant under the point transformations
(41), it follows that the 11 quantities $a, \cdots, l$ in (35), (38), (39)
are invariants. We shall call them the fundamental invariants of the family $\{\Sigma\}$.
From a given invariant F the equation

$$dF = F_1\omega_1 + F_2\omega_2 + F_3\omega_3 + F_0\tau + F_1'\pi_1 + F_2'\pi_2 + F_3'\pi_3 \tag{42}$$

defines the new invariants $F_1, F_2, F_3, F_0, F_1', F_2', F_3'$, called the covariant
derivatives of F. According to a theorem of E. Cartan,⁷ the complete set of
invariants of the family of surfaces consists of the fundamental invariants and their
successive covariant derivatives. This is to be understood as follows: By eliminating
the independent variables x, y, z, t, u, v, w, we may set up relations of
the form

$$\Phi(F_1, \cdots, F_m) = 0 \tag{43}$$

between the invariants $F_1, \cdots, F_m$ of our complete set. Then the two families
of surfaces $\{\Sigma\}$ and $\{\bar\Sigma\}$ are equivalent if and only if the relations (43) and

$$\Phi(\bar F_1, \cdots, \bar F_m) = 0$$

hold at the same time for $F_1, \cdots, F_m$ and for the corresponding invariants
$\bar F_1, \cdots, \bar F_m$ of $\{\bar\Sigma\}$. Moreover, it is also well-known that the equivalence of
$\{\Sigma\}$ and $\{\bar\Sigma\}$ can be decided by only a finite number of such relations (43).

4. The Weyl Space of Points 

An important particular case of our geometry is the case by which the infinitesimal
transformation associated to every cycle depends only on the displacements
of the origins of the contact elements. The space of elements is


7 E. Cartan, Les sous-groupes des groupes continus de transformations, Annales de l'École
Normale Supérieure, série 3, t. 25 (1908), pp. 57-194.





then identical with the three-dimensional point space (x, y, z) with a Weyl
connection. The conditions for this are, from (35), (38), (39),

$$a = g = j = l = 0.$$

These conditions are, however, not independent. In fact, when a = 0, the
first equation of (37) will give

$$g = l = 0.$$

It follows that the conditions

$$a = j = 0 \tag{44}$$

are the necessary and sufficient conditions for the Weyl space to be a Weyl space
of points.

We may give a geometrical interpretation to the condition a = 0. In fact,
it signifies that the tangent planes to the surfaces of the family through a fixed point
in space envelop a cone of the second order. To prove this, let $\delta$ be an operation
under which the point remains fixed, so that

$$\omega_1(\delta) = \omega_2(\delta) = \omega_3(\delta) = 0.$$

Since t is now the only variable, we may determine the auxiliary variables
u, v, w as functions of t such that the conditions

$$\pi_1(\delta) = \pi_2(\delta) = \pi_3(\delta) = 0$$

are also satisfied. We put $\tau(\delta) = e$, and we can assume, by changing the
parameter t when necessary, that $\delta e = 0$.

If $\delta$ is an operation so chosen and d is an operation by which all the variables
vary, we get, from (5) and (35),

$$\delta\omega_1 = -ae\,\omega_3, \qquad \delta\omega_2 = -e\,\omega_1, \qquad \delta\omega_3 = -e\,\omega_2. \tag{45}$$

From (45) we get

$$\begin{aligned}
\delta^2\omega_3 &= e^2\omega_1,\\
\delta^3\omega_3 &= -ae^3\omega_3,\\
\delta^4\omega_3 &= ae^4\omega_2 + (\cdots)\,\omega_3,\\
\delta^5\omega_3 &= -ae^5\omega_1 + (\cdots)\,\omega_2 + (\cdots)\,\omega_3.
\end{aligned} \tag{46}$$

If we define the “plane coordinates” $l_1, l_2, l_3$ of a plane through the point by the
relation

$$l_1\omega_1 + l_2\omega_2 + l_3\omega_3 = 0, \tag{47}$$





we shall get, when the coordinates $l_1, l_2, l_3$ of the tangent plane of the family
are expanded in powers of e,

$$\begin{aligned}
l_1 &= \tfrac{1}{2}e^2 - \tfrac{1}{120}ae^5 + \cdots,\\
l_2 &= -e + \tfrac{1}{24}ae^4 + \cdots,\\
l_3 &= 1 - \tfrac{1}{6}ae^3 + \cdots.
\end{aligned} \tag{48}$$

From (48) it follows that

$$l_2^2 - 2l_1l_3 = \tfrac{1}{10}ae^5 + \cdots. \tag{49}$$

Hence the cone

$$l_2^2 - 2l_1l_3 = 0 \tag{50}$$

is the osculating quadric cone of the cone enveloped by the tangent planes of
the family $\{\Sigma\}$ and the condition a = 0 is a necessary and sufficient condition
that these tangent planes themselves envelop a quadric cone.
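The leading coefficient in (49) can be confirmed by multiplying the truncated series directly. The sketch below is an illustration only: it assumes the expansions (48) in the form $l_1 = \tfrac12 e^2 - \tfrac1{120}ae^5$, $l_2 = -e + \tfrac1{24}ae^4$, $l_3 = 1 - \tfrac16 ae^3$, treats a as a constant, and discards everything beyond $e^5$ (the unspecified terms in (46) only enter at higher order in this combination). The function names are hypothetical.

```python
from fractions import Fraction as F


def poly_mul(p, q, order):
    """Product of two polynomials in e (coefficient lists), truncated at e**order."""
    result = [F(0)] * (order + 1)
    for i, a in enumerate(p):
        # Only the terms of q that keep the total degree within `order`.
        for j, b in enumerate(q[:max(0, order + 1 - i)]):
            result[i + j] += a * b
    return result


def osculation_defect(a, order=5):
    """Coefficients of l2**2 - 2*l1*l3 in powers of e, up to e**order."""
    l1 = [0, 0, F(1, 2), 0, 0, -a / 120]
    l2 = [0, -1, 0, 0, a / 24, 0]
    l3 = [1, 0, 0, -a / 6, 0, 0]
    square = poly_mul(l2, l2, order)
    cross = poly_mul(l1, l3, order)
    return [s - 2 * c for s, c in zip(square, cross)]


a = F(7, 3)  # arbitrary sample value of the invariant a
defect = osculation_defect(a)
print(defect[:5], defect[5] == a / 10)
```

All coefficients below $e^5$ cancel and the coefficient of $e^5$ is $a/10$, so the tangent planes osculate the cone (50) and lie on it to this order exactly when a = 0.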

Added October 10, 1941. The author has succeeded in extending the results
of this paper to a space of n ($\geq 3$) dimensions. The following theorem has been
established: Given in a space of n dimensions a family of hypersurfaces depending
on n − 1 parameters such that the tangent hyperplanes at a point to the hypersurfaces
of the family through the point envelop a hypercone of n − 1 dimensions.
Then it is possible to define in the space, in an intrinsic way, a Weyl geometry.
The elements of this Weyl space are in general more complicated for $n \geq 4$
than for n = 3. The problem of equivalence is solved as a consequence of
this result.

National Tsing Hua University
Kunming, China 



Annals of Mathematics
Vol. 43, No. 3, July, 1942


IDEMPOTENT MARKOFF CHAINS 

By David Blackwell 
(Received March 6, 1942) 

I. Introduction 

Let $\mathfrak{B}$ be any Borel field of subsets of an abstract space X, and suppose that
for each $x \in X$ a probability measure¹ P(x, E) is defined on $\mathfrak{B}$ and that P(x, E)
is for fixed E a $\mathfrak{B}$-measurable function of x. Then P(x, E) may be considered
as representing the transition probability of going from the point x into the set
E in a single trial, and it is said to determine a Markoff chain on X. The probability
of going from x into E in n trials, denoted by $P_n(x, E)$, is given inductively
by²

$$P_n(x, E) = \int P(y, E)\,dP_{n-1}(x, y). \tag{1}$$

In this paper we shall consider Markoff chains for which $P_n(x, E)$ is independent
of n. It is clear from (1) that this will occur whenever $P_2(x, E) \equiv P(x, E)$,
so that we shall be studying Markoff chains which satisfy

$$P(x, E) = \int P(y, E)\,dP(x, y). \tag{2}$$

Such a Markoff chain will be called idempotent, and the justification for this
is apparent.
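For a chain on finitely many states the recursion (1) is ordinary matrix multiplication, which makes the remark about independence of n easy to try out. The sketch below assumes a finite state space, so that each measure P(x, ·) is a row of a matrix; the two sample matrices and the function names are hypothetical.

```python
from fractions import Fraction as F


def step(P_prev, P):
    """One use of the recursion (1) on a finite state space:
    P_n(x, {j}) = sum over y of P_{n-1}(x, {y}) * P(y, {j})."""
    n = len(P)
    return [[sum(P_prev[x][y] * P[y][j] for y in range(n)) for j in range(n)]
            for x in range(n)]


def n_step(P, n):
    """n-step transition probabilities, obtained by iterating (1)."""
    Pn = P
    for _ in range(n - 1):
        Pn = step(Pn, P)
    return Pn


def is_idempotent(P):
    """Condition (2): the two-step probabilities equal the one-step ones."""
    return step(P, P) == P


# A chain whose n-step probabilities change with n ...
lazy = [[F(1, 2), F(1, 2)],
        [F(1, 4), F(3, 4)]]
# ... and one with identical rows, which satisfies (2).
constant_rows = [[F(1, 3), F(2, 3)],
                 [F(1, 3), F(2, 3)]]

print(is_idempotent(lazy), is_idempotent(constant_rows))
print(n_step(constant_rows, 5) == constant_rows)
```

Once (2) holds, every further application of `step` returns the same matrix, which is the finite-state form of the statement that $P_n$ does not depend on n.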

Besides having some independent interest, idempotent Markoff chains are
useful as tools in the study of general Markoff chains. For example if there is
a subsequence $N_\nu \to \infty$ such that for all x and E

$$\frac{1}{N_\nu}\sum_{n=1}^{N_\nu} P_n(x, E) \to Q(x, E),$$

where Q(x, E) is for each x a probability measure, then the Markoff chain determined
by Q(x, E) is idempotent, and the properties of Q(x, E) are closely related
to those of P(x, E) (cf. Doob (II) and Yosida and Kakutani (III)).
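A two-state example shows how such a limit produces an idempotent chain. The sketch assumes the displayed limit is the arithmetic mean $\frac{1}{N}\sum_{n=1}^{N} P_n$ taken along a subsequence (the display is hard to read in this copy, so this reading is an assumption), and uses a hypothetical chain that alternates deterministically between its two states, for which $P_n$ itself has no limit.

```python
from fractions import Fraction as F


def matmul(A, B):
    """Product of two square matrices of equal size."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def cesaro_mean(P, N):
    """(1/N) * (P_1 + ... + P_N), where P_n is the n-step matrix of the chain."""
    n = len(P)
    total = [[F(0)] * n for _ in range(n)]
    power = P
    for _ in range(N):
        total = [[total[i][j] + power[i][j] for j in range(n)] for i in range(n)]
        power = matmul(power, P)
    return [[total[i][j] / N for j in range(n)] for i in range(n)]


# The chain that switches state at every trial: P_n oscillates with period 2.
flip = [[F(0), F(1)],
        [F(1), F(0)]]

Q = cesaro_mean(flip, 10)  # along even N the mean is already stationary
print(Q)
print(matmul(Q, Q) == Q)
```

Along the subsequence of even N the mean equals the matrix with all entries 1/2, and that matrix is idempotent, in line with the statement about Q(x, E).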

Suppose that X contains only a finite number of points, denoted by the
numbers $1, \cdots, n$, and suppose that $\mathfrak{B}$ consists of all subsets of X. Then
P(x, E) may be represented by a Markoff matrix $P = \|p_{ij}\|$, i.e. a square
matrix with non-negative elements and row sums equal to unity, where $p_{ij}$
denotes the probability of going from i to j in a single trial. For this case (2)
is the statement that the matrix P is idempotent. Idempotent Markoff matrices


1 A probability measure is a non-negative completely additive set function having the
value unity for the space.

2 We shall use the notation $dP_n(x, y)$ to denote integration of y with respect to the measure
$P_n(x, E)$.




have been completely characterized by Doob (II) and by Yosida and Kakutani
(III), and the result is essentially summarized in the following theorem:

Theorem 1: If $P = \|p_{ij}\|$ is an idempotent Markoff matrix, the subscripts
may be divided into groups $F, A_1, \cdots, A_k$ such that

(a) $p_{ij} = p_j > 0$ for $i, j \in A_t$,

(b) $\sum_{j \in A_t} p_j = 1$,

(c) $p_{ij} = 0$ for $j \in F$.

In section II we shall prove some general theorems about idempotent Markoff
chains. In sections III and IV we shall apply these results to two classes of
idempotent Markoff chains and show that in each case a decomposition analogous
to that described by Theorem 1 obtains. Finally in section V we give a
simple example of an idempotent Markoff chain which admits no such decomposition.
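The block structure described by Theorem 1 can be made concrete with a small matrix. The sketch below builds a five-state example with two groups $A_1$, $A_2$ and one index in F and checks idempotence. The theorem as quoted does not say what the rows belonging to F look like; the sketch assumes they are convex mixtures of the group rows, which is one way to satisfy (a), (b), (c) together with idempotence. The function names and the numbers are hypothetical.

```python
from fractions import Fraction as F


def block_matrix(class_weights, transient_mixes):
    """Markoff matrix with one block per group A_t and extra rows for F.

    class_weights: one list of positive weights p_j (summing to 1) per group.
    transient_mixes: one list of convex weights over the groups per index in F
                     (an assumption; see the lead-in).
    """
    n = sum(len(w) for w in class_weights) + len(transient_mixes)
    class_rows, offset = [], 0
    for w in class_weights:
        row = [F(0)] * n
        row[offset:offset + len(w)] = w
        class_rows.append(row)
        offset += len(w)
    P = []
    for w, row in zip(class_weights, class_rows):
        P.extend(list(row) for _ in w)  # (a): every row of a group is the same
    for mix in transient_mixes:
        P.append([sum(c * r[j] for c, r in zip(mix, class_rows))
                  for j in range(n)])
    return P


def matmul(A, B):
    """Product of two square matrices of equal size."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


P = block_matrix(
    class_weights=[[F(1, 3), F(2, 3)], [F(1, 4), F(3, 4)]],
    transient_mixes=[[F(1, 2), F(1, 2)]],
)
print(matmul(P, P) == P)
print(all(sum(row) == 1 for row in P), all(row[4] == 0 for row in P))
```

The last column, belonging to the single index in F, is zero in every row, which is condition (c); the two diagonal blocks have identical rows with positive entries summing to one, which is (a) and (b).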


II. Definitions and general theorems 

In this section we shall restrict attention to idempotent Markoff chains.
A set N ∈ ℬ is called a null set if P(x, N) = 0 for all x. A set I ∈ ℬ is called
invariant if there is a null set N such that

P(x, I) = 1 for all x ∈ I - N.

The equation 

\[ \int_E P(y, CE)\, dP(x, y) = 0 \quad \text{for all } x \tag{3} \]

is clearly a necessary and sufficient condition for the invariance of E. Following 
Doeblin (I), we shall call an invariant set indecomposable if it does not contain 
two disjoint non-null invariant subsets. A set E is called strictly invariant if
P(x, E) = 1 for all x ∈ E. It is clear that any strictly invariant set is invariant
and that if E is invariant there is a null set N such that E - N is strictly
invariant. We shall need the following fact about the reduction of multiple
integrals:

Lemma 1: Let ℬ (ℬ') be a Borel field of subsets of a space Z (Y), and suppose
that P(y, E) is for each y ∈ Y a measure on ℬ and for fixed E a ℬ'-measurable
function of y. Let m be any measure on ℬ', and define

\[ M(E) = \int P(y, E)\, dm. \]

Then any M-integrable function f(z) is integrable with respect to P(y, E) for all y
except a set of m-measure zero, and

\[ \int f(z)\, dM = \int \left\{ \int f(z)\, dP(y, z) \right\} dm. \]





DAVID BLACKWELL 


Proof: The verification of the lemma is immediate when f(z) is the char¬ 
acteristic function of a set of finite M-measure, and the general result follows by 
approximation. 

Theorem 2: If S, E are any sets, then for all x

\[ \int_S \left\{ \int_{CS} P(z, E)\, dP(y, z) \right\} dP(x, y) = \int_{CS} \left\{ \int_S P(z, E)\, dP(y, z) \right\} dP(x, y). \tag{4} \]

This theorem states that, starting from any point x, the probability of going 
first into S, then into CS , and finally into E is the same as the probability of 
going first into CS, then into S, and finally into E. This is not of course true
for general Markoff chains, and in fact it may be shown that if it is true for 
every S in the special case E = X, then the Markoff chain is idempotent. The 
idea of the proof to be given below and of the results which follow will become 
clear at once if the integrals are interpreted as representing probabilities of 
events. 
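
For a finite state space the integrals in (4) become sums, so the statement can be tested by brute force. The sketch below rests on assumptions: the matrix P is a hypothetical idempotent example, Q is a hypothetical non-idempotent one, and the function names are illustrative only.

```python
from itertools import combinations

# Hypothetical idempotent Markoff matrix (rows sum to one, P * P = P).
P = [[0.0, 0.12, 0.28, 0.6],
     [0.0, 0.3, 0.7, 0.0],
     [0.0, 0.3, 0.7, 0.0],
     [0.0, 0.0, 0.0, 1.0]]

# Hypothetical Markoff matrix that is not idempotent.
Q = [[0.5, 0.5],
     [0.2, 0.8]]

def subsets(states):
    """All subsets of a finite list of states."""
    return [set(c) for r in range(len(states) + 1)
            for c in combinations(states, r)]

def path_prob(p, x, first, second, last):
    """Probability of going from x into `first`, then `second`, then `last`."""
    return sum(p[x][y] * sum(p[y][z] * sum(p[z][w] for w in last)
                             for z in second)
               for y in first)

def max_gap(p):
    """Largest difference between the two sides of (4) over all x, S, E."""
    states = list(range(len(p)))
    worst = 0.0
    for s in subsets(states):
        cs = set(states) - s
        for e in subsets(states):
            for x in states:
                gap = abs(path_prob(p, x, s, cs, e) - path_prob(p, x, cs, s, e))
                worst = max(worst, gap)
    return worst

print(max_gap(P), max_gap(Q))
```

The gap is zero up to rounding for the idempotent matrix and strictly positive for the other one, in line with the remark that (4) fails for general Markoff chains.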

Proof: For all x,

\[ \int_S \left\{ \int_{CS} P(z, E)\, dP(y, z) \right\} dP(x, y) + \int_{CS} \left\{ \int_{CS} P(z, E)\, dP(y, z) \right\} dP(x, y) = \int \left\{ \int F(z) P(z, E)\, dP(y, z) \right\} dP(x, y), \tag{5} \]

where F(z) is the characteristic function of CS. Also

\[ \int_{CS} \left\{ \int_S P(z, E)\, dP(y, z) \right\} dP(x, y) + \int_{CS} \left\{ \int_{CS} P(z, E)\, dP(y, z) \right\} dP(x, y) = \int_{CS} \left\{ \int P(z, E)\, dP(y, z) \right\} dP(x, y) = \int F(y) P(y, E)\, dP(x, y). \tag{6} \]

By Lemma 1, with f(z) = F(z)P(z, E), the right members of (5) and (6) are
equal. Hence their left members are equal, and (4) follows at once.

Theorem 3: The class 𝒜 of invariant sets is a Borel field.

Proof: It is clear that a denumerable sum of invariant sets is invariant; 
we need show only that the complement of an invariant set is also invariant. 
Taking E = X in Theorem 2, (4) becomes 

\[ \int_S P(y, CS)\, dP(x, y) = \int_{CS} P(y, S)\, dP(x, y). \tag{7} \]

If S is invariant, it follows immediately from this equation and condition (3) 
that CS is also invariant. 

Theorem 4: The function P(x, E) is for fixed E an 𝒜-measurable function.
Proof: We must show that the set


S = {x for which P(x, E) < k}





is invariant for any k. We may assume k > 0. Now

\[ \int_{CS} \left\{ \int_S P(z, E)\, dP(y, z) \right\} dP(x, y) \le k \int_{CS} P(y, S)\, dP(x, y), \tag{8} \]

and the equality sign holds only if both sides vanish. Also

\[ \int_S \left\{ \int_{CS} P(z, E)\, dP(y, z) \right\} dP(x, y) \ge k \int_S P(y, CS)\, dP(x, y). \tag{9} \]

By Theorem 2 the left members of (8) and (9) are equal, and (7) implies that 
their right members are equal. Hence equality holds in (8) for all x. Conse¬ 
quently the right side of (8) vanishes for all x and, since k > 0, this is precisely 
condition (3) for invariance of CS. 

The principal result of this section is contained in the following theorem 
which shows that the strictly invariant indecomposable sets are the analogues 
of the sets Ai of Theorem 1. 

Theorem 5: If S is strictly invariant and indecomposable, then P(x, E) is
independent of x for all x ∈ S.

Proof: For all z ∈ S we have

\[ P(z, E) = \int_S P(y, E)\, dP(z, y). \tag{10} \]

Let y_0 ∈ S. The sets

S · {P(x, E) ≥ P(y_0, E)} and S · {P(x, E) ≤ P(y_0, E)}

are by Theorem 4 invariant, and it is clear from (10) with z = y_0 that neither
can be a null set. Since S is indecomposable, their complements in S, the sets

S · {P(x, E) < P(y_0, E)} and S · {P(x, E) > P(y_0, E)}

must be null sets. Then P(x, E) = P(y_0, E) for all x ∈ S except a null set,
and (10) then implies that

P(x, E) = P(y_0, E) for all x ∈ S.

III. The strictly separable case

A Borel field ℬ of subsets of X is said to be strictly separable if it is determined
by a denumerable subcollection of its elements. ℬ is said to be atomic if
X = Σ_α X_α, where each X_α ∈ ℬ and (11) for every B ∈ ℬ and every α, either
B X_α = X_α or B X_α = 0. The sets X_α are called the atoms of ℬ.

Lemma 2: A strictly separable Borel field is atomic.

This fact is well known; since we shall refer to the proof, we reproduce it here.
Proof: Let ℬ be determined by F_1, F_2, .... Define

\[ X_\alpha = \prod_{n=1}^{\infty} F_n^{\epsilon_n(\alpha)}, \]

where ε_n(α) = ±1, E^1 = E, E^{-1} = CE. Then each X_α ∈ ℬ, X = Σ_α X_α,
and the class of sets B satisfying (11) is a Borel field including every F_n and
hence includes ℬ.





Theorem 6: If 𝒜 contains a strictly separable subfield 𝒜_1 whose atoms are
indecomposable, then X = F + Σ_α A_α, where

(a) P(x, E) = P_α(E) for x ∈ A_α

(b) P_α(A_α) = 1

(c) F is a null set.


Proof: Let F_1, F_2, ... determine 𝒜_1. We may assume the F_k form a field.
Let N_k be the null subset of F_k for which P(x, F_k) < 1, and define
F = Σ_{k=1}^∞ N_k. Let {X_α} be the atoms of 𝒜_1, and define A_α = X_α - F.
If x ∈ A_α, it is evident from the construction of the atoms given in the proof
of Lemma 2 that P(x, X_α) = 1 and hence that P(x, A_α) = 1. Since A_α is
strictly invariant and indecomposable, we may invoke Theorem 5 to obtain (a).

Theorem 7: If ℬ is strictly separable, the conclusion of Theorem 6 holds.
Proof: We shall define a subfield of 𝒜 satisfying the hypothesis of Theorem 6.
Let F_1, F_2, ... determine ℬ. We may suppose the F_j form a field. Define

I_jkn = {x : P(x, F_j) < k/n}, k = 0, ..., n; n, j = 1, 2, ....

Each of these sets is invariant by Theorem 4. Let 𝒜_1 be the strictly separable
subfield determined by the I_jkn. If X_α is an atom of 𝒜_1, it is clear that P(x, F_j)
is independent of x for all x ∈ X_α and all j. Since whenever a set of measures
coincide on a field, they coincide on the Borel field determined by it, P(x, E)
is independent of x for all x ∈ X_α and all E. Then each X_α is indecomposable,
for otherwise it would contain two disjoint non-empty strictly invariant subsets
S and T, and P(x, S) = 1 for x ∈ S and 0 for x ∈ T, contradicting the fact
that P(x, E) is independent of x on X_α.

Thus for instance if ℬ is the class of Borel sets in Euclidean space, any
idempotent Markoff chain admits the decomposition of Theorem 6.

IV. The density case 

Let m be any finite measure defined on ℬ and define the product measure
m × m on the Borel field ℬ_1 = ℬ × ℬ of subsets of the product space X × X
of X with itself. Suppose p_1(x, y) is any non-negative ℬ_1-measurable function
such that for each x ∈ X,

\[ \int p_1(x, y)\, dy = 1. \tag{12} \]

Then the function

\[ P(x, E) = \int_E p_1(x, y)\, dy \tag{13} \]

determines a Markoff chain on X, and the function p_1(x, y) is called the density
function. If the Markoff chain so defined is idempotent, we have


\[ \int_E p_1(x, y)\, dy = \int \left\{ \int_E p_1(z, y)\, dy \right\} dP(x, z). \tag{14} \]

Applying a well known formula for transformation of measures and interchanging
the order of integration, we obtain

\[ \int_E p_1(x, y)\, dy = \int_E \left\{ \int p_1(x, z) p_1(z, y)\, dz \right\} dy; \tag{15} \]

i.e. if we define p(x, y) by the equation

\[ p(x, y) = \int p_1(x, z) p_1(z, y)\, dz, \tag{16} \]

p(x, y) is another density function defining exactly the same Markoff chain as
p_1(x, y). It follows that for fixed x, p(x, z) and p_1(x, z) are equal for almost all
z, and (16) then implies

\[ p(x, y) = \int p(x, z) p_1(z, y)\, dz. \tag{17} \]


Multiplying both sides of (17) by pi(y, w) and integrating with respect to y, 
we obtain 

\[ p(x, w) = \int p(x, z) p(z, w)\, dz. \tag{18} \]

Thus if an idempotent Markoff chain can be represented by a density function,
it can be represented by a density function which is idempotent in the sense of
(18). Equation (18) is the direct analogue of the idempotent Markoff matrix if we
think of the Markoff chain as determined by the density function p(x, y) rather
than by the function P(x, E). To study idempotent density functions we shall
use the following well known fact:

Lemma 3: If m is a finite measure defined on a Borel field ℬ of subsets of a
space X, then

(19) X = S + X_1 + X_2 + ... where

(a) m(X_i) > 0, and E ∈ ℬ, E ⊂ X_i implies m(E) = m(X_i) or m(E) = 0.

(b) every subset of S of positive measure has a subset of every smaller positive
measure.

Either S or the X_i may of course be absent. The decomposition is clearly
unique up to sets of measure zero.

Proof: The sets X_i satisfying (a) may be divided into equivalence classes
by defining two such sets as equivalent if their symmetric difference has measure
zero. Any two sets in different classes can have only a set of measure zero in
common, and since each set has positive measure there are at most denumerably
many equivalence classes. Let X_1, X_2, ... be a set of representatives, one
from each equivalence class, and define S = X - Σ_i X_i. Then every subset
of S of positive measure has a subset of smaller positive measure and hence a
subset of arbitrarily small positive measure, and we must show that this implies
(b). Suppose S_1 ⊂ S, 0 < k < m(S_1). Choose a sequence of sets K_n inductively
so as to satisfy

(c) K_n ∈ ℛ_n

(d) \[ m(K_n) > \operatorname*{l.u.b.}_{E \in \mathcal{R}_n} m(E) - \frac{1}{n}, \]

where ℛ_n is the class of all subsets E of S_1 - (K_1 + ... + K_{n-1}) for which
m(E) ≤ k - m(K_1 + ... + K_{n-1}). Define K = Σ_n K_n. Then K ⊂ S_1
and m(K) = k, otherwise the choice of K_n would at some stage be contradicted,
since S_1 - K would contain a subset of positive measure smaller than
k - m(K).

We now use an argument similar to that of Doeblin in a related discussion (I).
Apply Lemma 3 to m defined on the Borel field 𝒜 of invariant subsets of X,
obtaining a decomposition (19) into invariant subsets. Each X_n is clearly
indecomposable, and we may suppose that each X_n is strictly invariant. We
shall show that S is a null set. First extract from S a null set H of largest
possible measure and write T = S - H. It is sufficient to show that T has measure
zero. Certainly every null subset of T has measure zero. Choose a class
D_mn ∈ 𝒜, n = 1, ..., 2^m; m = 0, 1, 2, ... of subsets of T so that

(a) D_mi · D_mj = 0 if i ≠ j

(b) m(D_mi) = (1/2^m) m(T)

(c) D_{m+1, 2i-1} + D_{m+1, 2i} = D_mi.

By deleting a null set N_mi we may make each of the D_mi strictly invariant.
Hence by deleting N = Σ_{m,i} N_mi from each D_mi simultaneously we obtain a
new class of sets satisfying (a), (b), (c) and each of which is moreover strictly
invariant. Thus without loss of generality we may suppose that each D_mi
is strictly invariant. Now

\[ D_{01} = \prod_{m=1}^{\infty} (D_{m1} + \cdots + D_{m 2^m}) = \sum_{i, j, k, \ldots} D_{1i} \cdot D_{2j} \cdot D_{3k} \cdots \]

Each set D_1i · D_2j · D_3k ··· is a strictly invariant set of measure zero and is
therefore empty. It follows that D_01 is empty. Since m(T) = m(D_01), T
has measure zero.

We are now in a position to apply Theorem 6, and we have shown further 
that there will be only denumerably many sets A a in this case. However 
without appealing to this theorem we may easily obtain an even more precise
decomposition in terms of the density function p(x, y ). 





By Theorem 5, P(x, E) is independent of x for x ∈ X_n; it follows that for
any x_1, x_2 ∈ X_n,

\[ p(x_1, y) = p(x_2, y) \tag{20} \]

except possibly on a y-set of measure zero. (18) then implies that (20) holds
identically in y for all x_1, x_2 ∈ X_n. We write for all x ∈ X_n,

p(x, y) = p_n(y).

Then for any x it follows from (18) that

\[ p(x, y) = \sum_{n=1}^{\infty} \int_{X_n} p(x, z) p_n(y)\, dz = \sum_{n=1}^{\infty} P(x, X_n) p_n(y); \]

i.e. any p(x, y) is a linear combination of the functions p_n(y). Define
U_n = {y ∈ X_n such that p_n(y) = 0} and V_n = {y ∉ X_n such that p_n(y) > 0} and
write U = Σ_n U_n, V = Σ_n V_n. Then m(V) = 0 and both U and V are
null sets. Writing A_n = X_n - (U + V), F = X - (Σ_n X_n + V), we may
summarize our results in the following theorem.

Theorem 8: If p(x, y) is an idempotent density function, then
X = F + V + A_1 + A_2 + ..., where

(a) p(x, y) = p_n(y) for x ∈ A_n

(b) p_n(y) > 0 for y ∈ A_n

(c) \int_{A_n} p_n(y)\, dy = 1

(d) p(x, y) = 0 for y ∈ F

(e) m(V) = 0.
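
The idempotence relation (18) and the form p(x, y) = p_n(y) of Theorem 8 can be illustrated numerically. The sketch below is an assumption-laden example, not part of the paper: X is taken to be the interval [0, 1) with Lebesgue measure, with two hypothetical sets X_1 = [0, 0.5) and X_2 = [0.5, 1), densities p_1(y) = 8y and p_2(y) = 2, and no sets F or V. The integral in (18) is approximated by a midpoint rule.

```python
def p(x, y):
    """Hypothetical idempotent density on [0, 1) with respect to Lebesgue measure.

    For x in X_1 = [0, 0.5) the density is p_1(y) = 8y on X_1, zero elsewhere;
    for x in X_2 = [0.5, 1) it is p_2(y) = 2 on X_2, zero elsewhere.
    """
    if x < 0.5:
        return 8.0 * y if y < 0.5 else 0.0
    return 2.0 if y >= 0.5 else 0.0

def compose(x, w, cells=200):
    """Midpoint-rule approximation of the right side of (18)."""
    h = 1.0 / cells
    return sum(p(x, (i + 0.5) * h) * p((i + 0.5) * h, w)
               for i in range(cells)) * h

def total_mass(x, cells=200):
    """Midpoint-rule approximation of the integral in (12)."""
    h = 1.0 / cells
    return sum(p(x, (i + 0.5) * h) for i in range(cells)) * h

sample = [(i + 0.25) / 10 for i in range(10)]
worst = max(abs(compose(x, w) - p(x, w)) for x in sample for w in sample)
print(worst, total_mass(0.1), total_mass(0.9))
```

Because both densities are piecewise linear and the cell boundaries include 0.5, the midpoint rule is exact here up to rounding, so the printed deviation should be at rounding level.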


V. An example 

We conclude by giving an example of an idempotent Markoff chain for which 
no decomposition like that obtained in the preceding sections is possible. Let
ℬ be any Borel field, and define P(x, E) to be the characteristic function of E.
Then every set of ℬ is strictly invariant; there are no null sets except the empty
set. A decomposition of X would consist in writing X = Σ_α X_α where each
X_α is indecomposable; and this is clearly impossible whenever ℬ is non-atomic.

Institute for Advanced Study 


Bibliography 

I. Doeblin, W., Chaines simples constantes de Markoff. Ann. Ec. Norm. (3), LVII., 
Fasc. 2. 

II. Doob, J. L., Topics in the theory of Markoff chains. (To appear in the Trans. Am. 
Math. Soc.) 

III. Yosida, K., and Kakutani, S., Markoff processes with a denumerably infinite number 
of states. Jap. Journal of Math., 1939, p. 47. 



Annals of Mathematics
Vol. 43, No. 3, July, 1942 


BANACH SPACE METHODS IN TOPOLOGY* 

By Samuel Eilenberg
(Received January 22, 1942) 

1. Introduction 

We shall consider the totality B(X) of all continuous real valued bounded
functions f defined on a topological space X. 1 With the usual definition of
addition and multiplication by real numbers and with the norm

\[ \| f \| = \sup_{x \in X} | f(x) | \]

the set B(X) becomes a Banach space.

Banach 2 has established the interesting and important result that two com¬ 
pact 3 metric spaces X and Y are homeomorphic if and only if the corresponding 
spaces B{X) and B(Y) are isometric. Stone 4 has shown that the hypothesis 
that X and Y are metric was superfluous. Both authors arrive at their results 
by establishing the general form of an isometric mapping of B{X) onto B{Y). 

The Banach-Stone theorem implies that if X is compact then the topological
structure of X is entirely determined by the metric properties of B(X). The
first purpose of this paper is to exhibit this relationship more explicitly. By 
studying the convex subsets of the surface of the unit sphere in B(X) we define 
an “ideal space” E and prove that if X is compact, then E and X are homeo¬ 
morphic. From this result we readily get a proof of the Banach-Stone theorem 
for compact spaces and also for a class of non-compact spaces including all 
metric spaces. 

Although the space E provides theoretically complete means for translating
topological properties of a compact X into metric properties of B(X), it will
very often lead from simple topological properties of X to involved metric
properties of B(X) with no clear geometric interpretation. This makes it an
interesting problem to relate “interesting” topological properties of X with
“interesting” metric properties of B(X). In this direction we prove that if B(X)
is a direct sum B_1 + B_2 of two Banach spaces, then B_i = B(X_i) where X_i is an
open and closed subset of X. From this we see that the connectedness properties
of X translate into properties of B(X) dealing with direct sums. 5

* Presented to the American Mathematical Society April 12 and May 2, 1941. 

1 A topological space is a set X with a family of subsets called open such that (a) 0 and X 
are open, (b) the union of any number of open sets is open, (c) the intersection of two open 
sets is open. If in addition, (d) every two distinct points of X belong to two disjoint open
sets, then X is said to be a Hausdorff space . 

2 S. Banach. Théorie des opérations linéaires. Warsaw, 1932, p. 170.

3 A space X is called compact if it is a Hausdorff space and if every covering of X by open 
sets contains a finite subcovering. 

4 M. H. Stone. Trans. Amer. Math. Soc ., 41 (1937), p. 469. 

5 Another result in this direction was recently announced by S. Krein and M. Krein
(C. R. U.R.S.S., 27 (1940), pp. 427-430): If X is completely regular then B(X) is separable
if and only if X is compact metric.




B(X) is not only a Banach space but also a linear lattice and a normed ring.
From these two points of view B(X) has been thoroughly investigated. 6 In 
particular, necessary and sufficient conditions are known for a linear lattice or 
normed ring B in order that B = B(X) for some topological space X . The 
analogous problem for a Banach space B is as yet unsolved and we hope that 
the methods developed in this paper will lead towards a solution. 

2. Tychonoff’s compacting 

A Hausdorff space X is called completely regular if given a closed set F ⊂ X
and a point x_0 ∈ X - F there is a continuous real valued function f on X such
that f(x_0) = 0 and f(x) = 1 for all x ∈ F. Corresponding to every completely
regular space X Tychonoff 7 has constructed a space β(X) such that

(T_1) β(X) is compact,

(T_2) X ⊂ β(X), and the closure of X is β(X),

(T_3) every f ∈ B(X) has an extension f* ∈ B[β(X)].

Moreover, these properties determine β(X) uniquely. Since X is dense in
β(X) it is clear that the extension f* of f ∈ B(X) is unique and therefore
establishes a 1-1 correspondence between B(X) and B[β(X)], in view of which the
two Banach spaces may be considered identical.

The space β(X) was extensively studied by Čech 8 who has proved among
other results that if X is completely regular and satisfies the first countability
axiom 9 then β(X) determines X. More precisely, X is then the subset of β(X)
consisting of the points at which β(X) has a countable base. 9


3. Stars 

Given a Banach space B we shall consider the transformation Tb = —b and 
define 


SA = A ∪ TA


for every A C B. 

Lemma 3.1. Given b_1, b_2 ∈ B, || b_1 || ≤ 1, || b_2 || ≤ 1 the following conditions
are equivalent:

(a) || t b_1 + (1 - t) b_2 || = 1 for at least one t such that 0 < t < 1,

(b) || t b_1 + (1 - t) b_2 || = 1 for all t such that 0 ≤ t ≤ 1,

(c) || b_1 + b_2 || = 2.

6 See Stone loc. cit. and S. Kakutani, Ann. of Math., 42 (1941), pp. 994-1024. The latter
paper contains an up-to-date bibliography of the subject.

7 A. Tychonoff, Math. Ann., 102 (1930), pp. 544-561.

8 E. Čech. Ann. of Math., 38 (1937), pp. 823-844.

9 We say that a sequence U_n of open sets is a countable base for the point x of a Hausdorff
space X if every open set containing x contains at least one of the sets U_n. If a countable
base exists for every point of X, then X is said to satisfy the first countability axiom.





The proof is left to the reader. 

For every b ∈ B such that || b || ≤ 1 we define the star St(b) as follows:

St(b) = {b_1 | || b_1 || ≤ 1, || b_1 + b || = 2}.

The definition is justified geometrically by Lemma 3.1. We shall also consider
the symmetrized star SSt(b). Clearly the conditions || b || < 1, St(b) = 0
and SSt(b) = 0 are equivalent. Also the conditions b_1 ∈ St(b_2) and b_2 ∈ St(b_1)
are equivalent and similarly for the symmetrized stars. It is also clear that
b ∈ St(b) if and only if || b || = 1.

Let now X be a topological space and B(X) the Banach space described in 
the introduction. 

Lemma 3.2. Let X be countably compact 10 and let f_1, f_2 ∈ B(X), || f_1 || ≤ 1,
|| f_2 || ≤ 1. We have f_1 ∈ St(f_2) if and only if there is an x ∈ X such that

f_1(x) = f_2(x) = ±1.

Proof clear. 
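
Lemma 3.2 can be checked exhaustively in a toy setting. The sketch below assumes a hypothetical three-point space X with the discrete topology (which is compact, hence countably compact), so that B(X) is the set of real triples with the maximum norm, and it only inspects functions with values on a small grid. All names are illustrative.

```python
from itertools import product

# Hypothetical finite model: X = {0, 1, 2} with the discrete topology,
# functions restricted to a small grid of values inside the unit ball.
VALUES = (-1.0, -0.5, 0.0, 0.5, 1.0)
POINTS = 3

def norm(f):
    """Sup norm on a finite space."""
    return max(abs(v) for v in f)

def in_star(f1, f2):
    """f1 in St(f2): ||f1|| <= 1 and ||f1 + f2|| = 2."""
    return norm(f1) <= 1 and norm([a + b for a, b in zip(f1, f2)]) == 2

def common_peak(f1, f2):
    """There is an x with f1(x) = f2(x) = +1 or -1."""
    return any(a == b and abs(a) == 1 for a, b in zip(f1, f2))

def lemma_3_2_holds():
    functions = list(product(VALUES, repeat=POINTS))
    return all(in_star(f1, f2) == common_peak(f1, f2)
               for f1 in functions for f2 in functions)

print(lemma_3_2_holds())
```

The grid values are exact binary fractions, so the equality tests on norms involve no rounding.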

Lemma 3.3. Let X be a topological space and let f ∈ B(X). We have | f | = 1
if and only if SSt(f) is the whole surface of the unit sphere of B(X).

It is obvious that if | f(x) | = 1 for all x ∈ X and f_1 ∈ B(X) and || f_1 || = 1
then either || f + f_1 || = 2 or || f - f_1 || = 2 and therefore f_1 ∈ SSt(f).

Now suppose that inf | f(x) | = k < 1. Define

\[ f_1(x) = \frac{1 - | f(x) |}{1 - k}. \]

Clearly f_1 ∈ B(X) and || f_1 || = 1. However

\[ \| f + f_1 \| = \sup \frac{\bigl| f(x) - k f(x) + 1 - | f(x) | \bigr|}{1 - k} \le 1 + k < 2 \]

because if f(x) ≥ 0 we have

| f(x) - k f(x) + 1 - | f(x) | | = | 1 - k f(x) | ≤ 1 - k^2

and if f(x) ≤ 0 then

| f(x) - k f(x) + 1 - | f(x) | | = | 1 - (2 - k) | f(x) | | ≤ 1 - k ≤ 1 - k^2.

Similarly || f - f_1 || < 2 and f_1 ∉ SSt(f).

Lemma 3.3 shows that the elements of B(X) such that | f | = 1 can be
characterized intrinsically in B(X). Hence the number of components of X can be
determined by the properties of B(X), since X will consist of n components if
and only if there will be exactly 2^n continuous functions f such that | f | = 1.

4. The sets M(f) 

Let f ∈ B(X) and || f || ≤ 1. Define

M(f) = {x | | f(x) | = 1}.


10 A space X is called countably compact if it is a Hausdorff space and if every countable
covering of X by open sets contains a finite subcovering. 





Clearly M(f) is a closed subset of X. If || f || < 1 then obviously M(f) is empty;
if || f || = 1, M(f) still may be empty unless X is countably compact.

Lemma 4.1. The sets M(f) form a sublattice of the lattice of the closed
subsets of X.

In fact, we have M(f_1) ∩ M(f_2) = M(f) where f(x) = min [| f_1(x) |, | f_2(x) |]
and M(f_1) ∪ M(f_2) = M(f) where f(x) = max [| f_1(x) |, | f_2(x) |].

Lemma 4.2. The space X is completely regular if and only if the sets M(f) form
a multiplicative base for the closed sets in X.

If X is completely regular, then given a closed set F ⊂ X and a point
x_0 ∈ X - F, there is an f ∈ B(X) such that || f || ≤ 1, f(x_0) = 0 and f(x) = 1 for
x ∈ F. Hence F ⊂ M(f) and x_0 ∉ M(f). Conversely, if the sets M(f) form a
multiplicative base, then for a given closed set F ⊂ X and a point x_0 ∈ X - F there
is an f ∈ B(X) such that || f || ≤ 1, F ⊂ M(f) and x_0 ∉ M(f). It follows
that | f(x_0) | < 1 and | f(x) | = 1 for all x ∈ F; hence X is completely regular.

Lemma 4.3. 11 If X is normal then the sets M(f) coincide with the closed G_δ
subsets of X.

Lemma 4.4. If X is countably compact then the relations

M(f_1) ∩ M(f_2) ≠ 0 and f_2 ∈ SSt(f_1)

are equivalent for any f_1, f_2 ∈ B(X) such that || f_1 || ≤ 1, || f_2 || ≤ 1.

In fact, f_2 ∈ SSt(f_1) means that || f_1 + f_2 || = 2 or that || f_1 - f_2 || = 2. Since X
is countably compact this means that for some x_0 ∈ X either | f_1(x_0) + f_2(x_0) | = 2
or | f_1(x_0) - f_2(x_0) | = 2. This is equivalent with | f_1(x_0) | = | f_2(x_0) | = 1
which means M(f_1) ∩ M(f_2) ≠ 0.

Lemma 4.5. If X is completely regular and countably compact then the relations

M(f_1) ⊂ M(f_2) and SSt(f_1) ⊂ SSt(f_2)

are equivalent for any f_1, f_2 ∈ B(X) such that || f_1 || ≤ 1, || f_2 || ≤ 1.

Assume that M(f_1) ⊂ M(f_2) and let f ∈ SSt(f_1). In view of Lemma 4.4 we
have then M(f) ∩ M(f_1) ≠ 0; consequently M(f) ∩ M(f_2) ≠ 0 and by Lemma
4.4 we have f ∈ SSt(f_2). Hence SSt(f_1) ⊂ SSt(f_2).

Let now x_0 ∈ M(f_1) - M(f_2). Since X is completely regular there is an
f ∈ B(X) such that || f || ≤ 1, f(x_0) = 1 and f(x) = 0 for all x ∈ M(f_2). It
follows that x_0 ∈ M(f), therefore M(f) ∩ M(f_1) ≠ 0 and f ∈ SSt(f_1). Since
f(x) = 0 for all x ∈ M(f_2) we have | f(x) ± f_2(x) | < 2 and since X is countably
compact this implies that || f ± f_2 || < 2. Consequently f ∈ SSt(f_1) - SSt(f_2).
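
The equivalence of Lemma 4.5 can be explored by brute force in a toy model. The sketch below makes several assumptions: X is a hypothetical three-point discrete space, functions take values on a small grid, and symmetrized stars are computed only within that grid. The grid contains the indicator functions used in the second half of the proof, so the equivalence survives the restriction.

```python
from itertools import product

# Hypothetical finite model: X = {0, 1, 2}, discrete topology, grid values.
VALUES = (-1.0, -0.5, 0.0, 0.5, 1.0)
UNIT_BALL = list(product(VALUES, repeat=3))

def norm(f):
    """Sup norm on a finite space."""
    return max(abs(v) for v in f)

def peak_set(f):
    """M(f): the points where |f(x)| = 1."""
    return frozenset(x for x, v in enumerate(f) if abs(v) == 1)

def sym_star(f):
    """SSt(f) restricted to the grid: ||g + f|| = 2 or ||g - f|| = 2."""
    return frozenset(
        g for g in UNIT_BALL
        if norm([a + b for a, b in zip(f, g)]) == 2
        or norm([a - b for a, b in zip(f, g)]) == 2)

def lemma_4_5_holds():
    stars = {f: sym_star(f) for f in UNIT_BALL}
    return all((peak_set(f1) <= peak_set(f2)) == (stars[f1] <= stars[f2])
               for f1 in UNIT_BALL for f2 in UNIT_BALL)

print(lemma_4_5_holds())
```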

From Lemmas 4.3 and 4.5 we obtain the following 

Theorem 4.6. If X is normal and countably compact, then the sets SSt(f)
(f ∈ B(X), || f || ≤ 1) ordered by inclusion form a lattice isomorphic with the
lattice of the closed G_δ subsets of X.

5. The sets Q(x) 

Given a point x e X we define 

Q(x) = {f | f ∈ B(X), || f || = 1, | f(x) | = 1}.

11 For the proof see N. Vedenisoff, Fund . Math., 27 (1936), pp. 234-238. 





Lemma 5.1. Let X be completely regular. Given x ∈ X and f ∈ B(X) such that
|| f || ≤ 1 we have Q(x) ⊂ SSt(f) if and only if | f(x) | = 1.

If | f(x) | = 1 it follows directly from the definition that Q(x) ⊂ SSt(f). If
now | f(x) | < 1 then there is an open set G such that x ∈ G and
| f(x') | < 1 - ε < 1 for all x' ∈ G. Hence there is an f_1 ∈ B(X) such that
|| f_1 || = 1, f_1(x) = 1, and f_1(x') = 0 for all x' ∈ X - G. Consequently
f_1 ∈ Q(x) but since || f ± f_1 || ≤ 2 - ε we have f_1 ∉ SSt(f).

Theorem 5.2. If X is compact, then a subset A of the unit sphere of B(X) is
of the form Q(x) for some x ∈ X if and only if A = SA_1 where A_1 is a maximal
convex subset of the surface of the unit sphere in B(X).

This is an immediate consequence of the following

Theorem 5.3. If X is compact then a subset A of the surface of the unit sphere
of B(X) is maximal convex if and only if there is an x_0 ∈ X and ε = ±1 such that

A = {f | f ∈ B(X), || f || = 1, f(x_0) = ε}.

We first show that if such x_0 and ε exist then A is maximal convex. It is
obvious that A is convex. Let then A* be a convex set contained in the unit
sphere of B(X), such that A ⊂ A* and f ∈ A* - A. Since f(x_0) ≠ ε and X
is compact and therefore normal there is an f_1 ∈ B(X) such that || f_1 || = 1,
f_1(x_0) = ε and f_1(x) = 0 for x ∈ M(f). It follows that | f(x) + f_1(x) | < 2 for
all x and therefore ½(f_1 + f) ∉ A*. However, f_1 ∈ A* and f ∈ A*, hence A*
was not convex.

In order to prove that every maximal convex subset A of the unit sphere is
of the form described in Theorem 5.3, it is enough to find an x_0 ∈ X and an
ε = ±1 such that f(x_0) = ε for all f in A. If we denote

M_+(f) = {x | f(x) = 1}, M_-(f) = {x | f(x) = -1}

this reduces to proving that either

\[ \bigcap_{f \in A} M_+(f) \ne 0 \quad \text{or} \quad \bigcap_{f \in A} M_-(f) \ne 0. \]

Assume the contrary. Since the sets M_+(f) and M_-(f) are closed and X is
compact, we would have then two finite sequences f_1, f_2, ..., f_m, g_1, g_2, ..., g_n
in A such that

\[ \bigcap_i M_+(f_i) = 0 \quad \text{and} \quad \bigcap_i M_-(g_i) = 0. \]

Since A is convex we have (f_1 + ... + f_m + g_1 + ... + g_n)/(n + m) ∈ A,
hence || f_1 + ... + f_m + g_1 + ... + g_n || = n + m, therefore there is an
x_1 ∈ X such that f_1(x_1) = ... = f_m(x_1) = g_1(x_1) = ... = g_n(x_1) = ±1 which
leads to a contradiction.

6. Reconstruction of X from B(X) 

Let B be a Banach space. A subset of B will be called an ideal point, and
denoted by a Greek letter ξ, if ξ = SA where A is a maximal convex subset of
the surface of the unit sphere of B. The totality of all the ideal points will be
denoted by E. A subset Φ of E will be called an M-set if there is an f ∈ B such
that || f || ≤ 1 and

Φ = {ξ | ξ ⊂ SSt(f)}.

A subset of E which is an intersection of M-sets will be called closed.

It is to be noted that with this definition of closed sets E does not necessarily
become a topological space. In fact, if E were a topological space it would
have to be a closed set and therefore an M-set. This would imply the existence
of an element f_1 ∈ B such that || f_1 || = 1 and that E = {ξ | ξ ⊂ SSt(f_1)}. From
this it follows easily that SSt(f_1) is the whole surface of the unit sphere of B.
Of course, in most Banach spaces no such element exists.

Theorem 6.1. If X is a compact space then E defined from the Banach space 
B(X) is a topological space homeomorphic to X. 

Given x ∈ X it follows from Theorem 5.2 that Q(x) is an ideal point and that
all the points of E are obtained in this fashion. Hence taking

ξ(x) = Q(x)

we obtain a mapping of X onto E.

We prove that ξ is 1-1. Let x_1, x_2 ∈ X and x_1 ≠ x_2. Since X is completely
regular there is an f ∈ B(X) such that || f || ≤ 1, f(x_1) = 1 and f(x_2) = 0.
Hence f ∈ Q(x_1) but f ∉ Q(x_2), therefore Q(x_1) ≠ Q(x_2).

We now prove that ξ is a homeomorphism. Since the sets M(f) form a
multiplicative base for the closed sets in X (see Lemma 4.2) and by definition the
M-sets form a similar base for the closed sets in E, we shall prove that ξ is a
homeomorphism by showing that if

ξ(F) = Φ

then F is a set M(f) if and only if Φ is an M-set.

Suppose that F = M(f). It follows from Lemma 5.1 that Q(x) ⊂ SSt(f) if
and only if | f(x) | = 1, i.e. if x ∈ F. Consequently

Φ = {ξ | ξ ⊂ SSt(f)}

and Φ is an M-set.

If f will vary over all of the unit sphere of B(X), F will vary over all M(f)
sets and ξ(F) = Φ will vary over all M-sets in E. Since ξ is 1-1, the proof is
complete.

From Theorem 6.1 and from §2, we obtain 

Theorem 6.2. If X is a completely regular space, then E defined from the
Banach space B(X) is a topological space homeomorphic with the compacting
β(X) of X.

If, in addition, X satisfies the first countability axiom, then X is homeomorphic
with the subset of E consisting of points which have a countable base.





7. Isometric mappings of B(X) onto B(Y) 

Theorem 7.1. Given two topological spaces X and Y, every isometric mapping T
of B(X) onto B(Y) is of the form

(*) Tf = g_0 + g_1 · (T'f)

where g_0, g_1 ∈ B(Y), | g_1 | = 1 and T' is an isometry of B(X) onto B(Y) that is an
algebraic and lattice isomorphism.

Define T_1 f = Tf - T0. Clearly T_1 is an isometry and T_1 0 = 0; hence by a
theorem of S. Mazur and S. Ulam 12 T_1 and its inverse are linear. Consequently
we have T_1 St(f) = St(T_1 f). Consider the function 1 in B(X); since SSt(1)
is the whole surface of the unit sphere it follows that SSt(T_1 1) is the whole
surface of the unit sphere in B(Y) and by Lemma 3.3 | T_1 1 | = 1. If we define
g_0 = T0, g_1 = T_1 1 and T'f = T_1 f / T_1 1 we have (*) and T' is a linear isometry such
that T'0 = 0 and T'1 = 1. All that remains to be proved is that T' preserves
the lattice properties.

Suppose f ≥ 0, then 13

\[ T'f = T'\|f\| + T'f - T'\|f\| = \|f\| - T'(\|f\| - f) \ge \|f\| - \| T'(\|f\| - f) \| = \|f\| - \| \|f\| - f \| \ge 0 \]

and T'f ≥ 0. The same holds for the inverse of T' and therefore T' is a lattice
isomorphism.

Theorem 7.2. Let X and Y be two compact spaces. Every isometric mapping
T of B(X) onto B(Y) is of the form

Tf = g_0 + g_1 · (fh)

where g_0, g_1 ∈ B(Y), | g_1 | = 1 and h is a homeomorphism h(Y) = X.

If T0 = 0 then g_0 = 0 and

Tf = g_1 · (fh).

If T0 = 0 and T1 = 1, then g_0 = 0, g_1 = 1 and

Tf = fh.

The same holds if X and Y are completely regular and satisfy the first countability
axiom; in particular, if X and Y are metric spaces.

In view of the previous theorem we may restrict ourselves to the case when
T0 = 0 and T1 = 1. The isometry T is then linear and preserves all the
lattice properties. In particular

(**) T| f | = | Tf |.

We first discuss the case when X and Y are compact. Let E_X and E_Y be the
corresponding ideal spaces and ξ_X and ξ_Y the homeomorphisms ξ_X(X) = E_X

12 C. R. Paris 194 (1932), pp. 946-948; also Banach loc. cit. p. 168.
13 A real number a is identified here with the function f ∈ B(X) such that f(x) = a for
all x ∈ X.





and ξ_Y(Y) = E_Y. Since E_X and E_Y were defined intrinsically using B(X) and
B(Y) it is clear that T induces a homeomorphism between E_X and E_Y which we
will also denote by T; T(E_X) = E_Y. Define

h(y) = ξ_X^{-1} T^{-1} ξ_Y(y) for y ∈ Y.

Clearly h is a homeomorphism h(Y) = X. It remains to prove that Tf = fh
for all f ∈ B(X).

We shall first prove that

(***) if || f || ≤ 1 then M(f) = h[M(Tf)].

This follows immediately from the remark that ξ_X[M(f)] and ξ_Y[M(Tf)] are the
M-sets

ξ_X[M(f)] = {ξ | ξ ⊂ SSt(f)}
ξ_Y[M(Tf)] = {ξ | ξ ⊂ SSt(Tf)} = {ξ | ξ ⊂ T SSt(f)},

hence T ξ_X[M(f)] = ξ_Y[M(Tf)] which implies (***) in view of the definition of h.

Now let f ∈ B(X), x ∈ X, h(y) = x and f(x) = a. Consider

\[ f_1 = 1 - \frac{| f - a |}{\| f - a \|}. \]

Because of (**) we have

\[ Tf_1 = 1 - \frac{| Tf - a |}{\| Tf - a \|}. \]

Since x ∈ M(f_1) we shall have in view of (***) y ∈ M(Tf_1), hence | (Tf_1)(y) | = 1
which implies (Tf)(y) = a. This shows that Tf = fh which proves the theorem
in the compact case.

If now X and Y are completely regular, we consider the compactings $\beta(X)$
and $\beta(Y)$. Since $B[\beta(X)] = B(X)$ and $B[\beta(Y)] = B(Y)$ we have an isometry
T mapping $B[\beta(X)]$ onto $B[\beta(Y)]$ and hence a homeomorphism $h[\beta(Y)] = \beta(X)$
such that Tf = fh. If X and Y satisfy the first countability axiom, then they
are the sets of points at which the first countability axiom holds in $\beta(X)$ and
$\beta(Y)$ respectively. Therefore h(Y) = X.


8. Discussion of Banach’s proof 

Theorem 7.2 was first established by Banach² for X and Y compact metric.
His proof is based on the concept of a peak function. A function $f \in B(X)$ is
called a peak function with the point $x_0 \in X$ as peak if $|f(x)| < |f(x_0)|$ for all
$x \in X$, $x \neq x_0$. Banach proves that if X is compact metric, then $f \in B(X)$ is a
peak function if and only if $f \neq 0$ and the limit

$\lim_{t \to 0} \dfrac{\|f + t f_1\| - \|f\|}{t}$

exists for every $f_1 \in B(X)$. Having such an intrinsic characterization of peak
functions, it is easy to see that an isometry T[B(X)] = B(Y) such that T0 = 0
must carry peak functions into peak functions, and in this way a homeomorphism
between X and Y can readily be established.
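
Banach's criterion can be tried out numerically on the simplest compact metric space, a finite set, where B(X) is just the space of value lists with the maximum norm. The following is only an illustrative sketch under that assumption; the function names, the sample functions and the step size `t` are hypothetical choices, not taken from the paper.

```python
def sup_norm(f):
    """Norm of B(X) for a finite X: the largest absolute value."""
    return max(abs(v) for v in f)


def difference_quotient(f, f1, t):
    """(||f + t*f1|| - ||f||) / t for functions given as lists of values."""
    shifted = [a + t * b for a, b in zip(f, f1)]
    return (sup_norm(shifted) - sup_norm(f)) / t


def one_sided_limits(f, f1, t=1e-6):
    """Approximate the quotient for t -> 0+ and for t -> 0-."""
    return difference_quotient(f, f1, t), difference_quotient(f, f1, -t)


peak = [1.0, 0.5, -0.25]      # |f| has a strict maximum at the first point
not_peak = [1.0, 1.0, -0.25]  # |f| attains its maximum at two points
direction = [1.0, 0.0, 0.0]   # hypothetical test function f_1

right_peak, left_peak = one_sided_limits(peak, direction)
right_flat, left_flat = one_sided_limits(not_peak, direction)
```

For the peak function the two one-sided quotients agree, so the limit exists; for the function with two maximum points they differ for this choice of $f_1$, so the limit does not exist.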

We propose to replace Banach's characterization of peak functions by the 
following one which seems to be more geometrical and natural: 

Theorem 8.1. Let X be completely regular and countably compact. A function
$f \in B(X)$ such that $\|f\| = 1$ is a peak function if and only if St(f) is convex.

If f is a peak function then there is an $x_0 \in X$ such that

$|f(x_0)| = 1$, $|f(x)| < 1$ for $x \neq x_0$.

Since X is countably compact it follows from Lemma 3.2 that

$\mathrm{St}(f) = \{f_1 \mid f_1 \in B(X),\ \|f_1\| = 1,\ f_1(x_0) = f(x_0)\}$

and this set is clearly convex.

If f is not a peak function, then since X is countably compact f assumes its
maximum and therefore there are at least two distinct points $x_1, x_2 \in X$ such
that

$|f(x_1)| = |f(x_2)| = 1$.

Since X is completely regular there are two open sets $G_1$ and $G_2$ such that

$x_1 \in G_1$, $x_2 \in G_2$ and $G_1 \cap G_2 = 0$

and two functions $f_1, f_2 \in B(X)$ such that $\|f_1\| = \|f_2\| = 1$ and

$f_i(x_i) = f(x_i)$, $f_i(x) = 0$ for $x \in X - G_i$, i = 1, 2.

Clearly $\|f + f_1\| = 2$, $\|f + f_2\| = 2$ and $\|f_1 + f_2\| = 1$ since $X = (X - G_1) \cup
(X - G_2)$. Hence $f_1 \in \mathrm{St}(f)$, $f_2 \in \mathrm{St}(f)$ and $\frac{1}{2}(f_1 + f_2) \notin \mathrm{St}(f)$. Hence St(f)
is not convex.

Having an intrinsic characterization of the peak functions we can carry out
Banach’s proof provided we know that for every $x_0 \in X$ there is a peak function
with $x_0$ as peak. It can be easily seen that this is true if and only if X is
completely regular and every point of X is a $G_\delta$. If in addition X is countably
compact, this last condition becomes equivalent with the first countability
axiom. Hence we see that Banach’s proof can be carried out for completely
regular, countably compact spaces satisfying the first countability axiom.

9. Representations of B(X) as a direct sum 

Given two linear subspaces $B_1$, $B_2$ of a Banach space B we say that B is the
direct sum

$B = B_1 + B_2$

if every $b \in B$ admits a unique decomposition

$b = b_1 + b_2$, $b_1 \in B_1$, $b_2 \in B_2$

and if¹⁴

$\|b\| = \max(\|b_1\|, \|b_2\|)$.

It is easy to prove that if $B = B_1 + B_2$ then $B_1$ and $B_2$ are closed subspaces of B.
Let X be a topological space and let

(*) $X = X_1 \cup X_2$, $X_1 \cap X_2 = 0$, $\bar{X}_1 = X_1$, $\bar{X}_2 = X_2$.

Let $B(X_1)$, $B(X_2)$ be the subspaces of B(X) defined as follows

$B(X_i) = \{f \mid f(x) = 0 \text{ for all } x \notin X_i\}$, i = 1, 2.

We verify easily that $B(X) = B(X_1) + B(X_2)$.

Lemma 9.1. Every decomposition $X = X_1 \cup X_2$ of X into two disjoint closed
sets determines a direct sum decomposition $B(X) = B(X_1) + B(X_2)$.

We shall show that the converse is also true. 

Theorem 9.2. Let X be a topological space. For every decomposition

$B(X) = B_1 + B_2$

as a direct sum there is a decomposition

$X = X_1 \cup X_2$

into disjoint closed sets such that $B_1 = B(X_1)$ and $B_2 = B(X_2)$.

We first assume that X is compact. Since $B(X) = B_1 + B_2$ every $f \in B(X)$
has a unique decomposition

$f = f_1 + f_2$, $f_1 \in B_1$, $f_2 \in B_2$.

Define

$Tf = f_1 - f_2$.

Clearly T is linear and

$\|Tf\| = \|f\|$,

hence T[B(X)] = B(X) is an isometry such that T0 = 0 and because of Theorem
7.2 we have

$Tf = g_1 \cdot (fh)$

where $g_1 \in B(X)$, $|g_1| = 1$, and h is a homeomorphic mapping of X onto itself.

We shall prove that h(x) = x for all $x \in X$. Assume to the contrary that
$h(x_0) = x_1 \neq x_0$ for some $x_0 \in X$. There is then an open set $G \subset X$ such that

$x_0 \in G$, $G \cap h(G) = 0$.


14 The definition of the direct sum essentially depends upon the expression chosen for
the norm. The most commonly used are $\|b\| = (\|b_1\|^p + \|b_2\|^p)^{1/p}$ for $p \geq 1$. The
expression in the text is obtained by letting $p \to \infty$.


Since X is compact it is also completely regular and hence there is an $f \in B(X)$
such that

$f(x_0) = 1$, $f(x) = 0$ for all $x \in X - G$, $\|f\| = 1$.

Since $f = f_1 + f_2$ where $f_1 \in B_1$, $f_2 \in B_2$ and $\|f\| = \max(\|f_1\|, \|f_2\|)$ we may
assume that $\|f_1\| = 1$. Since X is compact there is an $x' \in X$ such that

$|f_1(x')| = 1$.

Since

$|f_1(x') + f_2(x')| = |f(x')| \leq \|f\| = 1$

$|f_1(x') - f_2(x')| = |Tf(x')| \leq \|Tf\| = 1$

we must have $f_2(x') = 0$. Consequently $f(x') = f_1(x') = \pm 1$ and therefore $x' \in G$.
Since $f_1 \in B_1$ we have $Tf_1 = f_1$ and therefore

$f_1 = g_1 \cdot (f_1 h)$.

Consequently $|f_1(h(x'))| = |f_1(x')| = 1$ and by the previous argument
$h(x') \in G$. Hence $G \cap h(G) \neq 0$, a contradiction.

Having proved that h(x) = x for all $x \in X$ we have

$Tf = g_1 \cdot f$ where $|g_1| = 1$.

Define

$X_1 = \{x \mid g_1(x) = 1\}$, $X_2 = \{x \mid g_1(x) = -1\}$.

Clearly $X_1$, $X_2$ are closed and $X = X_1 \cup X_2$. Given $f \in B(X)$ we have $f \in B_1$
if and only if Tf = f; this means that $f = g_1 f$, and this holds if and only if
f(x) = 0 whenever $g_1(x) = -1$. Hence we see that $B_1 = B(X_1)$. Analogously
$B_2 = B(X_2)$.
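
The mechanism of this proof can be followed on a finite discrete space, where every subset is closed and B(X) consists of all real functions with the maximum norm. The sketch below is only an illustration under that assumption; the point names, the set `x1` and the helper functions are hypothetical. It starts from a known splitting of X, forms $Tf = f_1 - f_2$, and recovers $X_1$ from $g_1 = T1$ as in the text.

```python
def sup_norm(f):
    """Maximum norm of a function stored as a dict point -> value."""
    return max(abs(v) for v in f.values())


def decompose(f, x1):
    """Split f into f1 in B(X1) and f2 in B(X2), where X2 = X - X1."""
    f1 = {x: (v if x in x1 else 0.0) for x, v in f.items()}
    f2 = {x: (0.0 if x in x1 else v) for x, v in f.items()}
    return f1, f2


def reflection(f, x1):
    """The map T f = f1 - f2 used in the proof of Theorem 9.2."""
    f1, f2 = decompose(f, x1)
    return {x: f1[x] - f2[x] for x in f}


points = ["a", "b", "c", "d"]  # hypothetical finite discrete space X
x1 = {"a", "c"}                # hypothetical closed part X_1
f = {"a": 2.0, "b": -3.0, "c": 0.5, "d": 1.0}

f1, f2 = decompose(f, x1)
tf = reflection(f, x1)
ones = {x: 1.0 for x in points}
g1 = reflection(ones, x1)      # T applied to the constant 1 gives g_1
recovered_x1 = {x for x in points if g1[x] == 1.0}
```

Here the norm of `f` equals the larger of the norms of `f1` and `f2`, `tf` has the same norm as `f`, and the set where $g_1 = 1$ is exactly the $X_1$ that was used to build the decomposition.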

Next we assume that X is completely regular. Let $\beta(X)$ be the compacting
of X. Since $B(\beta(X)) = B(X)$ we have a direct sum decomposition $B(\beta(X)) =
B_1 + B_2$ and hence there is a decomposition

$\beta(X) = Y_1 \cup Y_2$

into disjoint closed sets such that $B_1 = B(Y_1)$ and $B_2 = B(Y_2)$. Let

$X_1 = X \cap Y_1$, $X_2 = X \cap Y_2$.

From the definition of the compacting it is clear that $Y_1 = \beta(X_1)$ and $Y_2 =
\beta(X_2)$. Hence $B_1 = B(X_1)$, $B_2 = B(X_2)$.

If now X is an arbitrary topological space we consider the completely regular
space $\rho(X)$ defined by Čech.⁸ Again $B(\rho(X)) = B(X)$ and hence we have the
direct sum decomposition $B(\rho(X)) = B_1 + B_2$. Since the theorem is proved
for completely regular spaces there is a decomposition

$\rho(X) = Y_1 \cup Y_2$

into disjoint closed sets such that $B_1 = B(Y_1)$ and $B_2 = B(Y_2)$. From the
definition of $\rho(X)$ it is clear that there is a decomposition

$X = X_1 \cup X_2$

into disjoint closed sets such that $Y_1 = \rho(X_1)$ and $Y_2 = \rho(X_2)$. Consequently
$B_1 = B(X_1)$ and $B_2 = B(X_2)$.

From Theorem 9.2 we obtain the following corollaries: 

Corollary 9.3. X is connected if and only if B(X) is indecomposable as a
direct sum.

Corollary 9.4. X contains at least n components if and only if B(X) is a 
direct sum of n summands. 

Corollary 9.5. X contains an isolated point if and only if B(X) admits the 
straight line as a direct summand. 

University of Michigan. 



Annals of Mathematics
Vol. 43, No. 3, July, 1942 


SIMPLICIAL DECOMPOSITIONS OF BOUNDED FLATNESS¹
(SIMPLIZIALZERLEGUNGEN VON BESCHRÄNKTER FLACHHEIT)

By Hans Freudenthal
(Received July 25, 1941)

We answer a question of Prof. L. E. J. Brouwer concerning a simple
construction of an infinite sequence of arbitrarily fine simplicial decompositions
of a polytope, each of which is a subdivision of the preceding one, and in which
the subsimplexes that occur are not allowed to become arbitrarily flat, i.e. in
which the quotient

$c^r / v$

(c = diameter = greatest edge length of the simplex, v = volume of the
simplex) remains uniformly bounded for all r-dimensional simplexes.
Sequences of decompositions of this kind are needed in analysis and also in the
border area between analysis and topology.

The construction that we give here proceeds similarly to our simplicial
decomposition of the Cartesian product of two simplexes.²

1

Simplexes are here always to be given with a fixed order of the vertices,
parallelotopes always with a distinguished vertex and an ordering of the edges
issuing from this vertex.

Let the simplex T have the vertices

$e_0, \ldots, e_r$.

We can also describe T by the vectors

$x_0 = e_0$, $x_i = e_i - e_{i-1}$ (i = 1, ..., r).

Let $\pi$ be a permutation $p_1, \ldots, p_r$ of the numbers 1, ..., r. The points

$e_i^\pi = x_0 + \sum_{\nu=1}^{i} x_{p_\nu}$ (i = 0, ..., r)

(as vertices) generate a simplex $T^\pi$. If $\pi_0$ is the identity permutation of the
numbers 1, ..., r, then $T^{\pi_0} = T$. The r! simplexes $T^\pi$ are called conjugate to
one another.

$T^\pi$ is the totality of the points

$T^\pi$: $\sum_{\nu=0}^{r} \lambda_\nu e_\nu^\pi$ with $\lambda_\nu \geq 0$, $\sum_{\nu=0}^{r} \lambda_\nu = 1$,

or, what amounts to the same,

$T^\pi$: $x_0 + \sum_{\nu=1}^{r} \alpha_\nu x_{p_\nu}$, $1 \geq \alpha_1 \geq \cdots \geq \alpha_r \geq 0$.

From this it follows: the $T^\pi$ form a simplicial partition of the parallelotope P,

P: $x_0 + \sum_{\nu=1}^{r} \alpha_\nu x_\nu$, $0 \leq \alpha_\nu \leq 1$.

Conversely, the parallelotope P, of which the vertex $x_0$ and the edge vectors
$x_1, \ldots, x_r$ are given, together with the permutation $\pi$, uniquely determines
the simplex $T^\pi$.

1 This note agrees in essentials with a note submitted in March 1939 to Fundamenta
Mathematicae, which did not appear there any more. I have already used the result
elsewhere [Proceedings Amsterdam 42 (1939), 880-901].

2 Fund. Math. 29 (1937), 138-144.
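
The partition of a parallelotope into the r! conjugate simplexes can be checked by direct computation. The sketch below assumes the simplest case, $x_0 = 0$ and $x_i$ the i-th unit vector, so that P is the unit cube; the function names are hypothetical and permutations are written with indices starting at 0. It builds every $T^\pi$, verifies that the volumes add up to the volume of the cube, and finds the simplex containing a given point by sorting its coordinates in decreasing order, which is the condition $1 \geq \alpha_1 \geq \cdots \geq \alpha_r \geq 0$.

```python
from fractions import Fraction
from itertools import permutations
from math import factorial


def conjugate_simplex(perm, r):
    """Vertices e_0, ..., e_r of T^pi in the unit cube (x_0 = 0, x_i = unit vectors)."""
    vertices = [[0] * r]
    for p in perm:
        nxt = list(vertices[-1])
        nxt[p] = 1  # add the edge vector x_p
        vertices.append(nxt)
    return vertices


def determinant(rows):
    """Exact determinant by Gaussian elimination over the rationals."""
    m = [[Fraction(x) for x in row] for row in rows]
    n = len(m)
    det = Fraction(1)
    for col in range(n):
        pivot = next((i for i in range(col, n) if m[i][col] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != col:
            m[col], m[pivot] = m[pivot], m[col]
            det = -det
        det *= m[col][col]
        for i in range(col + 1, n):
            factor = m[i][col] / m[col][col]
            for j in range(col, n):
                m[i][j] -= factor * m[col][j]
    return det


def volume(vertices):
    """Volume of the simplex with the given r + 1 vertices in r-space."""
    base = vertices[0]
    edges = [[v[k] - base[k] for k in range(len(base))] for v in vertices[1:]]
    return abs(determinant(edges)) / factorial(len(base))


def containing_permutation(point):
    """A permutation pi with point in T^pi: coordinates in decreasing order."""
    return tuple(sorted(range(len(point)), key=lambda k: -point[k]))


r = 3
simplexes = {perm: conjugate_simplex(perm, r) for perm in permutations(range(r))}
total_volume = sum(volume(v) for v in simplexes.values())
```

For r = 3 this produces six simplexes of volume 1/6 each; a point such as (0.2, 0.9, 0.5) lies in the simplex belonging to the order "second, third, first coordinate".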


2

From the parallelotope P there arise, by halving transversally to all edges,
the parallelotopes

$P_\sigma$: $x_{\sigma,0} + \frac{1}{2} \sum_{\nu=1}^{r} \alpha_\nu x_\nu$, $0 \leq \alpha_\nu \leq 1$;

here $\sigma$ is a system

$s_1, \ldots, s_r$

of numbers $s_\nu$,

$s_\nu = 0$ or 1,

and

$x_{\sigma,0} = x_0 + \frac{1}{2} \sum_{\nu=1}^{r} s_\nu x_\nu$.

In the same way as P into the simplexes $T^\pi$, each $P_\sigma$ is decomposed into the
simplexes $(T_\sigma)^\pi$. $T^\pi$ and $(T_\sigma)^\pi$ are homothetic (1:2).

Let $\pi$ again be the permutation $p_1, \ldots, p_r$ of 1, ..., r and let $\pi^*$ be the
permutation $p_1^*, \ldots, p_r^*$ that arises when one first (in the order given by $\pi$)
picks out all $p_\nu$ with $s_{p_\nu} = 1$, and then all with $s_{p_\nu} = 0$, thus:

for $s_a > s_b$, a comes before b in $\pi^*$,

for $s_a = s_b$ and a before b in $\pi$, a also comes before b in $\pi^*$.

We claim: $(T_\sigma)^\pi$ lies entirely in $T^{\pi^*}$.
Indeed,

$(T_\sigma)^\pi$: $x_0 + \frac{1}{2} \sum_{\nu=1}^{r} s_\nu x_\nu + \frac{1}{2} \sum_{\nu=1}^{r} \alpha_\nu x_{p_\nu}$

$= x_0 + \frac{1}{2} {\sum}' (\alpha_\nu + 1) x_{p_\nu} + \frac{1}{2} {\sum}'' \alpha_\nu x_{p_\nu}$ ($1 \geq \alpha_1 \geq \cdots \geq \alpha_r \geq 0$)

(where ${\sum}'$ is to be extended over all $\nu$ with $s_{p_\nu} = 1$ and ${\sum}''$ over all $\nu$ with
$s_{p_\nu} = 0$)

$= x_0 + \sum_{\nu=1}^{r} \beta_\nu x_{p_\nu^*}$ ($1 \geq \beta_1 \geq \cdots \geq \beta_u \geq \frac{1}{2} \geq \beta_{u+1} \geq \cdots \geq \beta_r \geq 0$)

(where u is the number of ones among the $s_\nu$). From this our claim follows.

It follows further: the $(T_\sigma)^\pi$ with $\pi^* = \tau$ form a simplicial decomposition
$Z(T^\tau)$ of $T^\tau$.



3

Let $e_{ij}^\pi$ be the midpoint of the segment $e_i^\pi e_j^\pi$, $e_{ii}^\pi = e_i^\pi$,

$e_{ij}^\pi = \frac{1}{2}(e_i^\pi + e_j^\pi) = x_0 + \sum_{\nu=1}^{i} x_{p_\nu} + \frac{1}{2} \sum_{\nu=i+1}^{j} x_{p_\nu}$ ($i \leq j$).

Among the $s_{p_\nu}$ with $\nu \leq k$ let there be k' ones and k'' zeros.

The k-th vertex of $(T_\sigma)^\pi$ is obtained for $\alpha_1 = \cdots = \alpha_k = 1$, $\alpha_{k+1} = \cdots = \alpha_r = 0$;
it therefore reads

$x_0 + \sum_{\nu=1}^{i} x_{p_\nu^*} + \frac{1}{2} \sum_{\nu=i+1}^{j} x_{p_\nu^*} = e_{ij}^{\pi^*}$ (i = k', j = k'' + u).

From this it follows for every subsimplex $(T_\sigma)^\pi$ of $T^{\pi^*}$:

the 0-th vertex is $e_{0u}^{\pi^*}$;

the vertex $e_{ij}^{\pi^*}$ is followed by the vertex $e_{i+1,j}^{\pi^*}$ or the vertex $e_{i,j+1}^{\pi^*}$;

the first index is always $\leq u$, the second $\leq r$.

Conversely, the subsimplexes $(T_\sigma)^\pi$ of $T^{\pi^*}$ are also characterized by these
properties.

4

We now consider only $T = T^{\pi_0}$ and omit the index $\pi_0$ in what follows.
We can then also describe the partition Z(T) of T as follows:

The vertices of Z(T) are the $e_{ij}$;

$e_{ij}$ and $e_{i'j'}$ form a one-dimensional simplex if and only if the pairs ij and
i'j' do not separate one another;

a set of $e_{ij}$ forms a simplex of Z(T) if and only if its elements, taken two at a
time, form simplexes of Z(T);

in the simplexes of Z(T) the vertices are ordered by increasing i + j.

We remark further:

The simplexes of Z(T) are similar to the conjugates of T.

Now let a finite polytope R be given. We fix an order of the vertices and form
the partition Z(R) by applying the process Z to each simplex of R. We order the
vertices of Z(R) lexicographically. By repeating the process Z indefinitely we
obtain a sequence of subdivisions. All simplexes occurring in it are similar to
the (finitely many) conjugates of the simplexes of R, and since similar simplexes
have the same flatness

$c^r / v$,

the flatness remains, as we wished, uniformly bounded.
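
One step of the process Z can be carried out explicitly for a single simplex. The sketch below assumes the special simplex T with vertices $e_k = (1, \ldots, 1, 0, \ldots, 0)$ (k ones) in r-space, i.e. $x_0 = 0$ and unit edge vectors; all function names are hypothetical. The subsimplexes are generated from the chain description of section 3: start at $e_{0u}$, raise the first or the second index by one at each step, keeping the first index at most u and the second at most r. Flatness is compared through its exact square $(c^r/v)^2$ to avoid square roots.

```python
from fractions import Fraction
from itertools import combinations
from math import factorial


def midpoint_vertex(i, j, r):
    """e_ij = (e_i + e_j) / 2, where e_k has k leading ones (requires i <= j)."""
    return [Fraction(1) if k < i else Fraction(1, 2) if k < j else Fraction(0)
            for k in range(r)]


def index_chains(r):
    """Index chains of Z(T): start at (0, u), raise i or j, keep i <= u and j <= r."""
    chains = []
    for u in range(r + 1):
        stack = [[(0, u)]]
        while stack:
            chain = stack.pop()
            i, j = chain[-1]
            if len(chain) == r + 1:
                chains.append(chain)
                continue
            if i < u:
                stack.append(chain + [(i + 1, j)])
            if j < r:
                stack.append(chain + [(i, j + 1)])
    return chains


def determinant(rows):
    """Exact determinant by Gaussian elimination over the rationals."""
    m = [[Fraction(x) for x in row] for row in rows]
    n = len(m)
    det = Fraction(1)
    for col in range(n):
        pivot = next((i for i in range(col, n) if m[i][col] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != col:
            m[col], m[pivot] = m[pivot], m[col]
            det = -det
        det *= m[col][col]
        for i in range(col + 1, n):
            factor = m[i][col] / m[col][col]
            for j in range(col, n):
                m[i][j] -= factor * m[col][j]
    return det


def volume(vertices):
    """Volume of the simplex with the given r + 1 vertices in r-space."""
    base = vertices[0]
    edges = [[v[k] - base[k] for k in range(len(base))] for v in vertices[1:]]
    return abs(determinant(edges)) / factorial(len(base))


def flatness_squared(vertices):
    """(c^r / v)^2 computed exactly; c^2 is the largest squared edge length."""
    r = len(vertices) - 1
    c2 = max(sum((a - b) ** 2 for a, b in zip(p, q))
             for p, q in combinations(vertices, 2))
    return c2 ** r / volume(vertices) ** 2


r = 3
original = [midpoint_vertex(k, k, r) for k in range(r + 1)]  # the simplex T
pieces = [[midpoint_vertex(i, j, r) for i, j in chain] for chain in index_chains(r)]
```

For r = 3 the subdivision consists of $2^r = 8$ subsimplexes whose volumes add up to the volume of T, and every piece has the same flatness as T. This equality of flatness is special to the chosen T, all of whose conjugates are congruent; in general only the boundedness asserted in the text can be expected.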


Amsterdam 



Annals of Mathematics 
Vol. 43, No. 3, July, 1942 


A SIMPLIFIED PROOF FOR THE RESOLUTION OF SINGULARITIES OF 

AN ALGEBRAIC SURFACE 

By Oscar Zariski 
(Received March 19, 1942) 

1. Introduction 

We presuppose the theorem of local uniformization, as proved elsewhere.¹
By this theorem, any valuation of a field $\Sigma/K$ of algebraic functions of r
independent variables, over a given ground field K of characteristic zero, can be
“uniformized” on a suitable projective model V of $\Sigma$, i.e. the center of the
valuation on a suitable model V will be a simple subvariety of V.² If the
ground field K (the field of coefficients) is the field of complex numbers (the
classical case), then the above result, in conjunction with the bicompactness of
the Riemann manifold M of $\Sigma/K$,³ implies the existence of a finite set of models
of $\Sigma$, say

$V_1, V_2, \ldots, V_n$,

such that any zero-dimensional valuation of $\Sigma/K$ is uniformized on at least one
of the models $V_i$ of the set.⁴ Any finite set of projective models with the above
property shall be called a resolving system of the Riemann manifold M.

In the case of abstract varieties, where we cannot fall back on topology, it is 
necessary to give an algebraic proof of the existence of resolving systems of M. 
In the special case of algebraic surfaces the algebraic proof of the existence of 
resolving systems is strikingly simple (see section 6 of this paper). The general 
case of algebraic varieties will be treated in a subsequent paper. 

The theorem of the resolution of singularities of an algebraic variety can be
formulated in terms of resolving systems, as follows: There exists a resolving
system of the Riemann manifold M which consists of only one model. In view of
the existence of resolving systems, this theorem will be established if it can be
proved that the existence of a resolving system of M consisting of n models (n > 1)
implies the existence of a resolving system of M consisting of n - 1 models. To
carry out this induction from n to n - 1, it is sufficient to prove the fundamental
theorem which we are now going to state.

1 For the case of algebraic surfaces, see our paper “The reduction of singularities of an
algebraic surface”, Annals of Mathematics, vol. 40 (July, 1939), pp. 639-689. For the
general case of varieties of any dimension, see our paper “Local uniformization on algebraic
varieties”, Annals of Mathematics, vol. 41 (October, 1940), pp. 852-896. These two papers
will be referred to respectively as “Reduction” and “Uniformization”.

2 For the definition of the center of a valuation see “Uniformization”, p. 857.

3 M is the totality of all zero-dimensional valuations (or places) of $\Sigma/K$; see
“Uniformization”, p. 855.

4 It follows then necessarily that any valuation (whether of dimension zero or greater
than zero) will be uniformized on at least one of the models $V_i$. To see this, it is only
necessary to observe that: a) if B is any valuation of dimension > 0, then there exist
zero-dimensional valuations compounded with B; b) if $B_0$ is such a zero-dimensional
valuation then on every model V of $\Sigma$ the center of $B_0$ will lie on the center of B; c) if an
irreducible subvariety W of V contains a simple point, then W itself is simple.

Let N be an arbitrary subset of M, i.e. let N be an arbitrary set of places of
our field $\Sigma/K$. In the same fashion as we have defined resolving systems of M,
we can define resolving systems of N. A resolving system of N will therefore
be any finite set of models of $\Sigma$ such that any valuation in N has a simple
center on at least one of the models in the set. In particular, if a resolving
system of N consists of only one model, this model shall be called a resolving
model for N.

Fundamental Theorem. If N is an arbitrary subset of M and if there exists
a resolving system of N consisting of two models, then there also exists a resolving
model for N.

From the fundamental theorem, the theorem on the resolution of singularities
follows immediately. For, let $V_1, \ldots, V_n$ be a resolving system of M. Let
N be the subset of M consisting of those valuations which have a singular
center on each of the models $V_1, \ldots, V_{n-2}$. It is clear that the pair $V_{n-1}$,
$V_n$ constitutes a resolving system for N. If we assume the fundamental
theorem, then there exists a resolving model $V^*_{n-1}$ for N. The n - 1 models
$V_1, \ldots, V_{n-2}, V^*_{n-1}$ constitute a resolving system for M, and our induction
from n to n - 1 is complete.

The aim of this paper is to give a proof of the fundamental theorem in the 
case of algebraic surfaces. Our present proof for the resolution of singularities 
is much simpler than our earlier proof (see “Reduction”) and is also more 
general, since at present we do not assume that the ground field K is algebraically 
closed. In the course of the proof we have to make use of certain properties 
of fundamental loci of birational correspondences. This preliminary material, 
strictly confined to the precise needs of our proof, is presented in the next 
section. In a forthcoming paper dealing with the general theory of birational 
correspondences we study these properties systematically. A brief account of 
this general theory will be found in my address “Normal varieties and
birational correspondences” (delivered before the annual meeting of the Society
in Bethlehem in 1941) which has appeared in the Bulletin of the American
Mathematical Society, Vol. 48, No. 6 (June 1942).

I. Preliminary Concepts 

2. Birational correspondences 

Let V and V' be two models of our field $\Sigma/K$.

Definition. Two points P, P' of V and V' respectively are corresponding
points if there exists a valuation of $\Sigma/K$ such that its center on V is the point P
and its center on V' is the point P'.⁵

5 Compare with p. 666 of “Reduction”. Our study of the general theory of birational
correspondences is based on this valuation-theoretic definition.


We say that P is a fundamental point of the birational correspondence between
V and V', if there exists a corresponding point P' on V' such that $Q(P) \not\supseteq Q(P')$
(here Q(P) denotes the quotient ring of P).

Suppose that there exists a point P' which corresponds to P and which is
such that $Q(P') \subseteq Q(P)$. If v is a valuation of centers P and P' on V and V'
respectively and if $\mathfrak{P}$ denotes the prime ideal of v, then $\mathfrak{P} \cap Q(P) = \mathfrak{p}$ and
$\mathfrak{P} \cap Q(P') = \mathfrak{p}'$, where $\mathfrak{p}$ ($\mathfrak{p}'$) is the ideal of non-units in Q(P) (Q(P')). Hence
$\mathfrak{p} \cap Q(P') = \mathfrak{p}'$, and from this it follows that any valuation whose center on V
is P has P' as center on V', i.e. P' is the only point of V' which corresponds to P.
Therefore, if P is a point of V which is not fundamental, then to P there
corresponds a unique point P' on V', and we have $Q(P') \subseteq Q(P)$. On the other
hand, if P is fundamental, then it follows that $Q(P) \not\supseteq Q(P')$ for any point
P' which corresponds to P.

In the case of surfaces we have proved elsewhere that if V and V' are normal
surfaces and if to P there corresponds a finite number of points on V', then P
is not fundamental.⁶ Therefore, if P is fundamental, the locus of corresponding
points P' is a variety of dimension 1.⁷

If neither V nor V' carries fundamental points, then the birational
correspondence between V and V' shall be called regular.

In any case, the number of fundamental points on either surface is always
finite.

Let $\xi_0^*, \xi_1^*, \ldots, \xi_n^*$ be the homogeneous coordinates of the general point of V,
and let $\eta_0^*, \eta_1^*, \ldots, \eta_m^*$ be the homogeneous coordinates of the general point⁸
of V'. The (n + 1)(m + 1) products

$\omega_{ij}^* = \xi_i^* \eta_j^*$

can be regarded as homogeneous coordinates of the general point of a variety
V* which is birationally equivalent to V and to V': for the quotient of any two
$\omega^*$'s, say $\omega_{ij}^*/\omega_{\alpha\beta}^*$, is equal to $\xi_i^*/\xi_\alpha^* \cdot \eta_j^*/\eta_\beta^*$, and is therefore an element of the
field $\Sigma$. Moreover, the non-homogeneous coordinates of the general point of
V*, i.e. the quotients of the $\omega_{ij}^*$ by a fixed $\omega^*$, say by $\omega_{00}^*$, generate the field
$\Sigma/K$, since the quotients $\xi_i^*/\xi_0^*$ (and also the quotients $\eta_j^*/\eta_0^*$) are among them.
The variety V* is called the join of V and V'. It is clear that the ring
$K[\omega_{10}^*/\omega_{00}^*, \ldots, \omega_{nm}^*/\omega_{00}^*]$ of polynomials in these non-homogeneous coordinates
(briefly: the ring of non-homogeneous coordinates) coincides with the ring
$K[\xi_1^*/\xi_0^*, \ldots, \xi_n^*/\xi_0^*, \eta_1^*/\eta_0^*, \ldots, \eta_m^*/\eta_0^*]$ and is therefore the join of the
corresponding rings of non-homogeneous coordinates of the general points of V
and V'. From this one concludes immediately that the birational correspondence
between V* and either one of the two given models V, V' has no fundamental
points on V*.

6 See “Reduction”, p. 688, Theorem 5, but the proof is much simpler and is as follows.
If $P_1', \ldots, P_h'$ are the points on V' which correspond to P, then for any valuation v whose
center is P it must be true that the valuation ring of v contains one of the h quotient rings
$Q(P_i')$, whence it also contains the intersection $\mathfrak{J}$ of these quotient rings. Since Q(P) is
integrally closed, it is the intersection of the above mentioned valuation rings. Hence
$Q(P) \supseteq \mathfrak{J}$. If $\mathfrak{p} \cap \mathfrak{J} = \bar{\mathfrak{p}}$, then $\bar{\mathfrak{p}}$ is one of the h prime maximal ideals of $\mathfrak{J}$, say the one
relative to the point $P_1'$, and it is clear that any valuation of center P will have center $P_1'$
on V', i.e. h = 1. [For the definition of normal varieties, see our paper “Some results in
the arithmetic theory of algebraic varieties”, p. 282, American Journal of Mathematics,
vol. 61 (April 1939). This paper will be referred to as “Results”.]

7 See “Reduction”, p. 667.

8 For the concept of homogeneous coordinates of the general point of an irreducible
algebraic variety, see “Results” p. 284.

It follows that each point P* of V* represents a unique pair (P, P') of
corresponding points of V and V'. Conversely, each such pair (P, P') is represented
by at least one point P* of the join V*. For this reason the join V* is often
referred to in the literature as the variety of pairs of corresponding points of
V and V'. However, this last term may be misleading when we deal with a
ground field K which is not algebraically closed. For in this case (and only in
this case) it may happen that one and the same pair (P, P') of corresponding
points of V and V' is represented by (or, we may say, splits into) more than
one point of V*.

The join of any finite number of models $V_1, \ldots, V_h$ can be defined as the
join of $V_h$ and of the join of $V_1, \ldots, V_{h-1}$. It is seen immediately that this
definition is symmetric in $V_1, \ldots, V_h$.

3. Quadratic transformations 

Let P be a point of V and let $\mathfrak{p}^*$ be the corresponding prime homogeneous
ideal (of projective dimension zero) in the ring $K[\xi_0^*, \xi_1^*, \ldots, \xi_n^*]$. Let $\eta_0^*,
\eta_1^*, \ldots, \eta_m^*$ be a set of forms in $\xi_0^*, \xi_1^*, \ldots, \xi_n^*$, of like degree, such that the
ideal $(\eta_0^*, \eta_1^*, \ldots, \eta_m^*)$ differs from $\mathfrak{p}^*$ only by an irrelevant primary component,
i.e. a component belonging to the irrelevant prime ideal $\mathfrak{p}_0^* = (\xi_0^*, \xi_1^*, \ldots, \xi_n^*)$.⁹
By a quadratic transformation of center P we mean the birational transformation
which carries V into the variety $\bar{V}$ whose general point has the following
homogeneous coordinates:

(1) $\omega_{ij}^* = \xi_i^* \eta_j^*$, i = 0, 1, ..., n; j = 0, 1, ..., m.

This transformation depends of course on the choice of the forms $\eta_j^*$, but in a
non-essential fashion. Namely, if $\bar{V}_1$ is the transform of V by a quadratic
transformation, relative to another set of forms, then $\bar{V}$ and $\bar{V}_1$ are in regular
birational correspondence.

Proof. Let $\zeta_0^*, \zeta_1^*, \ldots, \zeta_\mu^*$ be the set of forms which defines $\bar{V}_1$, and let
$\bar{\omega}_{ij}^* = \xi_i^* \zeta_j^*$. Let: $\alpha$ = degree $\eta_j^*$, $\beta$ = degree $\zeta_j^*$, $\mathfrak{A} = (\eta_0^*, \ldots, \eta_m^*)$,
$\mathfrak{B} = (\zeta_0^*, \ldots, \zeta_\mu^*)$. Since the ideals $\mathfrak{A}$ and $\mathfrak{B}$ differ only in their primary
irrelevant components, we will have for sufficiently high integers a and b:
$\mathfrak{A} \cdot \mathfrak{p}_0^{*a} \equiv 0(\mathfrak{B})$, $\mathfrak{B} \cdot \mathfrak{p}_0^{*b} \equiv 0(\mathfrak{A})$. Select a and b so as to have $\alpha + a = \beta + b$.

The elements $\omega_{ij}^*$ constitute a linear base for the forms of degree $\alpha + 1$ which
belong to $\mathfrak{A}$. If we multiply the $\omega_{ij}^*$ by $\xi_0^*, \ldots, \xi_n^*$, then the products give a
base for the forms of degree $\alpha + 2$ in $\mathfrak{A}$. These products are the homogeneous
coordinates of the general point of the join $V^*$ of V and $\bar{V}$. Since, as will be
pointed out later on, the quadratic transformation has no fundamental points
on $\bar{V}$, it follows that $\bar{V}$ and $V^*$ are in regular birational correspondence. We
now multiply the homogeneous coordinates of the general point of $V^*$ by
$\xi_0^*, \ldots, \xi_n^*$, getting the join of $V^*$ and V. Proceeding in this fashion we
construct a model $\bar{V}^*$, such that: a) the homogeneous coordinates of the general
point of $\bar{V}^*$ constitute a linear base for the forms of degree $\alpha + a$ in $\mathfrak{A}$; i.e., they
are elements of $\mathfrak{A}\mathfrak{p}_0^{*a}$, and consequently also of $\mathfrak{B}$; b) $\bar{V}^*$ and $\bar{V}$ are in regular
birational correspondence. In a similar fashion we construct a model $\bar{V}_1^*$ in
regard to $\mathfrak{B}$, $\bar{V}_1$ and the integer $\beta + b$. Since $\alpha + a = \beta + b$, it follows
immediately that the homogeneous coordinates of the general point of $\bar{V}^*$ are linearly
dependent on the homogeneous coordinates of the general point of $\bar{V}_1^*$, and
vice versa. Consequently, $\bar{V}^*$ and $\bar{V}_1^*$ are related projectively to each other,
q.e.d.

9 Let $\mathfrak{p}^* = (\phi_0, \phi_1, \ldots, \phi_s)$, where the $\phi_i$ are forms in the $\xi^*$'s, and let $\nu_i$ = degree of $\phi_i$,
$\nu = \max(\nu_0, \nu_1, \ldots, \nu_s)$. Then the forms $\xi_j^{*\,\nu - \nu_i} \phi_i$, j = 0, 1, ..., n, i = 0, 1, ..., s,
satisfy the desired condition.

If K is algebraically closed, then it is permissible to assume that the
coordinates of P (elements of K) are 1, 0, 0, ..., 0. Then $\mathfrak{p}^* = (\xi_1^*, \ldots, \xi_n^*)$, and
our quadratic transformation is given by the equations: $\omega_{ij}^* = \xi_i^* \xi_j^*$, i = 0,
1, ..., n; j = 1, 2, ..., n. This is the ordinary quadratic transformation
defined by the system of hyperquadrics passing through the point P (see
“Reduction,” p. 676).

Since P is the only point of V at which all the forms $\xi_i^* \eta_j^*$ vanish
simultaneously, P is the only fundamental point of the quadratic transformation. To
any other point A of V there corresponds a unique point $\bar{A}$ on $\bar{V}$, and moreover
Q(A) coincides with $Q(\bar{A})$. The transformation has no fundamental points
on $\bar{V}$. The proofs of all these assertions are straightforward and do not differ
essentially from the proofs in the case of an algebraically closed ground field
(see “Reduction,” p. 676-679).

To see what happens to the point P, we pass to non-homogeneous coordinates:
$\xi_i = \xi_i^*/\xi_0^*$ ($i \neq 0$), and $\omega_{ij} = \omega_{ij}^*/\omega_{00}^*$ (i, j not both zero). Let $\eta_j$ denote the
non-homogeneous polynomial obtained from the form $\eta_j^*$ by the usual process
(replace $\xi_0^*$ by 1 and $\xi_i^*$ by $\xi_i$, $i \neq 0$). The ring of non-homogeneous
coordinates for V, resp. $\bar{V}$, will be: $\mathfrak{o} = K[\xi_1, \ldots, \xi_n]$ and

$\bar{\mathfrak{o}} = K[\xi_1, \ldots, \xi_n, \eta_1/\eta_0, \ldots, \eta_m/\eta_0]$

respectively. The point P will be given by the prime ideal $\mathfrak{p} = (\eta_0, \ldots, \eta_m)$
in $\mathfrak{o}$. Since the ideal $\bar{\mathfrak{o}}\mathfrak{p}$ is the principal ideal $\bar{\mathfrak{o}} \cdot \eta_0$, we conclude immediately
that the transform T(P) of the point P is a pure (r - 1)-dimensional subvariety
of $\bar{V}$ (not necessarily irreducible).¹⁰


10 If $\Gamma$ is an irreducible subvariety of $\bar{V}$, at finite distance with respect to the
non-homogeneous coordinates $\omega_{ij}$, $\Gamma$ is given by a prime ideal $\bar{\mathfrak{p}}$ in the ring $\bar{\mathfrak{o}}$. If $\Gamma$ corresponds to the


4. Quadratic transformations with simple center. The p-adic divisor 

We are now especially interested in the case in which P is a simple point.
Let $\mathfrak{J}$ be the quotient ring of P and let $\mathfrak{P}$ denote the ideal of non-units in $\mathfrak{J}$.
Let $t_1, \ldots, t_r$, $t_i \in \mathfrak{J}$, be uniformizing parameters of P, i.e. a set of r elements
in $\mathfrak{J}$ with the property $\mathfrak{J} \cdot (t_1, \ldots, t_r) = \mathfrak{P}$.¹¹

A significant property of the uniformizing parameters is the following: if
$\phi(t_1, \ldots, t_r) = 0$, where $\phi$ is a polynomial in $t_1, \ldots, t_r$ with coefficients in $\mathfrak{J}$,
and if $\phi_\rho(t_1, \ldots, t_r)$ is the sum of terms of $\phi$ of lowest degree $\rho$ ($\phi_\rho$ = the leading
form of $\phi$), then the coefficients of $\phi_\rho$ must all be divisible by $\mathfrak{P}$.¹² This property
enables us to construct a valuation of $\Sigma$ in the following fashion. If $\alpha \in \mathfrak{J}$ and
if $\alpha$ is exactly divisible by $\mathfrak{P}^\rho$, then $\alpha = \phi_\rho(t_1, \ldots, t_r)$, where $\phi_\rho$ is a form of
degree $\rho$ with coefficients in $\mathfrak{J}$ but not all in $\mathfrak{P}$. Let $\beta$ be another element of $\mathfrak{J}$,
exactly divisible by $\mathfrak{P}^\sigma$, so that $\beta = \psi_\sigma(t_1, \ldots, t_r)$. We have $\alpha\beta \equiv 0(\mathfrak{P}^{\rho+\sigma})$,
and since the coefficients of the form $\phi_\rho\psi_\sigma$ are not all in $\mathfrak{P}$ we conclude, by the
property of the uniformizing parameters stated above, that $\alpha\beta$ is not divisible
by $\mathfrak{P}^{\rho+\sigma+1}$. This shows that if we put $v(\alpha) = \rho$, we get a discrete valuation B
of $\Sigma$, of rank 1. We shall call B the p-adic divisor of center P (P a simple
point!).
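
The additivity $v(\alpha\beta) = v(\alpha) + v(\beta)$ can be observed in the simplest simple point, the origin of the affine plane, with uniformizing parameters $t_1, t_2$. The sketch below is only an illustration under simplifying assumptions: it works with polynomials with integer coefficients instead of the full quotient ring, stores them as dictionaries, and takes $v$ to be the lowest total degree; the function names and sample polynomials are hypothetical.

```python
def multiply(p, q):
    """Product of polynomials stored as {(i, j): coefficient} for t1^i * t2^j."""
    out = {}
    for (a, b), c in p.items():
        for (d, e), k in q.items():
            key = (a + d, b + e)
            out[key] = out.get(key, 0) + c * k
    return {m: c for m, c in out.items() if c != 0}


def order(p):
    """v(p): the lowest total degree of a term of the nonzero polynomial p."""
    return min(i + j for (i, j) in p)


def leading_form(p):
    """The sum of the terms of lowest total degree."""
    rho = order(p)
    return {m: c for m, c in p.items() if sum(m) == rho}


alpha = {(1, 0): 1, (0, 2): -1, (3, 0): 4}  # t1 - t2^2 + 4 t1^3
beta = {(1, 1): 2, (0, 2): 1, (2, 2): -5}   # 2 t1 t2 + t2^2 - 5 t1^2 t2^2
product = multiply(alpha, beta)
```

Here `alpha` has order 1 and `beta` has order 2; the leading form of the product is the product of the leading forms, so it does not vanish and the order of the product is 3.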

It is not difficult to see that B is of dimension r - 1, i.e. B is a divisor. For,
we have $v(t_i) = 1$, $v(t_i/t_1) = 0$. Were the B-residues of $t_2/t_1, \ldots, t_r/t_1$
algebraically dependent over K, there would exist a form $\phi_\rho(t_1, \ldots, t_r)$, with
coefficients in K, not all zero, such that $v(\phi_\rho) > \rho$. This would imply $\phi_\rho \equiv
0(\mathfrak{P}^{\rho+1})$, which is impossible.

The following theorem puts into evidence the effect of a quadratic
transformation of center P in regard to our p-adic divisor:

Theorem 1. If $\bar{V}$ is the transform of V under a quadratic transformation T
of simple center P, then the p-adic divisor of center P is of the first kind with respect
to $\bar{V}$, i.e. the center of the divisor on $\bar{V}$ is (r - 1)-dimensional. This center
coincides with the transform T(P) (whence T(P) is irreducible).

Proof. We use the notations of the preceding section. We assume that the
variety T(P) is not entirely at infinity, i.e. that the principal ideal $\bar{\mathfrak{o}} \cdot \eta_0$ is not
the unit ideal (see footnote 10).

point P, then we must have $\bar{\mathfrak{p}} \cap \mathfrak{o} = \mathfrak{p}$ (see footnote 7). Hence $\bar{\mathfrak{o}}\mathfrak{p} \equiv 0(\bar{\mathfrak{p}})$, i.e. $\Gamma$ lies on the
subvariety W of $\bar{V}$ defined by the principal ideal $\bar{\mathfrak{o}} \cdot \eta_0$. Conversely, if $\Gamma$ is on W, then
$\bar{\mathfrak{o}} \cdot \mathfrak{p} \equiv 0(\bar{\mathfrak{p}})$, whence $\bar{\mathfrak{p}} \cap \mathfrak{o} = \mathfrak{p}$ (since $\mathfrak{p}$ is a maximal ideal of $\mathfrak{o}$), and therefore $\Gamma$
corresponds to P. This shows that T(P) consists of W and perhaps of other irreducible
components at infinity. Since any irreducible component of T(P) can be assumed to be at
finite distance, for a proper choice of the non-homogeneous coordinates (if $\omega_{\alpha\beta}^*$ is different
from zero on the given component, we use the quotients $\omega_{ij}^*/\omega_{\alpha\beta}^*$), and since, if the principal
ideal $\bar{\mathfrak{o}} \cdot \eta_0$ is not the unit ideal, all its isolated prime ideals are (r - 1)-dimensional, our
assertion follows.

11 See our paper “Algebraic varieties over ground fields of characteristic zero”, American
Journal of Mathematics, vol. 62 (January, 1940), p. 199.

12 See loc. cit. in footnote 11, p. 202 and p. 207-208. As a consequence of this property, the
quotient ring of a simple point (and also, more generally, the quotient ring of a simple
subvariety) is a “p-Reihenring” in the sense of Krull (see Krull, Dimensionstheorie in
Stellenringen, Crelle’s Journal, vol. 179 (1938)).



RESOLUTION OF SINGULARITIES OF ALGEBRAIC SURFACE 




To prove the theorem it will be sufficient to prove that the ideal $\bar{\mathfrak{J}}\cdot\eta_0$ is 
prime, where $\bar{\mathfrak{J}} = \mathfrak{J}\cdot\bar{\mathfrak{o}}$ ($\mathfrak{J}$ = quotient ring of P),13 and that the irreducible 
subvariety of $\bar V$ defined by this prime ideal is the center of B. Let B' be an 
arbitrary valuation of center P, whose center on $\bar V$ is at finite distance (such 
valuations exist, since we have assumed that T(P) is not entirely at infinity). 

Among the uniformizing parameters $t_1, \cdots, t_r$ of P we take one which 
has least value in B'. Let it be $t_1$. Since $\mathfrak{P} = \mathfrak{J}\cdot(\eta_0, \eta_1, \cdots, \eta_m)$, it follows 
that $t_i \equiv 0(\eta_0, \eta_1, \cdots, \eta_m)$. Consequently $t_i/\eta_0 \in \bar{\mathfrak{J}}$, and, in particular, 
$t_1/\eta_0 \in \bar{\mathfrak{J}}$. Since the center of B' is at finite distance we conclude that 
$v_{B'}(t_1) \geq v_{B'}(\eta_0)$. This inequality implies that $\eta_0 \not\equiv 0(\mathfrak{P}^2)$. For, assume that 
$\eta_0 \equiv 0(\mathfrak{P}^2)$. Then $\eta_0 = \varphi_2(t_1, \cdots, t_r)$, where $\varphi_2$ is a quadratic form in the 
t's, with coefficients in $\mathfrak{J}$. We can write $\eta_0/t_1 = t_1\varphi_2(1, t_2/t_1, \cdots, t_r/t_1)$. By 
hypothesis, $v_{B'}(t_i/t_1) \geq 0$. We also have $v_{B'}(t_1) > 0$ and $v_{B'}(\alpha) \geq 0$, for any 
element $\alpha$ in $\mathfrak{J}$ (since P is the center of B'). Consequently, $v_{B'}(\eta_0/t_1) > 0$, a 
contradiction. 

Since $\eta_0 \not\equiv 0(\mathfrak{P}^2)$, we have $v_B(\eta_0) = 1$, and since $v_B(\eta_i) \geq 1$, we conclude that 
the quotients $\eta_i/\eta_0$ belong to the valuation ring of B. Consequently the entire 
ring $\bar{\mathfrak{J}}$ is contained in the valuation ring of B. Therefore the center of B on $\bar V$ 
is an (irreducible) subvariety W of $\bar V$ at finite distance, where W will be given by 
the prime ideal $\bar{\mathfrak{P}}$ in $\bar{\mathfrak{J}}$ consisting of the elements of positive value in B. Now 
it is clear that $\bar{\mathfrak{J}}\cdot\eta_0 \equiv 0(\bar{\mathfrak{P}})$. On the other hand, let $\alpha \in \bar{\mathfrak{J}}$, $v_B(\alpha) > 0$. We 
can write $\alpha$ in the form: $\alpha = \varphi_\rho(\eta_0, \eta_1, \cdots, \eta_m)/\eta_0^\rho$, where $\varphi_\rho$ is a form of 
degree $\rho$, with coefficients in $\mathfrak{J}$. Since $v_B(\alpha) > 0$ and $v_B(\eta_0^\rho) = \rho$, we must have 
$v_B(\varphi_\rho) \geq \rho + 1$. Hence $\varphi_\rho = \varphi_{\rho+1}(t_1, \cdots, t_r)$, where $\varphi_{\rho+1}$ is a form of degree 
$\rho + 1$ in $t_1, \cdots, t_r$, with coefficients in $\mathfrak{J}$. Since $t_i \equiv 0(\eta_0, \eta_1, \cdots, \eta_m)$, we 
will also have $\varphi_\rho = g_{\rho+1}(\eta_0, \cdots, \eta_m)$, where $g_{\rho+1}$ is again a form of degree 
$\rho + 1$ in $\eta_0, \cdots, \eta_m$, with coefficients in $\mathfrak{J}$. Consequently, $\alpha = g_{\rho+1}/\eta_0^\rho = 
\eta_0\cdot g_{\rho+1}(1, \eta_1/\eta_0, \cdots, \eta_m/\eta_0)$, whence $\alpha \equiv 0(\bar{\mathfrak{J}}\cdot\eta_0)$. We conclude that 
$\bar{\mathfrak{J}}\cdot\eta_0 = \bar{\mathfrak{P}}$, and this completes the proof. 

From the fact that the irreducible variety T(P) is defined by a principal ideal, 
namely by the ideal $\bar{\mathfrak{o}}\cdot\eta_0$, it follows that T(P) is a simple subvariety of $\bar V$ (of 
dimension r − 1, and having $\eta_0$ as uniformizing parameter). Therefore T(P) contains 
points which are simple for $\bar V$. We shall need, however, the following stronger 
result: 

Theorem 2. Every point of T(P) is a simple point of $\bar V$. 

Proof. Let $\bar P$ be a point of T(P), which we may assume to be a point at 
finite distance. We consider an arbitrary but fixed valuation B' with centers 
P and $\bar P$ (on V and $\bar V$ respectively). Assuming, as we did before, that 
$v_{B'}(t_1) \leq v_{B'}(t_i)$, i = 1, 2, ⋯, r, we will have $t_i/\eta_0 \in \bar{\mathfrak{J}}$, and since $\bar{\mathfrak{J}} \subset Q(\bar P)$, 
it follows that 

(2)   $t_i/\eta_0 \in Q(\bar P)$,   i = 1, 2, ⋯, r. 

13 Every prime ideal jj of 0*770 contracts to p (see footnote 11), and hence J* o,C5p. 
From this it follows immediately the ideals o »?o and 5 *770 have like decompositions into 
maximal primary components. 





OSCAR ZARISKI 


We consider the ring $\mathfrak{o}^* = K[\xi_1, \cdots, \xi_n, t_2/t_1, \cdots, t_r/t_1]$. Let V* be the 
model whose general point (in non-homogeneous coordinates) is $(\xi_1, \cdots, \xi_n, 
t_2/t_1, \cdots, t_r/t_1)$. Since $v_{B'}(t_i/t_1) \geq 0$, the center P* of B' on V* is a point at 
finite distance. We will have also the following relations, similar to (2): 

(3)   $\eta_i/t_1 \in Q(P^*)$,   i = 0, 1, ⋯, m. 

From (2) and (3) it follows that $t_1/\eta_0$ is an element of $Q(\bar P)$ and that its reciprocal 
has non-negative value in the valuation B'. Since $\bar P$ is the center of B' on $\bar V$, 
we conclude that $t_1/\eta_0$ is a unit in $Q(\bar P)$. Similarly, $t_1/\eta_0$ is a unit in Q(P*). 
But then the quotients $t_i/t_1 (= t_i/\eta_0 \cdot \eta_0/t_1)$ are in $Q(\bar P)$, and the quotients 
$\eta_i/\eta_0 (= \eta_i/t_1 \cdot t_1/\eta_0)$ are in Q(P*). Therefore $\bar{\mathfrak{o}} \subset Q(P^*)$ and $\mathfrak{o}^* \subset Q(\bar P)$, and 
this implies: $Q(P^*) = Q(\bar P)$. Thus, we have only to show that P* is a simple 
point of V*. For that we have to exhibit uniformizing parameters at P*. 

Let $\Delta$ be the residue class field of the point P, i.e. let $\Delta = \mathfrak{o}/\mathfrak{p}$. Here $\Delta$ is an 
algebraic extension of K. Similarly, let $\Delta^*$ be the residue class field of P*. 
We have $\Delta^* \supseteq \Delta$, since $\mathfrak{o}^* \supseteq \mathfrak{o}$ and since the prime $\mathfrak{o}^*$-ideal $\mathfrak{p}^*$ of the point P* 
contracts in $\mathfrak{o}$ to $\mathfrak{p}$. Let $c_1, \cdots, c_n$ be the P-residues of $\xi_1, \cdots, \xi_n$ ($c_i \in \Delta$), 
and let $d_2, \cdots, d_r$ be the P*-residues of $t_2/t_1, \cdots, t_r/t_1$ ($d_i \in \Delta^*$). The residue 
$d_i$ will be the root of an irreducible polynomial $f_i(z)$ with coefficients in $\Delta$. Replace 
each coefficient of $f_i(z)$ by an arbitrary but fixed element of $\mathfrak{o}$ of which it is the 
residue. We get a certain polynomial $F_i(z)$ with coefficients in $\mathfrak{o}$. We assert 
that the r elements 



are uniformizing parameters at P*, i.e. $t_1', t_2', \cdots, t_r'$ generate in Q(P*) the ideal 
of non-units. It is sufficient to show that the ideal $\mathfrak{o}^*(t_1', t_2', \cdots, t_r')$ is the 
intersection of prime zero-dimensional ideals (one of these ideals will have to be the 
ideal defined by the point P*, since $t_i' = 0$ at P*). Now this ideal is contained 
in the prime (r − 1)-dimensional ideal $\mathfrak{o}^* t_1 (= \mathfrak{o}^*\mathfrak{p}$; the center of the p-adic 
divisor on V*). Modulo this prime ideal the elements $t_2/t_1, \cdots, t_r/t_1$ are 
algebraically independent, while the residues of $\xi_1, \cdots, \xi_n$ are $c_1, \cdots, c_n$ 
respectively. Therefore, if we pass to the ring $\mathfrak{o}^*/\mathfrak{o}^* t_1$, we see immediately that the 
above assertion is equivalent to the following statement: if $z_2, \cdots, z_r$ are 
indeterminates over $\Delta$, then the polynomials $f_2(z_2), \cdots, f_r(z_r)$ generate in the 
polynomial ring $\Delta[z_2, \cdots, z_r]$ an ideal which is the intersection of prime 
zero-dimensional ideals. Since the polynomials $f_i(z_i)$ are irreducible over $\Delta$ and 
since $\Delta$ is of characteristic zero, this statement is true and its proof is 
straightforward. 

II. Resolution of the Singularities of an Algebraic Surface 

5. Two lemmas 

If W is an arbitrary collection of points on a given irreducible algebraic 
variety V (for instance, if W is an algebraic subvariety of V), we denote by 
N(W) the set of all zero-dimensional valuations B (of the field $\Sigma$ of rational 
functions on V) such that the center of B on V is in W. 

Lemma 1. If W is an algebraic subvariety of V and if for any point P of W 
there exists a resolving system for N(P), then there also exists a resolving system 
of N(W). 

Proof. Let s = dimension W, i.e. s is the highest dimension of the irreducible 
components of W. For s = 0 the lemma is trivial, because W consists then of a 
finite number of points. We therefore assume that the lemma is true for 
$s = \rho - 1$ and we prove that it is also true if dim W = $\rho$. 

We fix a point $P_i$ on each irreducible $\rho$-dimensional component of W and we 
consider a resolving system $V_1, \cdots, V_h$ of $N(P_1, P_2, \cdots)$. Let V* be the 
join of $V, V_1, \cdots, V_h$. The points of V* to which there correspond singular 
points of $V_i$ form an algebraic subvariety $W_i^*$ of V* (i = 1, 2, ⋯, h). Let W* 
be the intersection of $W_1^*, \cdots, W_h^*$. The points of W which correspond to 
points of W* form an algebraic subvariety $W_1$ of W, of dimension < $\rho$, since 
$P_i \notin W_1$. It is clear that $(V_1, \cdots, V_h)$ is also a resolving system of 
$N(W - W_1)$. Hence, if there exists a finite resolving system for $N(W_1)$, this 
system, together with $V_1, \cdots, V_h$, will give a resolving system for N(W). 
Since dim $W_1$ < $\rho$, our induction is complete. 

The second lemma deals with algebraic surfaces. Let F be a normal surface, 
and let P be a point of F. We apply to F a quadratic transformation of center 
P, getting a surface $F_1'$. If $F_1'$ is not normal, we pass from $F_1'$ to a corresponding 
derived normal surface $F_1$ (see “Results,” p. 290). The birational transformation 
which consists in passing from a given variety to a corresponding derived 
normal variety shall be referred to in the sequel as a normal transformation. 
We know from section 3 that the transform of P on $F_1'$ is a pure (r − 1)-dimensional 
subvariety of $F_1'$. We take an arbitrary point $P_1'$ of this subvariety. 
To $P_1'$ there will correspond on $F_1$ at most a finite number of points. Let $P_1$ 
be one of these points. We now repeat the above procedure, starting with the 
normal surface $F_1$ and with the point $P_1$. We will get first a quadratic transform 
$F_2'$ of $F_1$ (with $P_1$ as center of the quadratic transformation) and then a 
derived normal variety $F_2$ of $F_2'$. On $F_2'$ we select at random a point $P_2'$ which 
corresponds to $P_1$, and then we let $P_2$ be one of the points of $F_2$ which 
corresponds to $P_2'$. In this fashion we proceed indefinitely, getting an infinite 
sequence of models $F; F_1', F_1; F_2', F_2; \cdots; F_i', F_i; \cdots$, and of points 
$P; P_1', P_1; P_2', P_2; \cdots; P_i', P_i; \cdots$, where: a) $P_i' \in F_i'$, $P_i \in F_i$; b) the quotient 
ring $Q(P_i)$ is integrally closed and c) $Q(P) \subset Q(P_1') \subseteq Q(P_1) \subset Q(P_2') \subseteq 
Q(P_2) \subset \cdots$. 

Lemma 2. The union of the quotient rings $Q(P_i)$ (or $\lim Q(P_i)$) is the valuation 
ring of a zero-dimensional valuation of $\Sigma$. 

The proof is exactly the same as the one we gave in the case of algebraically 
closed ground fields (“Reduction,” p. 681, Theorem 10). 14 


14 In that proof we selected an element in Q(P ) which is a non-unit of Q(P) but is not 
in the “tangential” ideal. The consideration of the tangential ideal can be omitted. 
We know that if is the ideal of non-units in Q(P), $ ■■ (vo, m » • • • , i?m)> then the ex- 





6. The existence of resolving systems 

Let F be a normal surface. We shall prove that the hypothesis that the 
field $\Sigma$ of rational functions on F does not possess a resolving system leads to a 
contradiction. Under this hypothesis it follows, in view of Lemma 1, that there 
must exist a point P on F such that N(P) does not possess a resolving system. 
By a quadratic transformation of center P, we transform F into a surface $F_1'$ 
and into the derived normal surface $F_1$ of $F_1'$. Let $W_1$ be the subvariety of $F_1$ 
whose points correspond to P. It is clear that $N(P) = N(W_1)$, and hence there 
must exist a point $P_1$ on $W_1$ such that $N(P_1)$ does not possess a resolving system. 
By repeated application of this argument, we get an infinite sequence of points 
$P, P_1, \cdots, P_i, \cdots$ of the type considered in the preceding section, such that 
$N(P_i)$ does not possess a resolving system, i = 1, 2, ⋯. Let B be the 
zero-dimensional valuation whose valuation ring $\mathfrak{B}$ is the union of the rings $Q(P_i)$. 
By the local uniformization theorem, let F* be a model on which the center P* 
of B is a simple point. We have $Q(P^*) \subset \mathfrak{B}$, hence $Q(P^*) \subseteq Q(P_i)$, for i 
sufficiently high, say $i \geq m$. Every valuation of center $P_i$, $i \geq m$, will have P* 
as center on F*, i.e. $N(P_i) \subseteq N(P^*)$. Therefore F* is a resolving surface for 
$N(P_i)$, if $i \geq m$, a contradiction. 

7. Proof of the fundamental theorem 

Let F, F' be the pair of surfaces which constitute a resolving system for a 
given set N of zero-dimensional valuations of our field $\Sigma$. The birational 
correspondence between F and F' may have fundamental points on either surface. 
Our first step consists in the elimination of the fundamental points of one 
of the two surfaces, say of F, by applying to F a sequence of successive quadratic 
and normal transformations, as described in the preceding section. As center 
of the quadratic transformation we take a fundamental point of F, one at a time, 
until we have exhausted all the fundamental points of F. As a result we get 
some new normal surface, say $F_1$. If the birational correspondence between 
$F_1$ and F' still has fundamental points on $F_1$, these points must be among the 
points which correspond to the fundamental points of F. In that case we 
proceed with $F_1$ in the same fashion. We assert that after a finite number of 
steps we will get a surface $F_i = \bar F$ such that the birational correspondence between 
$\bar F$ and F' has no fundamental points on $\bar F$. For, otherwise there would exist an 
infinite sequence of points $P, P_1, P_2, \cdots$, $P_i \in F_i$, such that each point $P_i$ is 
fundamental and such that $Q(P) \subset Q(P_1) \subset \cdots$. Let $\mathfrak{B} = \lim Q(P_i)$, and 
let B be the corresponding zero-dimensional valuation (Lemma 2). Let P' be 
the center of B on F'. Then if i is sufficiently high we will have $Q(P_i) \supseteq Q(P')$, 
and this is in contradiction with our hypothesis that $P_i$ is a fundamental point 
of the birational correspondence between $F_i$ and F' (see section 3). 

Since under quadratic transformations simple points go into simple points 

tended ideal of ? in Q(P i) is a principal ideal generated by one of the elements w , say 
by 170 . Take as the element 970 . 





(section 4) and since under normal transformations simple points are not affected 
at all, it follows that the pair of surfaces ($\bar F$, F') is also a resolving system of N. 

Our next step is similar to the step just executed, namely we now eliminate 
by quadratic and normal transformations the fundamental points of F' in the 
birational correspondence between $\bar F$, F'. However, this time we only eliminate 
those fundamental points of F' which are simple points of F'. No singular point 
of F' will be affected, even if it is a fundamental point. Let $\bar F'$ be the transform 
of F' thus obtained. 

We assert that the join F* of $\bar F$ and $\bar F'$ is a resolving surface for N. 

The proof is straightforward. For, let B be any valuation in the set N. Let 
$\bar P$, P', $\bar P'$ and P* be the centers of B on $\bar F$, F', $\bar F'$ and F* respectively. We 
have to show that P* is a simple point. 

First Case: P' is a singular point. Then $\bar P$ is simple (since ($\bar F$, F') is a 
resolving system for N). We have $Q(\bar P) \supseteq Q(P')$ (since the birational 
correspondence between $\bar F$ and F' has no fundamental points on $\bar F$) and also 
$Q(P') = Q(\bar P')$ (since the singular points of F' have not been affected in the 
passage from F' to $\bar F'$). Therefore $Q(\bar P) \supseteq Q(\bar P')$ and consequently 15 $Q(\bar P) = 
Q(P^*)$. Since $\bar P$ is a simple point, it follows that also P* is a simple point. 

Second Case: P' is a simple point. If P' is not a fundamental point of the 
birational correspondence between $\bar F$ and F', then $Q(P') = Q(\bar P)$ (since $\bar P$ is 
also not a fundamental point), and moreover $Q(P') = Q(\bar P')$ (since the points 
of F' which are not fundamental in the above birational correspondence have 
not been affected in the passage from F' to $\bar F'$). Hence $Q(\bar P) = Q(P') = 
Q(P^*)$, and therefore P* is a simple point. If, on the other hand, P' is a 
fundamental point in the birational correspondence between $\bar F$ and F', then $\bar P'$ is not 
fundamental for the birational correspondence between $\bar F$ and $\bar F'$ (because the 
simple fundamental points of F' are eliminated in the course of the passage 
from F' to $\bar F'$). Hence $Q(\bar P') \supseteq Q(\bar P)$, $Q(P^*) = Q(\bar P')$. Since P' is simple, 
also $\bar P'$ is simple, and consequently P* is a simple point, q.e.d. 

The Johns Hopkins University 


15 We make use of the following property of the join V* of two varieties V and V': if 
P*, P and P' are corresponding points of V*, V and V' respectively and if $Q(P) \supseteq 
Q(P')$, then $Q(P) = Q(P^*)$. The proof is straightforward. 



Annals of Mathematics 
Vol. 43, No. 3, July, 1942 


A NEW HOMOLOGY THEORY. II 

By W. Mayer 
(Received March 27, 1942) 

1. Euler relations for the new homology groups 

We assume familiarity with the definitions and notations of our previous 
paper, “A new Homology Theory,” these Annals, vol. 43, 1942, pp. 370-380. 
The homomorphisms 

(1.1)   $C^{i+q} \to C^i$,   i = 0, 1, 2, ⋯; 0 < q < p, 

defined by 

(1.2)   $F^q(K^{i+q}) = K^i$ 

have the group $Z_q^{i+q}$ as nucleus and the group $B_{p-q}^{i}$ as map. 

In this section the underlying simplicial system $\Sigma$ is supposed to be finite, and 
if in addition we restrict ourselves to its regular subsystem $\Sigma^{(r)}$, which restriction 
does not influence the homology groups, the ranks of all groups involved 
will be finite. Thus (1.1) yields for the ranks 

(1.3)   $r(C^{i+q}) = r(Z_q^{i+q}) + r(B_{p-q}^{i})$,   0 < q < p, i = 0, 1, 2, ⋯. 

Hence 

(1.4)   $r(C^i) = r(Z_q^i) + r(B_{p-q}^{i-q})$ 

is true for $i \geq q$ and will remain true for i < q if the rank of a group of negative 
dimension is put equal to zero. This follows from 

(1.5)   $C^i = Z_q^i$ for i < q (and i = 0, ±1, ±2, ⋯). 

Remark: The introduction of chains of negative dimensions here has as 
its sole purpose to dispense with the handling of sub-cases. (This device was 
also used in §7 of our previous paper, but differently from the way adopted 
here.) 

Hence we have 

(1.6)   $r(C^i) = r(Z_q^i) + r(B_{p-q}^{i-q})$,   0 < q < p, i = 0, ±1, ±2, ⋯. 

Furthermore from 

(1.7)   $H_q^i = Z_q^i - B_q^i$ 

defining the (q, i)-homology groups, we derive 

(1.8)   $r(H_q^i) = r(Z_q^i) - r(B_q^i)$,   0 < q < p, i = 0, ±1, ±2, ⋯. 

Eliminating $r(Z_q^i)$ from (1.6) and (1.8) we arrive at 

(1.9)   $r(H_q^i) = r(C^i) - r(B_q^i) - r(B_{p-q}^{i-q})$,   0 < q < p, i = 0, ±1, ±2, ⋯. 

Here we replace q and i by p − q and i − q respectively, to get 

(1.10)   $r(H_{p-q}^{i-q}) = r(C^{i-q}) - r(B_{p-q}^{i-q}) - r(B_q^{i-p})$,   0 < q < p, i = 0, ±1, ±2, ⋯. 

From (1.9) and (1.10) we eliminate $r(B_{p-q}^{i-q})$: 

(1.11)   $r(H_q^i) - r(H_{p-q}^{i-q}) = r(C^i) - r(C^{i-q}) - r(B_q^i) + r(B_q^{i-p})$, 
         0 < q < p, i = 0, ±1, ±2, ⋯. 

For i = np + j, $0 \leq j < p$, (1.11) becomes 

(1.12)   $r(H_q^{np+j}) - r(H_{p-q}^{np+j-q}) = r(C^{np+j}) - r(C^{np+j-q}) - r(B_q^{np+j}) + r(B_q^{(n-1)p+j})$. 

For j and q fixed ($0 \leq j < p$, 0 < q < p) we sum the relations (1.12) for n = 
0, 1, 2, ⋯, to obtain the Euler relations 

(1.13)   $\sum_n [r(H_q^{np+j}) - r(H_{p-q}^{np+j-q})] = \sum_n [r(C^{np+j}) - r(C^{np+j-q})]$. 

The sums appearing in (1.13) are finite since $\Sigma$ is finite. (For p = 2, j = q = 1 
the relation (1.13) is the Euler relation for the classical modulo 2 case.) 
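The classical case p = 2, j = q = 1 can be checked numerically. The following Python sketch is only an illustration under an assumption not stated in the text: for p = 2 the chains are taken to be ordinary modulo 2 chains on the simple simplexes. It compares the two sides of the Euler relation for the boundary of a tetrahedron. All function names are hypothetical.

```python
from itertools import combinations


def rank_gf2(rows):
    """Rank over GF(2) of a matrix whose rows are integer bitmasks."""
    basis = {}  # leading bit -> stored row with that leading bit
    rank = 0
    for row in rows:
        while row:
            top = row.bit_length() - 1
            if top not in basis:
                basis[top] = row
                rank += 1
                break
            row ^= basis[top]  # cancel the leading bit and keep reducing
    return rank


def faces_by_dimension(top_simplices):
    """All faces (as sorted vertex tuples) of the given simplices, by dimension."""
    faces = {}
    for simplex in top_simplices:
        for size in range(1, len(simplex) + 1):
            for face in combinations(sorted(simplex), size):
                faces.setdefault(size - 1, set()).add(face)
    return {dim: sorted(fs) for dim, fs in faces.items()}


def counts_and_mod2_ranks(top_simplices):
    """Return (simplex counts, mod 2 homology ranks) in each dimension."""
    faces = faces_by_dimension(top_simplices)
    top = max(faces)
    counts = [len(faces[d]) for d in range(top + 1)]
    boundary_rank = [0] * (top + 2)  # boundary_rank[d]: rank of F on d-chains
    for d in range(1, top + 1):
        index = {face: i for i, face in enumerate(faces[d - 1])}
        rows = []
        for simplex in faces[d]:
            mask = 0
            for i in range(len(simplex)):
                mask |= 1 << index[simplex[:i] + simplex[i + 1:]]
            rows.append(mask)
        boundary_rank[d] = rank_gf2(rows)
    ranks = [counts[d] - boundary_rank[d] - boundary_rank[d + 1]
             for d in range(top + 1)]
    return counts, ranks


def euler_relation_sides(top_simplices):
    """Odd-dimensional minus even-dimensional sums, for ranks and for counts."""
    counts, ranks = counts_and_mod2_ranks(top_simplices)
    left = sum(r for d, r in enumerate(ranks) if d % 2) - \
        sum(r for d, r in enumerate(ranks) if d % 2 == 0)
    right = sum(c for d, c in enumerate(counts) if d % 2) - \
        sum(c for d, c in enumerate(counts) if d % 2 == 0)
    return left, right


# Boundary of a tetrahedron: the four triangles on the vertices 0, 1, 2, 3.
sphere = list(combinations(range(4), 3))
left, right = euler_relation_sides(sphere)
print(left, right)
```

Both sides agree for this example; the check says nothing about the groups for p > 2, where non-simple simplexes enter.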

If we let 

(1.14)   $r_m = r(C^m)$, 

then $r_m$ is the number of all simplexes of dimension m, simple or not. If on 
the other hand $\alpha_m$ denotes the number of the simple simplexes of dimension 
m then $r_m$ is linearly expressible in terms of $\alpha_0, \alpha_1, \cdots, \alpha_m$, with integral 
coefficients which do not depend on the underlying simplicial system $\Sigma$. 
Consequently the right side of (1.13) is a linear function of $\alpha_0, \alpha_1, \cdots, \alpha_N$, where 
N denotes the dimension of $\Sigma$ (i.e. N is the largest dimension of a simple simplex 
of $\Sigma$), with coefficients independent of $\Sigma$. 

For a Euclidean polyhedron $|\Sigma|$, which is a realization of the simplicial 
system $\Sigma$, the left side of (1.13) has been shown to be a topological invariant. 
Hence this is true for the right side, and being linear in the $\alpha_0, \alpha_1, \cdots, \alpha_N$, 
this side can differ from the Euler number 

(1.15)   $E(\Sigma) = \alpha_0 - \alpha_1 + \alpha_2 - \cdots + (-1)^N \alpha_N$ 

only by an integral factor (§4). To determine this factor we may choose the 
system $\Sigma$ in the simplest way, and this we do in taking the system $\Sigma_0$ composed 
of one zero-dimensional simplex. Here $\alpha_0 = 1$ and all the other $\alpha_i$, i > 0, 
are zero. Furthermore for $\Sigma_0$ 

(1.16)   $r_0 = r_1 = \cdots = r_{p-2} = 1$,   $r_n = 0$ for n > p − 2, 

since only regular chains are admitted. For $\Sigma_0$ the right side of (1.13) becomes 

(1.17)   $r_j - r_{j-q} - r_{p+j-q}$, 


these being the only non-vanishing terms. Two cases arise: a) $j \neq p - 1$, 
and b) j = p − 1. 

Case a, j = 0, 1, ⋯, p − 2, splits into subcases $a_1$) $j \geq q$ and $a_2$) j < q: 

$a_1$)   $r_j = 1$, $r_{j-q} = 1$, $r_{p+j-q} = 0$, 

$a_2$)   $r_j = 1$, $r_{j-q} = 0$, $r_{p+j-q} = 1$ for j < q − 1, and $r_{p+j-q} = 0$ for j = q − 1. 

Hence for $j \neq p - 1$ the expression (1.17) is zero except for j = q − 1 in which 
case (1.17) = $E(\Sigma_0)$ = 1. 

In Case b, $r_j = 0$, $r_{j-q} = r_{p-q-1} = 1$, $r_{p+j-q} = r_{2p-q-1} = 0$, since 0 < q < p. 
Hence for j = p − 1 the expression (1.17) is (independent of q) equal to $-E(\Sigma_0)$. 

Thus the following formula has been proved 

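(1.18)   $\sum_{n=0}^{\infty} [r(H_q^{np+j}) - r(H_{p-q}^{np+j-q})] = f_j^q E(\Sigma)$, 

where $f_j^q = \delta_j^{q-1} - \delta_j^{p-1}$, with $\delta_i^k = 1$ for i = k and $\delta_i^k = 0$ for $i \neq k$. 

Remark: $f_j^q = 0$ is possible only for $j \neq q - 1$ and $j \neq p - 1$ since $q - 1 \neq 
p - 1$. 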
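The case analysis for the one-point system can be checked mechanically. The Python sketch below uses hypothetical helper names and assumes only the ranks (1.16), with the rank of a group of negative dimension put equal to zero. It evaluates the right side of (1.13) for every admissible j and q and compares it with the factor $\delta_j^{q-1} - \delta_j^{p-1}$.

```python
def rank_one_point(m, p):
    """r_m for the system consisting of a single vertex, following (1.16)."""
    return 1 if 0 <= m <= p - 2 else 0


def right_side_of_euler_relation(j, q, p):
    """Sum over n of r_{np+j} - r_{np+j-q}; terms with n >= 2 all vanish."""
    return sum(rank_one_point(n * p + j, p) - rank_one_point(n * p + j - q, p)
               for n in range(3))


def expected_factor(j, q, p):
    """delta_j^{q-1} - delta_j^{p-1}."""
    return int(j == q - 1) - int(j == p - 1)


for p in (2, 3, 5, 7, 11):
    for q in range(1, p):
        for j in range(p):
            assert right_side_of_euler_relation(j, q, p) == expected_factor(j, q, p)
print("factor confirmed for p = 2, 3, 5, 7, 11")
```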

This formula being true for any (finite) $\Sigma$ suggests first 

(1.19)   $r(H_q^n) = 0$, $r(H_{p-q}^{n-q}) = 0$, for $n \not\equiv -1$ and $n - q \not\equiv -1$ (mod p), 

and indeed these two relations hold, as will be shown in the next section. It 
suffices to prove the first of them, since $n \not\equiv -1$ and $n - q \not\equiv -1$ (mod p) 
entails $n - q \not\equiv -1$ and $(n - q) - (p - q) \equiv n \not\equiv -1$ (mod p). In §3 we 
shall prove 

(1.20)   $r(H_q^{\nu p+q-1})$ and $r(H_q^{\nu p+p-1})$ 

independent of q, likewise suggested by (1.18). 

In the following the simplicial system is no longer supposed to be finite. 

2. Proof of the rank relations 

(2.1)   $r(H_q^n) = 0$ for $n \not\equiv -1$ and $n - q \not\equiv -1$ (mod p). 

We first prove 

(2.2)   For $n \not\equiv -1$ (mod p) any (q, n)-cycle is a (p − q − 1)-boundary. 

(Of course, for q = p − 1 the above statement is vacuous.) 

To prove (2.2) we introduce p − 1 linear operators $D_a$, a = 0, 1, ⋯, p − 2, 
with 




(2.3)   $D_a(P_1 \cdots P_{n+1}) = (-1)^a a! \sum_{j=1}^{n+1} (P_1 \cdots P_j^{p-a-1} \cdots P_{n+1})$, 

for n-chains, $n \geq 0$ and with 

(2.3.1)   $D_a K^n = 0$ for n < 0, ($K^n$ a chain of negative dimension). 

For $n \geq 0$ and a = 0, 1, ⋯, p − 3, we derive from (2.3) 

(2.4)   $F D_a(P_1 \cdots P_{n+1}) = (-1)^{a+1}(a + 1)! \sum_{j=1}^{n+1} (P_1 \cdots P_j^{p-a-2} \cdots P_{n+1}) 
         + (-1)^a a! \sum_{j=1}^{n+1} \sum_{k \neq j} (P_1 \cdots P_j^{p-a-1} \cdots P_{n+1})_k$. 

By (2.3), (2.3.1) the last term on the right side of (2.4) equals 

$(-1)^a a! \sum_{k=1}^{n+1} \sum_{j \neq k} (P_1 \cdots P_j^{p-a-1} \cdots P_{n+1})_k = D_a F(P_1 \cdots P_{n+1})$, 

and therefore we have for $n \geq 0$ and a = 0, 1, ⋯, p − 3 

(2.5)   $(F D_a - D_a F)(P_1 \cdots P_{n+1}) = D_{a+1}(P_1 \cdots P_{n+1})$, 

or, as an operator relation, 

(2.6)   $(F D_a - D_a F)K^n = D_{a+1} K^n$ 

which holds for any n (because of (2.3.1)) and for a = 0, 1, ⋯, p − 3. 
From (2.6) we derive 

(2.7)   $D_a = \sum_{r=0}^{a} (-1)^r \binom{a}{r} F^{a-r} D_0 F^r$,   a = 0, 1, ⋯, p − 2. 
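The passage from the commutator relation (2.6) to the expansion (2.7) is a purely formal identity. It holds for any two operators F and $D_0$ once $D_{a+1}$ is defined as $F D_a - D_a F$. The Python sketch below illustrates this with small integer matrices standing in for F and $D_0$. The matrices are arbitrary stand-ins rather than the boundary operator of the paper, and all helper names are hypothetical.

```python
from math import comb
import random


def mat_mul(a, b):
    """Product of two square integer matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def mat_sub(a, b):
    return [[x - y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def mat_pow(a, exponent):
    n = len(a)
    result = [[int(i == j) for j in range(n)] for i in range(n)]
    for _ in range(exponent):
        result = mat_mul(result, a)
    return result


def iterated_commutator(f, d0, a):
    """D_a obtained from D_0 by applying D -> F D - D F exactly a times."""
    d = d0
    for _ in range(a):
        d = mat_sub(mat_mul(f, d), mat_mul(d, f))
    return d


def binomial_expansion(f, d0, a):
    """Sum over r of (-1)^r C(a, r) F^(a-r) D_0 F^r."""
    n = len(f)
    total = [[0] * n for _ in range(n)]
    for r in range(a + 1):
        term = mat_mul(mat_mul(mat_pow(f, a - r), d0), mat_pow(f, r))
        coeff = (-1) ** r * comb(a, r)
        total = [[t + coeff * x for t, x in zip(rt, rx)]
                 for rt, rx in zip(total, term)]
    return total


random.seed(0)
F = [[random.randint(-3, 3) for _ in range(4)] for _ in range(4)]
D0 = [[random.randint(-3, 3) for _ in range(4)] for _ in range(4)]
for a in range(6):
    assert iterated_commutator(F, D0, a) == binomial_expansion(F, D0, a)
print("expansion agrees with the iterated commutator for a = 0, ..., 5")
```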
Since 

(2.8)   $D_{p-2}(P_1 \cdots P_{n+1}) = -(p - 2)!(n + 1)(P_1 \cdots P_{n+1})$, 

we get from (2.7) for a = p − 2 

(2.9)   $-(p - 2)!(n + 1)K^n = \sum_{r=0}^{p-2} (-1)^r \binom{p-2}{r} F^{p-2-r} D_0 F^r K^n$. 

But p being prime, 

(2.10)   $(p - 2)! \equiv 1$,   $\binom{p-2}{r} \equiv (-1)^r (r + 1)$   (mod p). 
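Both congruences in (2.10) are easy to confirm numerically. The first is a form of Wilson's theorem, and the second follows from $p - k \equiv -k$ (mod p). The following Python sketch checks them for a few small primes; the function name is hypothetical.

```python
from math import comb, factorial


def congruences_hold(p):
    """Check (p-2)! = 1 and C(p-2, r) = (-1)^r (r+1) modulo p, for 0 <= r <= p-2."""
    factorial_ok = factorial(p - 2) % p == 1 % p
    binomials_ok = all((comb(p - 2, r) - (-1) ** r * (r + 1)) % p == 0
                       for r in range(p - 1))
    return factorial_ok and binomials_ok


for p in (2, 3, 5, 7, 11, 13, 17, 19, 23, 29):
    assert congruences_hold(p)
print("congruences (2.10) hold for the primes below 30")
```

For a composite modulus the first congruence already fails, e.g. 7! is divisible by 9.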

Hence for any n-dimensional chain $K^n$, n = 0, ±1, ±2, ⋯ the relation 

(2.11)   $\sum_{r=0}^{p-2} (r + 1) F^{p-r-2} D_0 F^r K^n = -(n + 1)K^n$ 

holds. For a q-cycle $K^n$ we have $D_0 F^q K^n = D_0 F^{q+1} K^n = \cdots = 0$, since 
then either the $F^q K^n$, $F^{q+1} K^n$, ⋯ vanish or they are chains of negative 
dimensions. The relation (2.11) then becomes 

(2.12)   $-(n + 1)K^n = F^{p-q-1}\{\sum_{r=0}^{q-1} (r + 1) F^{q-r-1} D_0 F^r K^n\}$, 

thus proving (2.2). 
The term corresponding to r = q − 1 inside the bracket in (2.12) is $q D_0 F^{q-1} K^n$. 
We prove (2.1) by showing that for $n - q \not\equiv -1$ (mod p) this term is a 
boundary: 

(2.13)   $D_0 F^{q-1} K^n = F(\ )$, for $n - q \not\equiv -1$ (mod p). 

For q − 1 > n (2.13) is the zero-chain by (2.3.1), and hence a boundary. 
Here n < q − 1, thus proving (2.13) for 0 > n − q + 1. The case n − q + 1 = 
0 can be neglected since by assumption $n - q \not\equiv -1$ (mod p). Thus we may 
assume n − q + 1 > 0 or 

(2.14)   $n \geq q$. 


Let (Pi • • • P„+i) be a simplex of the g-cycle K n ; then 
(2.15) < 


f D U F'-'iPt ■ ■ ■ P„ +1 ) = (q - 1) ! Do Z (Pi ■■■ P n+l)aj •. *a 9 —i 

(«!" ■ 


n—q+2 * 

= («? - D! Z Z (p h • • • pjt 1 • • • Pfc- i+ .). 

+ 3-1 


the summation running over all the (n — g + 2) faces of (Pi • • • P n +i). Again 
we have to introduce linear chain-operators A P ,p = 1, • • • , p — 2 which are 
defined for chains of dimensions ^ 1 by 

(2.16) A,(Px • • • P,) = Z (P. • • • rr • • • P 1 +1 • • • P,), V ^ 2, 

l3*l 

and which transform into zero chains of lower dimension. In (2.16) j, l run 
over 1, 2, • • • , v, (without repetition, j 1). Obviously 


(2.17) 


A p = A ff for p + <r = p — 1 


so that, of the operators A p only Ai, A 2 , 
n-g + 2^2we have, similarly to (2.15), 


A(p_d/ 2 , are distinct. Since 


(2.18) 


AxF^Pi ••• P^-i) 


- <« - »» £ z (p*, • • • pr • • • ps. • • • Pa.-.-)- 

(0r*-0«-«+*) 


Here, putting n — q + 2 = v, we take the boundary 
fFAiP- , (Pi---Pn + i) 


(2.19) 


= -(«-di z z (Ph---pgr 2 ---pt<”-p».) 

Ul} 


+ 2(r - l)(g - 1) I Z.. Z (Pfc • * • P|r‘ 


w. •• •?,) #-l 


Pfc) 


+ (?-!)! z z z (p ft -ptr' - pi ■■■P'.)>- 

tfi-W [;i| M/.1 





For v = n — q + 2^3 the last term (2.19) can be written 


(2.20) ?•(? — 1)! Z Z (P 

(Tl-Tr-l) l/«l 


71 


DP-1 


71 




the first sum running over all combinations (71 • • • 7 ,_i) of 1 , 2 , • • • , n + 1 . 
(Such a combination (71 • • • 7 ,- 1 ) is derived from a combination (ft • • • ft) 
in which all of the 71 , ••• 7,-1 occur. There are (n + 1 ) — (v — 1 ) = 
(g + v — 1) — (v — 1) = q such combinations (ft • • • ft).) But (2.20) = 
AjF*(Pi • • • P„+ 1 ). For v — n — q + 2 = 2 the last term (2.19) vanishes and 
so does AiF*(Pi • • • P„+i), hence (2.19) can be written 


( 2 . 21 ) 


(FAx - Ai F) F 4-1 (Pi • • • Pn+i) = 2(v - l)PoF l_1 (Pi • • • P^i) 


-(3-D! Z Z(P* 

lill 


P p—2 

0i 


Pi 


Pf.) 


For a = 2 , 3, • • • (p + l )/2 and chains of dimensions ^ 1, we introduce the 
linear operators 

(2.22) A„(Pi • • • P,) = z (Pi ' • • Pr • • • Pi • • • p.), v ^ 2. 

HU 


(Obviously A ( p_i )/ 2 = A( a+ D/ 2 ). Now ( 2 . 21 ) can be written 
(2.23) (FAj - AxF)F a_ 1 K" = 2{v - l)D 0 F 9 ~ 1 K n - A 2 F a_ 1 K\ 


Since v — 1 = n — g + 1 ^ 0 (mod p) by assumption, (2.13) is true if and 
only if 

(2.24) A 2 F a ^K B = F( ). 

This follows from (2.23), K B being a g-cycle. 

For p = 2, 3, • • • (p — l )/2 we derive equations analogous to (2.19), 

FA P F a_1 (Px ••• P„ +1 ) 

= -p(q - D! z z (/>*••• ptr~ i • • • p?r • • • p<o 

i01'"0,) [3D 

(2-25) ] + (p + 1)(? -i)i Z Z (Pft • • • PJP • • • PS, • • • p».) 

(01---0,) 

+ (3 - i)i z z z (Pfl, • • • pfr • • • ptf 1 • • • 

where the last term on the right is A p F a (Pi • • • P«+i). Hence for the 3-cycle 
K B we have for p = 2, • • • , (p — l)/2. 

(2.26) FA„F a- 1 K B = —pAp + 1 F a_ 1 K B + ( P + l)ApF a_ 1 K". 


But p and p + 1 are ^ 0, (mod p), hence from (2.26) it follows that the two 
chains 


(2.27) ApF a ~ 1 K", Ap+xF^'K", P = 2, 3, • • • , (p — l)/2 

are simultaneously bounding or not bounding. 





For p — (p — l)/2 the chains (2.27) coincide and (2.26) becomes 

(2.28) FAcp-d/sF^K" = A ( ^ 1 )/2 F'- 1 K n . 

Hence all the A,F* -1 K" bound, p = (p — l)/2, • • • , 3, 2, and then (2.13) fol¬ 
lows from (2.23), which proves (2.1). 


3. Proof of the rank relations 

(3.1)   $r(H_{p-1}^{\nu p+p-2}) = r(H_q^{\nu p+q-1})$ 

and 

(3.2)   $r(H_1^{\nu p+p-1}) = r(H_q^{\nu p+p-1})$,   q = 1, ⋯, p − 1, $\nu$ = 0, 1, 2, ⋯. 

We first define a homomorphism 

(3.3)   $h: H_r^n \to H_s^n$,   r < s, 

of the (r, n)-homology group into the (s, n)-homology group of $\Sigma$. Let 

(3.4)   $\{Z^n\}_{B_r^n}$ 

denote the homology class of the r-cycle $Z^n$ in $H_r^n$. Then either r > n or 
$F^r(Z^n) = 0^{n-r}$, hence either s > n or $F^s(Z^n) = F^{s-r}F^r(Z^n) = 0^{n-s}$ and $Z^n$ is 
an s-cycle. The mapping of homology classes 

(3.5)   $\{Z^n\}_{B_r^n} \to \{Z^n\}_{B_s^n}$ 

then defines the homomorphism (3.3). 

Remark: Since a (p − r)-boundary of dimension n is a (p − s)-boundary of 
this dimension, $B_r^n \subset B_s^n$, and the mapping (3.5) is independent of the choice 
of $Z^n$ in the class on the left in (3.5). 

Let us consider the nucleus of the homomorphism (3.3) as defined by (3.5): 
The class on the left in (3.5) belongs to the nucleus if and only if $Z^n \subset B_s^n$, 
i.e. if and only if for some chain $K^{n+p-s}$ 

(3.6)   $F^{p-s}(K^{n+p-s}) = Z^n$. 

For $n \geq r$ from $F^r(Z^n) = 0^{n-r}$ then 

(3.7)   $F^{p+r-s}(K^{n+p-s}) = 0^{n-r}$ 

follows. For n < r we have n + p − s < r + p − s and again $K^{n+p-s}$ is an 
(r + p − s)-cycle. Thus we have proved that $\{Z^n\}_{B_r^n}$ belongs to the nucleus of 
the homomorphism (3.3) if and only if a (p + r − s)-cycle of dimension (p + n − s) 
exists (i.e. an element of $Z_{p+r-s}^{p+n-s}$) such that (3.6) holds. 

But (3.6) defines a homomorphism 

(3.8)   $H_{p+r-s}^{n+p-s} \to H_r^n$. 

Indeed, $Z^n$ is an r-cycle if $K^{n+p-s}$ is a (p + r − s)-cycle. Furthermore, to a 
p − (p + r − s) = (s − r)-boundary $K^{n+p-s}$, 

(3.9)   $K^{n+p-s} = F^{s-r}(K^{n+p-r})$ 

corresponds in (3.6) 

(3.10)   $Z^n = F^{p-s}F^{s-r}K^{n+p-r} = F^{p-r}(K^{n+p-r})$, 

i.e. a (p − r)-boundary. Our above result can be stated as follows: 

$\{Z^n\}_{B_r^n}$ belongs to the nucleus of the homomorphism (3.5) if and only if it 
belongs to the map-group of the homomorphism (3.8). Or: The nucleus of 
(3.5) is the map of (3.8). 

We now consider the nucleus of the homomorphism (3.8) as defined by (3.6) 
or by 

(3.11)   $\{K^{n+p-s}\}_{B_{p+r-s}^{n+p-s}} \to \{F^{p-s}(K^{n+p-s})\}_{B_r^n}$. 

The class on the left belongs to the nucleus of (3.8) if and only if 

(3.12)   $F^{p-s}(K^{n+p-s}) = F^{p-r}(K^{n+p-r})$ 

for some chain $K^{n+p-r}$. But then 

(3.13)   $F^{p-s}[K^{n+p-s} - F^{s-r}(K^{n+p-r})] = 0$, 

and the (p − s)-cycle $K^{n+p-s} - F^{s-r}(K^{n+p-r})$, since p − (p + r − s) = s − r, 
belongs to the class on the left in (3.11). 

Thus the class on the left in (3.11) belongs to the nucleus of the homomorphism 
(3.8) if and only if this class contains a (p − s)-cycle. 

If $K^{n+p-s}$ is a (p − s)-cycle, then in a way analogous to (3.5) (since r + p − s 
> p − s) 

(3.14)   $\{K^{n+p-s}\}_{B_{p-s}^{n+p-s}} \to \{K^{n+p-s}\}_{B_{p+r-s}^{n+p-s}}$ 

defines a homomorphism 

(3.15)   $H_{p-s}^{n+p-s} \to H_{p+r-s}^{n+p-s}$. 

Our last result can now be stated as follows: 

The class on the left in (3.11) belongs to the nucleus of (3.8) if and only if it 
belongs to the map of (3.15). 

Or: The nucleus of (3.8) is the map of (3.15). 

Thus, in the three homomorphisms 

(3.16)   $H_r^n \to H_s^n$, 
         $H_{p+r-s}^{n+p-s} \to H_r^n$, 
         $H_{p-s}^{n+p-s} \to H_{p+r-s}^{n+p-s}$, 

as defined by (3.5), (3.11) and (3.14) respectively, the nucleus of the first (second) 
is the map of the second (third) homomorphism. 

Since p + r − s > p − s, the relations (3.16) may be continued to give the 
chain of homomorphisms: 

(3.17)   $H_r^n \to H_s^n$, 
         $H_{p+r-s}^{n+p-s} \to H_r^n$, 
         $H_{p-s}^{n+p-s} \to H_{p+r-s}^{n+p-s}$, 
         $H_{p-r}^{n+p-r} \to H_{p-s}^{n+p-s}$, 
         $H_{s-r}^{n+p-r} \to H_{p-r}^{n+p-r}$, 
         $H_s^{n+p} \to H_{s-r}^{n+p-r}$, 
         $H_r^{n+p} \to H_s^{n+p}$, 

in which the nucleus of any homomorphism coincides with the map of the 
following one. We write for 

(3.18)   s = r + 1, n − r = $\nu$p − 1, $\nu$ = 0, 1, 2, ⋯, r = 1, ⋯, p − 2, 

the first three homomorphisms (3.17) 

(3.19)   $H_r^{\nu p+r-1} \to H_{r+1}^{\nu p+r-1}$, 
         $H_{p-1}^{\nu p+p-2} \to H_r^{\nu p+r-1}$, 
         $H_{p-r-1}^{\nu p+p-2} \to H_{p-1}^{\nu p+p-2}$. 

By our earlier result (2.1) $H_{r+1}^{\nu p+r-1}$ and $H_{p-r-1}^{\nu p+p-2}$ are zero-groups. (Indeed 
$\nu p + r - 1 \not\equiv -1$ and $\not\equiv r$ (mod p), and $\nu p + p - 2 \not\equiv -1$ and $\not\equiv p - r - 2$ 
(mod p).) Thus the nucleus of the first homomorphism and hence the map of 
the second is the group $H_r^{\nu p+r-1}$ itself. Furthermore, the map of the third 
homomorphism and hence the nucleus of the second one is the zero-subgroup 
of $H_{p-1}^{\nu p+p-2}$. Therefore the second homomorphism (3.19) is an isomorphism 

(3.20)   $H_{p-1}^{\nu p+p-2} \cong H_r^{\nu p+r-1}$, 

thus proving (3.1) (first for r = 1, 2, ⋯, p − 2. But for r = p − 1 (3.1) 
becomes an identity). 

Now, beginning with the fourth, we write a second triple of homomorphisms 
(3.17) again using (3.18): 

(3.21)   $H_{p-r}^{\nu p+p-1} \to H_{p-r-1}^{\nu p+p-2}$, 
         $H_1^{\nu p+p-1} \to H_{p-r}^{\nu p+p-1}$, 
         $H_{r+1}^{(\nu+1)p+r-1} \to H_1^{\nu p+p-1}$. 

Here $H_{p-r-1}^{\nu p+p-2}$ and $H_{r+1}^{(\nu+1)p+r-1}$ are zero-groups, and the same reasoning as used 
before proves the isomorphism 

(3.22)   $H_1^{\nu p+p-1} \cong H_{p-r}^{\nu p+p-1}$, 

thus proving (3.2). The ranks (3.1) are in general different from the ranks 
(3.2), as follows from the Euler relations (1.18) since, in general, $E(\Sigma) \neq 0$. 

4. The Euler number 

Theorem (4.1): If $\alpha_i$ denotes the number of i-dimensional simple simplexes 
of the finite simplicial system $\Sigma$ of dimension n then (up to a factor) the Euler 
number 

$E(\Sigma) = \sum_{i=0}^{n} (-1)^i \alpha_i$ 

is the only invariant of $\Sigma$ with respect to its subdivisions, which is linear in $\alpha_0$, 
$\alpha_1, \cdots, \alpha_n$. 

Remark: In case $\Sigma$ is a polyhedron, $E(\Sigma)$ (up to a factor) will be the only 
topological invariant linear in $\alpha_0, \alpha_1, \cdots, \alpha_n$. 

Let $\sum A_k \alpha_k$, k = 0, 1, ⋯, n, be an invariant with respect to subdivisions 
of simplicial systems $\Sigma$ of dimension n. Then we prove 

(4.2)   $\sum_{k=0}^{n} A_k \alpha_k = f E(\Sigma)$. 

To prove (4.2) we can choose any system $\Sigma$ of dimension n since by assumption
the $A_k$ do not depend on $\Sigma$. We choose for $\Sigma$ the system composed of the
n-simplex $(P_1 \cdots P_{n+1})$ and its faces, denoted by $\Sigma_n$. Here

(4.3) $$\alpha_k^n = \binom{n+1}{k+1}, \qquad k = 0, 1, \dots, n.$$

Let $\Sigma_n$ be subdivided into $\Sigma'_n$ and let $\alpha'^{\,n}_k$ be the number of the k-simplexes of
$\Sigma'_n$. Then

(4.4) $$\sum_{k=0}^{n} A_k(\alpha'^{\,n}_k - \alpha_k^n) = 0,$$

i.e. the "vector" $\xi_k = \alpha'^{\,n}_k - \alpha_k^n$, k = 0, 1, ..., n, lies in the hyperplane $\sum A_k \xi_k = 0$.
If for suitable choices of such subdivisions we arrive at n such linearly
independent vectors $\xi_k^{n,\nu}$, $\nu$ = 1, 2, ..., n, then the $A_k$'s are determined up to
a factor f and (4.2) is proved.

We choose the subdivision $\Sigma'_n$ of $\Sigma_n$ yielding $\xi_k^{n,\nu}$, $\nu$ = 1, ..., n (the $\nu$th vector
in our construction) by introducing the centroid of a $\nu$-face of $\Sigma_n$.

Let $(P_1 \cdots P_{\nu+1})$ be that $\nu$-face, $\nu$ = 0 included, S its centroid and $\alpha_k^{n,\nu}$ the
number of k-simplexes of $\Sigma'_n$. A k-simplex of $\Sigma'_n$ not containing S is of the type
$(P_{\alpha_1} \cdots P_{\alpha_{k+1}})$ but not of the type $(P_1 \cdots P_{\nu+1} P_{\beta_1} \cdots P_{\beta_{k-\nu}})$, which (for $\nu \neq 0$)
does not exist in $\Sigma'_n$.

Hence the number of k-simplexes of $\Sigma'_n$ not containing S is

(4.5.1) $$\binom{n+1}{k+1} - \binom{n-\nu}{k-\nu}.$$





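A k-simplex of $\Sigma'_n$ containing S is of the type $(S P_{\alpha_1} \cdots P_{\alpha_k})$ but not of the type
$(S P_1 \cdots P_{\nu+1} P_{\beta_1} \cdots P_{\beta_{k-\nu-1}})$.

The number of k-simplexes of $\Sigma'_n$ with one vertex S therefore is

(4.5.2) $$\binom{n+1}{k} - \binom{n-\nu}{k-\nu-1}.$$

Thus

(4.6) $$\alpha_k^{n,\nu} = \binom{n+1}{k+1} + \binom{n+1}{k} - \binom{n-\nu}{k-\nu} - \binom{n-\nu}{k-\nu-1}$$

and

(4.7) $$\xi_k^{n,\nu} = \alpha_k^{n,\nu} - \alpha_k^{n} = \binom{n+1}{k} - \binom{n-\nu}{k-\nu} - \binom{n-\nu}{k-\nu-1}, \qquad \nu = 0, 1, \dots, n+1.$$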


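For $\xi_k^{n,\nu}$ the relations

(4.8) $$\sum_{k=0}^{n} (-1)^k \xi_k^{n,\nu} = 0$$

hold. This of course is a consequence of the invariance of the Euler number,
but is readily verified as follows. The left side of (4.8)

$$= \sum_{k=0}^{n} (-1)^k \binom{n+1}{k} - \sum_{k=0}^{n} (-1)^k \binom{n-\nu}{k-\nu} - \sum_{k=0}^{n} (-1)^k \binom{n-\nu}{k-\nu-1}$$

$$= (1-1)^{n+1} - (-1)^{n+1} + (-1)^{n+1} = 0.$$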
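The counting argument behind (4.5.1) to (4.8) can be checked by brute force. The following is a minimal sketch, not part of the original paper: it enumerates the simplexes of the subdivided n-simplex directly, compares the result with the closed form (4.7), and tests that the vectors for $\nu$ = 1, ..., n are linearly independent. The function names (`xi`, `xi_formula`, `rank`) are illustrative choices, and vertices are numbered from 0 rather than from 1.

```python
from fractions import Fraction
from itertools import combinations
from math import comb


def binom(a, b):
    """Binomial coefficient that is 0 outside the range 0 <= b <= a."""
    return comb(a, b) if 0 <= b <= a else 0


def xi(n, nu):
    """xi_k = alpha'_k - alpha_k after adding the centroid S of the face (P_0 ... P_nu)."""
    sigma = set(range(nu + 1))                 # vertices of the subdivided nu-face
    new_counts = [0] * (n + 1)
    for size in range(0, n + 2):
        for tau in combinations(range(n + 1), size):
            if sigma <= set(tau):
                continue                       # simplexes containing the whole face disappear
            if size >= 1:
                new_counts[size - 1] += 1      # tau itself, of dimension size - 1
            if size <= n:
                new_counts[size] += 1          # S joined with tau, of dimension size
    old_counts = [comb(n + 1, k + 1) for k in range(n + 1)]
    return [a - b for a, b in zip(new_counts, old_counts)]


def xi_formula(n, nu):
    """Closed form (4.7)."""
    return [binom(n + 1, k) - binom(n - nu, k - nu) - binom(n - nu, k - nu - 1)
            for k in range(n + 1)]


def rank(rows):
    """Rank over the rationals by Gaussian elimination."""
    rows = [[Fraction(x) for x in r] for r in rows]
    rk = 0
    for col in range(len(rows[0])):
        pivot = next((i for i in range(rk, len(rows)) if rows[i][col] != 0), None)
        if pivot is None:
            continue
        rows[rk], rows[pivot] = rows[pivot], rows[rk]
        for i in range(rk + 1, len(rows)):
            factor = rows[i][col] / rows[rk][col]
            rows[i] = [a - factor * b for a, b in zip(rows[i], rows[rk])]
        rk += 1
    return rk
```

For the triangle (n = 2), subdividing an edge gives the vector (1, 2, 1) and subdividing the triangle itself gives (1, 3, 2); both have alternating sum zero, as (4.8) requires.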

Furthermore, 

<«) «*=("r)-(*)- 

and 

(««) s; -=(“+■)-(”■")-(“,v.jS: 

From (4.7) for n, v > 0 we get 

t n-i.»-i _ /w\ _ / n — v \ _ (n — v\ 

* k "W + \k-v) 


(4.11) 

or 

(4.12) 


— t n ’* — Al + l\ , Al\ — t n,p ( n \ 

" {t+i U + i/ + W”*** v* + i/’ 





We are now able to prove the linear independence of the vectors £*•', v = 1, 
•••,»,(& = 0, 1, • • • ,n). From 

(4.13) Za,r = 0 

v-1 

by (4.10) for k = 0 we deduce 

(4.14) £a, = 0, 
and then by (4.12) and (4.9) for k = 1, • • • , n 

(4.15) 23 1' 9 1 = 23 ch'+iZk-i'* = 23 '* == 0. 

V—1 V—0 Vmml 

Thus from the linear independence of the (n — 1 ) vectors v = 1 , • • • , 

n — 1 (fc = 0, • • • , n — 1) (by (4.13), (4.14) and (4.15)) the linear independence 
of the n vectors v = 1 , • • • , n, (k = 0, 1 , • • • , n), follows. But =, 1 , 
ft = 0 , 1 , hence, of the sets of vectors p = 2 , • • • , n, f = 1 , • • • , p, (ft = 0 , 

1 , • • • , p) each one is a linearly independent set, thus proving ( 4 . 1 ). 

Institute for Advanced Study 




Annals of Mathematics
Vol. 43, No. 4, October, 1942


ITERATION OF ANALYTIC FUNCTIONS 

By Carl Ludwig Siegel 

(Received April 23, 1942) 

Let

(1) $$f(z) = \sum_{k=1}^{\infty} a_k z^k$$

be a power series without constant term and denote by R > 0 its radius of convergence.
The fixed point z = 0 of the mapping $z \to f(z)$ is called stable, if
there exist two positive finite numbers $r_0 \le R$ and $r \le R$, such that for all points
z of the circle $|z| < r_0$ the set of image points $z_1 = f(z)$, $z_{n+1} = f(z_n)$ (n = 1,
2, ...) lies in the circle $|z| < r$.

It is easy to prove the stability in the case $|a_1| < 1$, for then a positive number
$r_0 < R$ exists, such that the inequality $|f(z)| \le |z|$ holds for $|z| < r_0$, and
$r = r_0$ has the required property. Henceforth, the inequality $|a_1| \ge 1$ is
assumed.

If z = 0 is stable, then the images $z_n$ (n = 1, 2, ...) of the points z of the circle
$|z| < r_0$ under the mapping $z \to f(z)$ and its iterations cover a domain D which
is connected and contains the point z = 0. For all z in D, the image point
f(z) again lies in D. Let

(2) $$z = \varphi(\zeta) = \zeta + \sum_{k=2}^{\infty} c_k \zeta^k$$

be the power series mapping a certain circle $|\zeta| < \rho$ of the $\zeta$ plane conformally
onto the universal covering surface of D. Then the formula

$$\varphi(\zeta) = z \to f(z) = z_1 = \varphi(\zeta_1)$$

defines a function $\zeta_1 = g(\zeta)$ which is regular in the circle $|\zeta| < \rho$ and satisfies
there the inequality $|g(\zeta)| < \rho$; moreover g(0) = 0 and $g'(0) = a_1$. It follows
from Schwarz's lemma that $|a_1| = 1$ and $\zeta_1 = a_1 \zeta$. Consequently, the functional
equation of Schröder

(3) $$\varphi(a_1 \zeta) = f(\varphi(\zeta))$$

has a convergent solution $\varphi(\zeta) = \zeta + \cdots$.

On the other hand, it is obvious that z = 0 is stable, if | ai | = 1 and the 
functional equation (3) has a convergent solution. 

If $a_1$ is an nth root of unity, then z = 0 is stable, if and only if the (n - 1)th
iteration of the mapping $z \to f(z)$ is the identity. This is also easily proved by
direct calculation. We assume now that $|a_1| = 1$ and $a_1^n \neq 1$ for n = 1, 2, ....
By (1), (2) and (3),

(4) $$\sum_{k=2}^{\infty} c_k (a_1^k - a_1)\zeta^k = \sum_{l=2}^{\infty} a_l \Bigl(\zeta + \sum_{k=2}^{\infty} c_k \zeta^k\Bigr)^{l};$$

hence $c_k$ (k = 2, 3, ...) is a polynomial in $c_2, \dots, c_{k-1}$ whose coefficients depend
upon $a_1, \dots, a_k$, and there exists exactly one formal (convergent or divergent)
solution $\varphi(\zeta) = \zeta + \cdots$ of (3). The first example of a convergent series $f(z) =
a_1 z + \cdots$ with divergent Schröder series $\varphi(\zeta)$ has been given by Pfeiffer.¹ Later
Cremer² has constructed such examples for arbitrary $a_1$ satisfying the condition

$$\liminf_{n \to \infty} |a_1^n - 1|^{1/n} = 0.$$

These $a_1$ are very closely approximated by certain roots of unity, and their
linear Lebesgue measure on the unit circle $|a_1| = 1$ is 0.

Until now, however, it was not known if there exists a number $a_1$ of absolute
value 1, such that every convergent power series $f(z) = a_1 z + \cdots$ has a convergent
Schröder series. Julia³ tried to prove the erroneous hypothesis that the
Schröder series is always divergent, if $f(z) - a_1 z$ is a rational function and not
identically 0. We shall demonstrate the following

Theorem: Let

(5) $$\log |a_1^n - 1| = O(\log n) \qquad (n \to \infty);$$

then the Schröder series is convergent.

Write $a_1 = e^{2\pi i \omega}$; then the condition (5) may be expressed in the form

$$\Bigl|\omega - \frac{m}{n}\Bigr| > \lambda n^{-\mu}$$

for arbitrary integers m and n, $n \ge 1$, where $\lambda$ and $\mu$ denote positive numbers
depending only upon $\omega$. It is easily seen that (5) holds for all points of the
unit circle $|a_1| = 1$ with the exception of a set of measure 0.

Lemma 1: Let $x_p$ (p = 1, ..., r) and $y_q$ (q = 1, ..., s) be positive integers,
$r \ge 0$, $s \ge 2$, r + s = t,

$$\sum_{p=1}^{r} x_p + \sum_{q=1}^{s} y_q = k, \qquad \sum_{q=1}^{s} y_q > \frac{k}{2}, \qquad y_q \le \frac{k}{2} \quad (q = 1, \dots, s);$$

then

(6) $$\prod_{p=1}^{r} x_p \prod_{q=1}^{s} y_q^2 \ge k^3\, 8^{1-t}.$$

Proof: Denote the left-hand side of (6) by L and consider first the case
k < 2t - 2. Then

(7) $$k^{-3} L \ge k^{-3} > (2t - 2)^{-3}.$$


1 G. A. Pfeiffer, On the conformal mapping of curvilinear angles. The functional equation
φ[f(z)] = a₁φ(z), Trans. Amer. Math. Soc. 18, pp. 185-198 (1917).

2 H. Cremer, Über die Häufigkeit der Nichtzentren, Math. Ann. 115, pp. 573-580 (1938).

3 G. Julia, Sur quelques problèmes relatifs à l'itération des fractions rationnelles, C. R.
Acad. Sci. Paris 168, pp. 147-149 (1919).





Assume now k 2 s 2t — 2 and let 



Then 


r + 2 y<i = v- 

<7—1 


t£g+l£g + l + r£ii<^k, "22 x p = k — i) + r, 

P- 1 


whence 


rr ^ 7, , , t't _ j>J - < + i, if v =S g - i + t 

11 x p 2: k ij + 1, 11 y q ^ j 

r ~ 1 e “ 1 [(v ~ 9 ~ t + 2 )g, if v ^ g — 1 + t. 

In the interval g + l^y^g — 1 + t f 
(k - v + 1 )(>J - t + l ) 2 S min {( k - g)(g - t + 2 ) 2 , (k - g - t + 2 )jf 2 }; 
in the interval g— 1 + t^rj^k, 

(k - V + l)(r? - <7 - * + 2)V ^ (fc - g - * + 2)g 2 ; 
in the interval 0 ^ 

(k - g)(g - £) 2 - (k - g - £)g 2 = {(k - g)£ - (2k - 3 g)g)£ ^ g(2g - k)S g 0 ; 
consequently 

L^(k- g)(g - t + 2) 2 

(8) k~*L £ ( g ~fc +2 J ^ ~ 2)” 2 § (2< - 2)" 3 . 

Now

$$t - 1 \le 2^{t-2} \qquad (t = 2, 3, \dots),$$

and the lemma follows from (7) and (8).

We use the abbreviation

$$\varepsilon_n = |a_1^n - 1|^{-1} \qquad (n = 1, 2, \dots).$$

On account of (5), the inequalities

$$\varepsilon_n < (2n)^{\nu} \qquad (n = 1, 2, \dots)$$

are fulfilled for a certain constant positive value $\nu$. We define

$$N_1 = 2^{2\nu+1}, \qquad N_2 = 8^{\nu} N_1 = 2^{5\nu+1}.$$
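To make the small-divisor condition concrete, here is a minimal numerical sketch that is not part of the original paper. It evaluates $\varepsilon_n = |a_1^n - 1|^{-1}$ for $a_1 = e^{2\pi i\omega}$ with $\omega$ the golden mean, an illustrative choice of rotation number made here, and reports how far $\varepsilon_n$ stays below $(2n)^{\nu}$. The value $\nu = 1$ is an assumption that the computation supports only for the finite range tested.

```python
import cmath
import math

# Illustrative rotation number (an assumption of this sketch, not taken from the paper).
GOLDEN_MEAN = (math.sqrt(5) - 1) / 2


def epsilon(n, omega=GOLDEN_MEAN):
    """epsilon_n = |a_1^n - 1|^{-1} with a_1 = exp(2*pi*i*omega)."""
    return 1.0 / abs(cmath.exp(2j * math.pi * n * omega) - 1)


def worst_ratio(n_max, nu=1.0, omega=GOLDEN_MEAN):
    """Largest value of epsilon_n / (2n)^nu for 1 <= n <= n_max."""
    return max(epsilon(n, omega) / (2 * n) ** nu for n in range(1, n_max + 1))


if __name__ == "__main__":
    print(worst_ratio(2000))   # a value below 1 means epsilon_n < (2n)^nu on this range
```

A ratio below 1 on the tested range is numerical evidence only; it does not replace the arithmetic argument that (5) holds for a given $\omega$.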

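Lemma 2: Let $m_l$ (l = 0, ..., r) be integral, $r \ge 0$ and $m_0 > m_1 > \cdots > m_r > 0$; then

(9) $$\prod_{l=0}^{r} \varepsilon_{m_l} < N_1^{r+1} \Bigl\{ m_0 \prod_{l=1}^{r} (m_{l-1} - m_l) \Bigr\}^{\nu}.$$

Proof: The assertion is true in the case r = 0; assume r > 0 and apply
induction.

We have the identity

$$a_1^q (a_1^{p-q} - 1) = (a_1^p - 1) - (a_1^q - 1) \qquad (0 < q < p),$$

whence

$$\varepsilon_{p-q}^{-1} \le \varepsilon_p^{-1} + \varepsilon_q^{-1},$$

$$\min(\varepsilon_p, \varepsilon_q) \le 2\varepsilon_{p-q} < 2^{\nu+1} (p - q)^{\nu}.$$

This simple remark is the main argument of the whole proof.

Let $\varepsilon_{m_l}$ (l = 0, ..., r) have its minimum value for l = h. Then

(10) $$\varepsilon_{m_h} < 2^{\nu+1} \min\{(m_{h-1} - m_h)^{\nu}, (m_h - m_{h+1})^{\nu}\},$$

if we define moreover $m_{-1} = \infty$ and $m_{r+1} = -\infty$. On the other hand, the
lemma being true for r - 1 instead of r, we have

(11) $$\varepsilon_{m_h}^{-1} \prod_{l=0}^{r} \varepsilon_{m_l} < N_1^{r} \Bigl\{ \frac{m_0 (m_{h-1} - m_{h+1})}{(m_{h-1} - m_h)(m_h - m_{h+1})} \prod_{l=1}^{r} (m_{l-1} - m_l) \Bigr\}^{\nu}.$$

Since

$$\frac{m_{h-1} - m_{h+1}}{(m_{h-1} - m_h)(m_h - m_{h+1})} = \frac{1}{m_{h-1} - m_h} + \frac{1}{m_h - m_{h+1}} \le \frac{2}{\min(m_{h-1} - m_h,\, m_h - m_{h+1})},$$

the inequality (9) follows from (10) and (11).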

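Consider now the sequence of positive numbers $\delta_1 = 1, \delta_2, \delta_3, \dots$ recurrently
defined in the following way: For every k > 1, let $\mu_k$ denote the maximum of all
products $\delta_{l_1} \delta_{l_2} \cdots \delta_{l_r}$ with $l_1 + l_2 + \cdots + l_r = k > l_1 \ge l_2 \ge \cdots \ge l_r \ge 1$,
$2 \le r \le k$, and put

(12) $$\delta_k = \varepsilon_{k-1}\, \mu_k.$$

Lemma 3:

(13) $$\delta_k \le k^{-2\nu} N_2^{k-1} \qquad (k = 1, 2, \dots).$$

Proof: The assertion is true in the case k = 1; assume k > 1 and apply
induction.

The numbers $\alpha_k = k^{-2\nu} N_2^{k-1}$ satisfy the inequalities

$$\frac{\alpha_k \alpha_l}{\alpha_{k+l}} = (k^{-1} + l^{-1})^{2\nu} N_2^{-1} \le 2^{2\nu} N_2^{-1} < 1 \qquad (k \ge 1,\ l \ge 1),$$

and consequently

(14) $$\delta_{j_1} \delta_{j_2} \cdots \delta_{j_f} \le j^{-2\nu} N_2^{j-1} \qquad (1 \le j_1 + \cdots + j_f = j < k;\ f \ge 1).$$

By (12), there exists a decomposition

$$\delta_k = \varepsilon_{k-1}\, \delta_{g_1} \delta_{g_2} \cdots \delta_{g_\alpha} \qquad (g_1 + \cdots + g_\alpha = k > g_1 \ge \cdots \ge g_\alpha \ge 1).$$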





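In the case $g_1 > k/2$, we use this formula with $g_1$ instead of k and find a
decomposition

$$\delta_{g_1} = \varepsilon_{g_1 - 1}\, \delta_{h_1} \delta_{h_2} \cdots \delta_{h_\beta} \qquad (h_1 + \cdots + h_\beta = g_1 > h_1 \ge \cdots \ge h_\beta \ge 1);$$

if also $h_1 > k/2$, we decompose again

$$\delta_{h_1} = \varepsilon_{h_1 - 1}\, \delta_{i_1} \delta_{i_2} \cdots \delta_{i_\gamma} \qquad (i_1 + \cdots + i_\gamma = h_1 > i_1 \ge \cdots \ge i_\gamma \ge 1)$$

and so on. Writing $k_0 = k$, $k_1 = g_1$, $k_2 = h_1, \dots$, we obtain in this manner
the formula

$$\delta_k = \prod_{p=0}^{r} (\varepsilon_{k_p - 1}\, \Delta_p)$$

with $k = k_0 > k_1 > \cdots > k_r > k/2$, where $\Delta_p$ denotes for p = 0, ..., r a
certain product $\delta_{j_1} \cdots \delta_{j_f}$ and

$$j_1 + \cdots + j_f = \begin{cases} k_p - k_{p+1} & (p = 0, \dots, r-1) \\ k_r & (p = r), \end{cases}$$

all subscripts $j_1, \dots, j_f$ being $\le k/2$. The number f depends upon p; let
f = s for p = r.

Using (13) for the s single factors of $\Delta_r$ and applying (14) for the estimation
of $\Delta_p$ (p = 0, ..., r - 1), we find the inequality

$$\prod_{p=0}^{r} \Delta_p \le N_2^{k-r-s} \Bigl\{ \prod_{q=1}^{s} j_q \prod_{p=1}^{r} (k_{p-1} - k_p) \Bigr\}^{-2\nu},$$

where $1 \le j_q \le k/2$ (q = 1, ..., s) and $j_1 + \cdots + j_s = k_r$. By Lemma 2,

$$\prod_{p=0}^{r} \varepsilon_{k_p - 1} < N_1^{r+1} \Bigl\{ k \prod_{p=1}^{r} (k_{p-1} - k_p) \Bigr\}^{\nu},$$

and consequently

$$\delta_k < N_1^{r+1} N_2^{k-t} \Bigl( k^{-1} \prod_{p=1}^{r} x_p \prod_{q=1}^{s} y_q^2 \Bigr)^{-\nu},$$

with t = r + s, $x_p = k_{p-1} - k_p$, $y_q = j_q$. By Lemma 1,

$$N_2^{1-k} k^{2\nu} \delta_k < N_1^{r+1} N_2^{1-t}\, 8^{\nu(t-1)} \le (8^{\nu} N_1 N_2^{-1})^{t-1} = 1,$$

and (13) is proved.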
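The recursion (12) and the bound of Lemma 3 can be explored numerically. The following is a minimal sketch that is not part of the original paper. It assumes the golden-mean rotation number and $\nu = 1$ (both illustrative choices), computes $\mu_k$ by dynamic programming over splittings of k, and compares $\delta_k$ with $k^{-2\nu} N_2^{k-1}$, where $N_2 = 2^{5\nu+1}$. The helper name `best` is hypothetical.

```python
import cmath
import math

GOLDEN_MEAN = (math.sqrt(5) - 1) / 2   # illustrative rotation number (assumption)


def epsilon(n, omega=GOLDEN_MEAN):
    """epsilon_n = |a_1^n - 1|^{-1} with a_1 = exp(2*pi*i*omega)."""
    return 1.0 / abs(cmath.exp(2j * math.pi * n * omega) - 1)


def delta_sequence(k_max, omega=GOLDEN_MEAN):
    """Return [delta_1, ..., delta_{k_max}] defined by delta_k = epsilon_{k-1} * mu_k."""
    delta = [None, 1.0]   # delta[1] = 1; index 0 is unused
    best = [None, 1.0]    # best[j]: largest product of deltas over partitions of j into >= 1 parts
    for k in range(2, k_max + 1):
        # A partition of k into at least two parts splits into two nonempty groups,
        # so mu_k is the largest best[i] * best[k - i].
        mu = max(best[i] * best[k - i] for i in range(1, k))
        d = epsilon(k - 1, omega) * mu
        delta.append(d)
        best.append(max(d, mu))
    return delta[1:]


def lemma3_bound(k, nu=1):
    """Right-hand side of (13): k^(-2 nu) * N_2^(k-1) with N_2 = 2^(5 nu + 1)."""
    n2 = 2.0 ** (5 * nu + 1)
    return n2 ** (k - 1) / k ** (2 * nu)
```

In this range the computed $\delta_k$ stay far below the bound, which is consistent with the lemma giving a convenient rather than a sharp estimate.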

Proof of the Theorem: Since the power series (1) has a positive radius of
convergence, there exists a positive number a, such that $|a_n| \le a^{n-1}$ (n = 2,
3, ...). The functional equation (3) remains true under the transformation
$f(z) \to a f(z/a)$, $\varphi(\zeta) \to a\varphi(\zeta/a)$; hence we may assume $|a_n| \le 1$ (n = 2, 3, ...).
Instead of (4), we consider the functional equation

(15) $$\sum_{k=2}^{\infty} \eta_k \gamma_k \zeta^k = \sum_{l=2}^{\infty} \Bigl(\zeta + \sum_{r=2}^{\infty} \gamma_r \zeta^r\Bigr)^{l},$$

where $\eta_2, \eta_3, \dots$ are positive parameters. Then the coefficients $\gamma_1 = 1, \gamma_2,
\gamma_3, \dots$ are uniquely determined by the formula

(16) $$\gamma_k = \eta_k^{-1} \sum \gamma_{l_1} \gamma_{l_2} \cdots \gamma_{l_r} \qquad (k = 2, 3, \dots),$$

where $l_1, \dots, l_r$ run over all positive integral solutions of $l_1 + \cdots + l_r = k$
(r = 2, ..., k). Write $\gamma_k = \sigma_k$ in the case $\eta_k = \varepsilon_{k-1}^{-1}$ (k = 2, 3, ...), and $\gamma_k = \tau_k$
in the case $\eta_k = 1$.

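The inequality

(17) $$\sigma_k \le \delta_k \tau_k$$

is true for k = 1. Applying induction, we infer from (12) and (16) that

$$\sigma_k \le \varepsilon_{k-1}\, \mu_k \sum \tau_{l_1} \cdots \tau_{l_r} = \delta_k \tau_k;$$

hence (17) holds for all values of k.

On the other hand, the power series

$$\psi = \sum_{k=1}^{\infty} \tau_k \zeta^k$$

satisfies the equation

$$\psi - \zeta = (1 - \psi)^{-1} \psi^2,$$

whence

$$4\psi - 1 - \zeta = -(1 - 6\zeta + \zeta^2)^{1/2};$$

consequently $\psi$ converges in the circle $|\zeta| < 3 - 2\sqrt{2}$.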
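The majorant coefficients $\tau_k$ can be generated directly from (16) with $\eta_k = 1$. The following is a minimal sketch that is not part of the original paper: it computes the coefficient of $\zeta^k$ in $\sum_{l \ge 2} \psi^l$ by repeated truncated polynomial multiplication. The function name `tau_coefficients` is an illustrative choice.

```python
def tau_coefficients(k_max):
    """Return [tau_1, ..., tau_{k_max}] with tau_1 = 1 and
    tau_k = sum of tau_{l_1} * ... * tau_{l_r} over compositions of k into r >= 2 parts."""
    tau = [0, 1]                              # tau[0] is a placeholder; tau[1] = 1
    for k in range(2, k_max + 1):
        psi = tau + [0]                       # coefficients of psi up to zeta^k (tau_k not needed yet)
        power = psi[:]                        # coefficients of psi^1
        total = 0
        for _ in range(2, k + 1):             # build psi^2, psi^3, ..., psi^k
            new = [0] * (k + 1)
            for i in range(1, k + 1):
                if power[i] == 0:
                    continue
                for j in range(1, k + 1 - i):
                    new[i + j] += power[i] * psi[j]
            power = new
            total += power[k]                 # coefficient of zeta^k in the current power
        tau.append(total)
    return tau[1:]


if __name__ == "__main__":
    print(tau_coefficients(7))
```

The ratios $\tau_{k+1}/\tau_k$ of the integers produced this way approach $3 + 2\sqrt{2}$, the reciprocal of the radius $3 - 2\sqrt{2}$ obtained from the closed form of $\psi$.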

By (4), (15) and (17),

$$|c_k| \le \delta_k \tau_k \qquad (k = 2, 3, \dots).$$

It follows now from Lemma 3, that the Schröder series $\varphi(\zeta)$ converges in the
circle $|\zeta| < (3 - 2\sqrt{2})\, 2^{-5\nu-1}$.


Institute for Advanced Study



Annals of Mathematics
Vol. 43, No. 4, October, 1942


NOTE ON AUTOMORPHIC FUNCTIONS OF SEVERAL VARIABLES

By Carl Ludwig Siegel
(Received April 28, 1942)


1 

Some years ago I found a method¹ of estimating the number of linearly independent
modular forms of degree n and of weight g, which has been useful for
the demonstration² of certain identities in the analytical theory of quadratic
forms. The object of this note is to prove an analogous estimate concerning
automorphic functions.

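Let $\mathfrak{Z} = (z_{kl})$ be a complex symmetric matrix with n rows, and consider the
space E defined by the condition $\mathfrak{E} - \mathfrak{Z}\bar{\mathfrak{Z}} > 0$, with the line element

$$ds^2 = \sigma\{4\, d\mathfrak{Z}\, (\mathfrak{E} - \bar{\mathfrak{Z}}\mathfrak{Z})^{-1}\, d\bar{\mathfrak{Z}}\, (\mathfrak{E} - \mathfrak{Z}\bar{\mathfrak{Z}})^{-1}\},$$

the symbol $\sigma$ denoting the trace. If $\mathfrak{A}$ and $\mathfrak{B}$ are n-rowed complex square
matrices satisfying $\mathfrak{A}\mathfrak{B}' = \mathfrak{B}\mathfrak{A}'$ and $\mathfrak{A}\bar{\mathfrak{A}}' - \mathfrak{B}\bar{\mathfrak{B}}' = \mathfrak{E}$, then the linear
transformation

(1) $$\mathfrak{Z}^* = (\mathfrak{A}\mathfrak{Z} + \mathfrak{B})(\bar{\mathfrak{B}}\mathfrak{Z} + \bar{\mathfrak{A}})^{-1}$$

defines an isometric mapping of E onto itself. Those transformations constitute
a group $\Omega$.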

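Denoting by $\rho(\mathfrak{Z}_1, \mathfrak{Z}_0)$ the distance of two arbitrary points $\mathfrak{Z}_1$ and $\mathfrak{Z}_0$ of E,
we have³

$$\rho(\mathfrak{Z}_1, 0) = \Bigl(\sum_{k=1}^{n} u_k^2\Bigr)^{1/2},$$

where

$$u_k = \log \frac{1 + \lambda_k^{1/2}}{1 - \lambda_k^{1/2}} \qquad (k = 1, \dots, n)$$

and $\lambda_1, \dots, \lambda_n$ are the characteristic roots of the hermitian matrix $\mathfrak{Z}_1 \bar{\mathfrak{Z}}_1$.
Since

$$\frac{4\lambda_k}{1 - \lambda_k} = e^{u_k} + e^{-u_k} - 2 = 2 \sum_{l=1}^{\infty} \frac{u_k^{2l}}{(2l)!}$$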

1 C. L. Siegel, Einführung in die Theorie der Modulfunktionen n-ten Grades, Math. Ann.
116, pp. 617-667 (1939).

2 H. Maass, Zur Theorie der automorphen Funktionen von n Veränderlichen, Math. Ann.
117, pp. 538-578 (1940).

E. Witt, Eine Identität zwischen Modulformen zweiten Grades, Abh. Math. Sem. Hansischen
Univ. 14, pp. 323-337 (1941).

H. Maass, Modulformen und quadratische Formen über dem quadratischen Zahlkörper
R(√5), Math. Ann. 118, pp. 66-84 (1942).

3 C. L. Siegel, Symplectic geometry, submitted for publication in the Amer. J. Math.



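and

$$\sum_{k=1}^{n} u_k^{2l} \le \Bigl(\sum_{k=1}^{n} u_k^2\Bigr)^{l} = \rho^{2l}(\mathfrak{Z}_1, 0),$$

we obtain the inequality

(2) $$\sum_{k=1}^{n} \frac{\lambda_k}{1 - \lambda_k} \le \sinh^2 \tfrac{1}{2}\rho, \qquad \rho = \rho(\mathfrak{Z}_1, 0).$$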

Let $\Delta$ be a subgroup of $\Omega$ discontinuous in E, and assume that all frontier
points of a fundamental domain F of $\Delta$ belong to E; i.e. E is compact relative
to $\Delta$. The least upper bound of the distance $\rho(\mathfrak{Z}_1, \mathfrak{Z}_0)$ for two variable points
$\mathfrak{Z}_1$ and $\mathfrak{Z}_0$ of F is a finite positive number $\delta$, the diameter of F. We use the
abbreviations

(3) $$\nu = \frac{n(n+1)}{2}, \qquad b = \sinh^2 \tfrac{1}{2}\delta, \qquad c = (\nu + 1)\, b^{\nu}.$$


2 

An analytic function $f(\mathfrak{Z})$ of the $\nu$ independent variables $z_{kl}$ ($1 \le k \le l \le n$)
is called an automorphic form with the group $\Delta$, if it is regular in E and satisfies
there the equations

(4) $$f(\mathfrak{Z}^*) = v(\mathfrak{A}, \mathfrak{B})\, |\bar{\mathfrak{B}}\mathfrak{Z} + \bar{\mathfrak{A}}|^{g} f(\mathfrak{Z})$$

for all transformations (1) in the group $\Delta$, where g is a constant and the numbers
$v = v(\mathfrak{A}, \mathfrak{B})$ depend only upon $\mathfrak{A}$ and $\mathfrak{B}$. Let $L = L(\Delta, g, v)$ denote the set of
all such functions $f(\mathfrak{Z})$, the weight g and the multiplier system v being given.
If $f_1$ and $f_2$ belong to this set, then so does $\lambda f_1 + \mu f_2$, for arbitrary complex
constants $\lambda$ and $\mu$; hence L is a vector space with a certain (finite or infinite)
dimension d.

For automorphic forms of a single variable, i.e. in the case n = 1, the number
d is given by the generalized Riemann-Roch theorem.⁴ It is not known in
which way this theorem might be extended to automorphic forms of several
variables. We now assume that the weight g is real and that all multipliers
$v(\mathfrak{A}, \mathfrak{B})$ have absolute value 1. We shall derive a finite upper bound of d
depending only upon n, g and $\delta$.

Consider first the case g = 0. Then, by (4) the absolute value abs $f(\mathfrak{Z})$
is invariant under $\Delta$; consequently it attains in E a maximum at an inner
point. This proves $f(\mathfrak{Z})$ is a constant, whence d = 1, if $v(\mathfrak{A}, \mathfrak{B}) = 1$, and d = 0
otherwise. In the remainder of the paper, we suppose $g \neq 0$.

Lemma: Let $f(\mathfrak{Z})$ be a function of the set $L(\Delta, g, v)$, not identically 0. If all its
partial derivatives of the orders 0, 1, ..., h - 1 ($h \ge 0$) vanish at a point $\mathfrak{Z}_0$ of E,
then $h \le bg$.

4 E. Ritter, Die multiplicativen Formen auf algebraischem Gebilde beliebigen Geschlechtes
mit Anwendung auf die Theorie der automorphen Formen, Math. Ann. 44, pp. 261-374 (1894).

H. Petersson, Zur analytischen Theorie der Grenzkreisgruppen, Teil II, Math. Ann. 115,
pp. 175-204 (1938).





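Proof: The continuous function

$$\varphi(\mathfrak{Z}) = |\mathfrak{E} - \mathfrak{Z}\bar{\mathfrak{Z}}|^{g/2}\, \text{abs}\, f(\mathfrak{Z})$$

is invariant under $\Delta$; consequently it has in E a maximum $\mu > 0$, which is
attained at a point $\mathfrak{Z}_1$ of F. On account of (4), we may assume that $\mathfrak{Z}_0$ also
lies in F. In case h > 0, the function $f(\mathfrak{Z})$ vanishes at $\mathfrak{Z} = \mathfrak{Z}_0$, whence $\mathfrak{Z}_1 \neq \mathfrak{Z}_0$.
In case h = 0, the assumption of the lemma holds for every point $\mathfrak{Z}_0$ of E, and
we may suppose $\mathfrak{Z}_1 \neq \mathfrak{Z}_0$.

If the transformation (1) is any given element M of the group $\Omega$, then the
function $|\bar{\mathfrak{B}}\mathfrak{Z} + \bar{\mathfrak{A}}|^{-g} f(\mathfrak{Z}^*)$ belongs to $L(M^{-1}\Delta M, g, v)$. Since $\Omega$ is transitive
in E and the diameter $\delta$ is invariant under $\Omega$, we may assume for the proof of
the lemma that $\mathfrak{Z}_0 = 0$ and $\rho(\mathfrak{Z}_1, 0) \le \delta$. Let $\lambda_1, \dots, \lambda_n$ be the characteristic
roots of $\mathfrak{Z}_1\bar{\mathfrak{Z}}_1$, $0 \le \lambda_1 \le \cdots \le \lambda_n$; then $0 < \lambda_n < 1$ and, by (2) and (3),

(5) $$0 < \sum_{k=1}^{n} \frac{\lambda_k}{1 - \lambda_k} \le b.$$

We introduce a single complex variable z and choose in particular $\mathfrak{Z} = z\mathfrak{Z}_1$.
For all points z of the circle $z\bar{z} < \lambda_n^{-1}$, the matrix $\mathfrak{Z}$ lies in E; hence there $f(\mathfrak{Z})$ is
a regular analytic function $\psi(z)$ which vanishes at the point z = 0 at least of the
order h and satisfies the relationship

$$\text{abs}\, \psi(z) = |\mathfrak{E} - z\bar{z}\,\mathfrak{Z}_1\bar{\mathfrak{Z}}_1|^{-g/2}\, \varphi(z\mathfrak{Z}_1) \le |\mathfrak{E} - z\bar{z}\,\mathfrak{Z}_1\bar{\mathfrak{Z}}_1|^{-g/2}\, \mu,$$

where the equality holds for z = 1.

Let $1 < t < \lambda_n^{-1}$. On the circle $z\bar{z} \le t$, the analytic function $z^{-h}\psi(z)$ attains
the maximum of its absolute value at a point of the boundary, whence

$$\text{abs}\, \psi(1) \le t^{-h/2} \max_{z\bar{z} = t} \text{abs}\, \psi(z),$$

$$|\mathfrak{E} - \mathfrak{Z}_1\bar{\mathfrak{Z}}_1|^{-g/2}\, \mu \le t^{-h/2}\, |\mathfrak{E} - t\,\mathfrak{Z}_1\bar{\mathfrak{Z}}_1|^{-g/2}\, \mu.$$

But $|\mathfrak{E} - t\,\mathfrak{Z}_1\bar{\mathfrak{Z}}_1| = \prod_{k=1}^{n} (1 - t\lambda_k)$ and therefore

$$h \le g \log \prod_{k=1}^{n} \frac{1 - \lambda_k}{1 - t\lambda_k} \Big/ \log t \qquad (1 < t < \lambda_n^{-1}).$$

Performing the passage to the limit $t \to 1$, we obtain the inequality

(6) $$h \le g \sum_{k=1}^{n} \frac{\lambda_k}{1 - \lambda_k}.$$

The assertion of the lemma follows from (5) and (6).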

Theorem: The dimension d of $L(\Delta, g, v)$ is 0 for g < 0 and $\le c g^{\nu}$ for g > 0.

Proof: Assume d > 0 and choose in $L(\Delta, g, v)$ a function $f(\mathfrak{Z})$, which does
not vanish identically. Applying the lemma with h = 0, we infer $0 \le bg$. This
proves the theorem in the case g < 0.

Now consider the case g > 0. If $f(\mathfrak{Z}) \neq 0$ everywhere in E, then $f^{-1}(\mathfrak{Z})$ is
a non-vanishing function of the set $L(\Delta, -g, v^{-1})$ and -g < 0, which is impossible.
Consequently we may apply the lemma with h = 1 and obtain the
inequality $1 \le bg$, whence $1 < (\nu + 1)(bg)^{\nu} = c g^{\nu}$. This proves the theorem
in the case g > 0 and d = 0 or 1.

In the remaining case g > 0, $d \ge 2$, let $f_1, \dots, f_m$ be a finite number of linearly
independent functions in $L(\Delta, g, v)$ and $m \ge 2$. We determine the positive
integer h by the condition

(7) $$\binom{h + \nu - 1}{\nu} < m \le \binom{h + \nu}{\nu}$$

and choose m constants $a_1, \dots, a_m$, not all 0, such that all partial derivatives
of the orders 0, 1, ..., h - 1 vanish for the function

$$f(\mathfrak{Z}) = a_1 f_1 + \cdots + a_m f_m$$

at the point $\mathfrak{Z} = 0$; this is possible, by (7), since we have to satisfy $\binom{h+\nu-1}{\nu}$
homogeneous linear equations with the m unknown quantities $a_1, \dots, a_m$.
By (7) and the lemma,

$$m \le \binom{h + \nu}{\nu} \le (\nu + 1) h^{\nu} \le (\nu + 1)(bg)^{\nu} = c g^{\nu}.$$

This proves the remaining part of the theorem.
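The counting step in this proof can be checked mechanically. The following is a minimal sketch that is not part of the original paper. It assumes that the number of conditions is the standard count $\binom{h+\nu-1}{\nu}$ of partial derivatives of orders 0, ..., h - 1 in $\nu$ variables (condition (7) is not legible in this copy, so this form is an assumption), and it tests the elementary estimate $\binom{h+\nu}{\nu} \le (\nu+1)h^{\nu}$ for $h \ge 1$. The function names are illustrative.

```python
from math import comb


def derivative_conditions(h, nu):
    """Number of partial derivatives of orders 0, ..., h-1 in nu variables."""
    return comb(h - 1 + nu, nu)


def derivative_conditions_by_order(h, nu):
    """Same count, summed order by order: comb(j + nu - 1, nu - 1) derivatives of order j."""
    return sum(comb(j + nu - 1, nu - 1) for j in range(h))


def binomial_bound_holds(h, nu):
    """Check comb(h + nu, nu) <= (nu + 1) * h**nu, valid for h >= 1."""
    return comb(h + nu, nu) <= (nu + 1) * h ** nu


if __name__ == "__main__":
    n = 2                      # degree; nu = n(n+1)/2 independent variables
    nu = n * (n + 1) // 2
    print(nu, derivative_conditions(3, nu), binomial_bound_holds(3, nu))
```

The estimate follows from $(h+i)/i \le h(i+1)/i$ for $h \ge 1$, whose product over i = 1, ..., $\nu$ telescopes to $(\nu+1)h^{\nu}$.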


3 

A function $\chi(\mathfrak{Z})$ is called an automorphic function with the group $\Delta$, if $\chi(\mathfrak{Z}) =
f_1/f_0$, $f_0$ not identically 0, where $f_1$ and $f_0$ are automorphic forms in the same set
$L(\Delta, g, v)$. For a sufficiently large value G > 0, certain functions in the set
$L(\Delta, G, 1)$ can be expressed as Poincaré series,⁵ and it may be proved by known
methods that there exist $\nu + 1$ of those functions, say $F_0, \dots, F_\nu$, which are
algebraically independent. Then the $\nu$ quotients $\chi_k = F_k/F_0$ (k = 1, ..., $\nu$)
are algebraically independent automorphic functions with the group $\Delta$.

Define $q = [c\,\nu!\,G^{\nu}]$ and choose a positive integer Q satisfying the condition
$q + 1 > c\,\nu!\,(G + gqQ^{-1})^{\nu}$. The number of power products

$$P = \chi^{r} \prod_{k=1}^{\nu} \chi_k^{s_k}$$

with $0 \le r \le q$, $0 \le s_k$ (k = 1, ..., $\nu$), $s_1 + \cdots + s_\nu \le Q$ is

(8) $$A = (q + 1)\binom{Q + \nu}{\nu} > \frac{q + 1}{\nu!}\, Q^{\nu} > c\,(gq + GQ)^{\nu};$$

we denote them by $P_1, \dots, P_A$. Then, the A functions $f_0^{q} F_0^{Q} P_j$ (j = 1, ..., A)
are automorphic forms of the set $L(\Delta, gq + GQ, v^{q})$; by (8) and the theorem,
they are linearly dependent. Consequently, the automorphic function $\chi$
satisfies an algebraic equation of degree q whose coefficients are polynomials in
$\chi_1, \dots, \chi_\nu$ and not all identically 0. Since q is fixed, the automorphic functions
with the group $\Delta$ form an algebraic field with exactly $\nu$ independent elements.

Institute for Advanced Study

5 M. Sugawara, Über eine allgemeine Theorie der Fuchsschen Gruppen und Theta-Reihen,
Ann. of Math. (2) 41, pp. 488-494 (1940).



Annals of Mathematics
Vol. 43, No. 4, October, 1942


ON THE DERIVATIVES OF THE SECTIONS OF BOUNDED POWER 

SERIES 1 

By Rhoda Manning 
(Received January 8, 1942) 

1. Introduction 

Let f(z) represent a power series convergent in the open unit circle |z| < 1
and satisfying the condition $|f(z)| \le 1$ in |z| < 1. It is well known that the
sections $s_n(z)$ of f(z) are not in general bounded in the open unit circle |z| < 1.²
In 1925 L. Fejér³ proved that the sections $s_n(z)$, for all such functions f(z),
satisfy the condition $|s_n(z)| \le 1$ in the circle $|z| \le \tfrac{1}{2}$ for all n, and that this
number $\tfrac{1}{2}$ cannot in general be replaced by a larger number.

Let $r_n$ denote the radius of the largest circle $|z| \le r_n$ in which the sections
$s_n(z)$, for all functions f(z) of the above type, satisfy the condition $|s_n(z)| \le 1$.
I. Schur and G. Szegö,⁴ extending Fejér's result, proved that the radii $r_n$ constitute
a monotone increasing sequence of algebraic numbers having the limit
unity. They also studied the subclass of all functions f(z) satisfying the
additional condition f(0) = 0, and showed that, for all such functions f(z),
the radius $R_n$ of the largest circle $|z| \le R_n$ in which the condition $|s'_{n+1}(z)| \le 1$
holds, for odd n, $n \ge 1$, satisfies the algebraic equation

$$1 - 2r - r^2 - (2n + 4) r^{n+1} - (2n + 2) r^{n+2} = 0.⁵$$

Hence the sequence $\{R_n\}$, n odd, is ever increasing. The object of this note
is to discuss the determination of the radii $R_n$ in the case when n is even. The
author has found in this case that the $R_n$, provided $n \ge 12$, satisfy the similar
equation

$$1 - 2r - r^2 + (2n + 4) r^{n+1} + (2n + 2) r^{n+2} = 0.$$

Hence for even n, $n \ge 12$, the sequence $\{R_n\}$ is ever decreasing. Both sequences
have the common limit $\rho = 2^{1/2} - 1$, the only positive root of the equation
$1 - 2r - r^2 = 0$.


1 Presented to the Society, December 2, 1939.

2 L. Fejér, Über gewisse Potenzreihen an der Konvergenzgrenze, Sitzungsber. der math.-physik.
Klasse der Bayer. Akad. der Wiss., 1910, Nr. 3.

3 L. Fejér, Über die Positivität von Summen, die nach trigonometrischen oder Legendreschen
Funktionen fortschreiten (Erste Mitteilung), Acta litt. ac sci. regiae univ. hung. Francisco-Josephinae,
sectio sci. math., vol. 2, 1925, pp. 75-86.

4 I. Schur and G. Szegö, Über die Abschnitte einer im Einheitskreise beschränkten Potenzreihe,
Sitzungsber. der Preuss. Akad. der Wiss., physik.-math. Klasse, 1925, in particular
pp. 545-555.

5 Loc. cit. (4), p. 560.




It follows that the sections of f'(z), for even n, $n \ge 12$, in general remain
bounded by unity "longer" than the function f'(z) itself, an unusual occurrence
in this type of problem. As another immediate consequence of the theorem,
we regain the well known fact that the derivative f'(z) cannot exceed unity in
absolute value in the circle $|z| \le 2^{1/2} - 1$, and that the bound $2^{1/2} - 1$ cannot in
general be replaced by a larger one.⁶

In a thesis submitted to Stanford University⁷ the author has shown, by
treating each case separately, that the numbers $R_n$, for the values of n excluded
by the theorem, are also algebraic, and that they satisfy the following
order relations:

R\ < Rg < R3 < Ri < i?* < Rt < Ri < Rg < Rs < Rn < Ru < • • • 

... < p = 2 * — 1 < • • • < Ru < Rn < .Rio • 

To facilitate computation of the radii $R_n$, $n \ge 12$, an asymptotic expression
for $R_n$ is given, of the form

$$R_n = \rho + (-1)^n a_n \rho^{n+1} + b_n \rho^{2n+1} + (-1)^n c'_n \rho^{3n+1},$$

where

$$a_n = n + 1 + \tfrac{1}{2}\, 2^{1/2}, \qquad b_n = (n + 1)(n + 2)\, a_n - \tfrac{1}{4}(2 - 2^{1/2})\, a_n^2,$$

and $0 < c'_n < 2 a_n (n + 1)^2 (n + 2)^2$.

Finally, a closely related theorem, stated by I. Schur and G. Szegö for odd
values of n, is generalized to include large even values of n.⁸
2. Main theorem 

Let f(z) represent a power series convergent in the open unit circle |z| < 1 and
satisfying the conditions $|f(z)| \le 1$ in |z| < 1 and f(0) = 0. Let $R_n$ denote the
radius of the largest circle $|z| \le R_n$ in which the section $s'_{n+1}(z)$, for all power series
f(z), satisfies the condition $|s'_{n+1}(z)| \le 1$. If $n \ge 1$, $n \neq 2, 4, 6, 8, 10$, n an
integer, then the radius $R_n$ is the smallest positive root of the algebraic equation

$$G_n(r) = 1 - 2r - r^2 + (-1)^n [(2n + 4) r^{n+1} + (2n + 2) r^{n+2}] = 0.$$
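The radii described by the theorem are easy to tabulate. The following is a minimal sketch that is not part of the original paper: it locates the smallest positive root of $G_n$ by bisection on the bracket (0, 0.42). The bracket is an assumption of this sketch, chosen because $G_n(0) = 1 > 0$ and $G_n(0.42) < 0$ for the values of n covered by the theorem; the function names are illustrative.

```python
import math

RHO = math.sqrt(2) - 1   # common limit of the radii, the positive root of 1 - 2r - r^2 = 0


def G(n, r):
    """G_n(r) = 1 - 2r - r^2 + (-1)^n [(2n+4) r^(n+1) + (2n+2) r^(n+2)]."""
    sign = -1 if n % 2 else 1
    return 1 - 2 * r - r * r + sign * ((2 * n + 4) * r ** (n + 1) + (2 * n + 2) * r ** (n + 2))


def smallest_root(n, lo=0.0, hi=0.42, iterations=200):
    """Bisection for the root of G_n in (lo, hi); requires G_n(lo) > 0 > G_n(hi)."""
    if not (G(n, lo) > 0 > G(n, hi)):
        raise ValueError("no sign change on the chosen bracket")
    for _ in range(iterations):
        mid = (lo + hi) / 2
        if G(n, mid) > 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


if __name__ == "__main__":
    for n in (1, 3, 5, 13, 12, 14):
        print(n, smallest_root(n), RHO)
```

For n = 1 the equation reduces to $1 - 2r - 7r^2 - 4r^3 = 0$, which has the root r = 1/4; the odd radii increase towards $\rho$ from below while the even radii with $n \ge 12$ decrease towards it from above.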

Proof. It has been shown⁹ that the radius of the largest circle $|z| \le R_n$
in which the condition $|s'_{n+1}(z)| \le 1$ holds, is the maximum value of r for which
the harmonic polynomial

$$T_n(r, \varphi) = \tfrac{1}{2} + 2r \cos \varphi + 3r^2 \cos 2\varphi + \cdots + (n + 1) r^n \cos n\varphi$$


6 J. Dieudonné, Polynomes et fonctions bornées d'une variable complexe, École Normale
Supérieure, Annales Scientifiques, vol. 48, 1930-31, p. 352.

7 Dissertation, Stanford University, June 1941.

8 Loc. cit. (4), pp. 558-559.

9 Loc. cit. (5).





remains non-negative, for all real values of $\varphi$. Let us denote the product

$$2(1 - 2r \cos \varphi + r^2)^2\, T_n(r, \varphi) = 1 - 4r^2 + 4r^3 \cos \varphi - r^4$$
$$\quad - (2n + 4) r^{n+1} \cos (n + 1)\varphi + 2 r^{n+2} [(2n + 4) \cos n\varphi + (n + 1) \cos (n + 2)\varphi]$$
$$\quad - 2 r^{n+3} [(n + 2) \cos (n - 1)\varphi + (2n + 2) \cos (n + 1)\varphi] + (2n + 2) r^{n+4} \cos n\varphi$$

by $F_n(r, \varphi)$. Since the cosine is an even function, we need only consider values
of $\varphi$ satisfying $0 \le \varphi \le \pi$.

If n is odd, then $F_n(r, \pi)$ is an obvious lower boundary for $F_n(r, \varphi)$. Since
further $F_n(r, \pi) = (1 + r)^2\, G_n(r)$, $R_n$ is the only positive root of the equation
$G_n(r) = 0$.
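The closed form of $F_n$ and the factorization at $\varphi = \pi$ can be confirmed numerically. The following is a minimal sketch that is not part of the original paper: it evaluates $2(1 - 2r\cos\varphi + r^2)^2 T_n(r, \varphi)$ directly from the definition of $T_n$ and compares it with the expanded expression. The function names are illustrative.

```python
import math


def T(n, r, phi):
    """T_n(r, phi) = 1/2 + sum_{k=1}^{n} (k+1) r^k cos(k phi)."""
    return 0.5 + sum((k + 1) * r ** k * math.cos(k * phi) for k in range(1, n + 1))


def F_direct(n, r, phi):
    """F_n computed from its definition as a product."""
    return 2 * (1 - 2 * r * math.cos(phi) + r * r) ** 2 * T(n, r, phi)


def F_closed(n, r, phi):
    """F_n from the expanded trigonometric expression."""
    def c(m):
        return math.cos(m * phi)
    return (1 - 4 * r ** 2 + 4 * r ** 3 * c(1) - r ** 4
            - (2 * n + 4) * r ** (n + 1) * c(n + 1)
            + 2 * r ** (n + 2) * ((2 * n + 4) * c(n) + (n + 1) * c(n + 2))
            - 2 * r ** (n + 3) * ((n + 2) * c(n - 1) + (2 * n + 2) * c(n + 1))
            + (2 * n + 2) * r ** (n + 4) * c(n))


def G(n, r):
    """G_n(r) as in the main theorem."""
    sign = -1 if n % 2 else 1
    return 1 - 2 * r - r * r + sign * ((2 * n + 4) * r ** (n + 1) + (2 * n + 2) * r ** (n + 2))
```

The agreement rests on the identity $\sum_{k=0}^{n} (k+1) z^k = [1 - (n+2) z^{n+1} + (n+1) z^{n+2}]/(1-z)^2$ with $z = re^{i\varphi}$, of which $T_n + \tfrac{1}{2}$ is the real part.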

Now let n be even. If r = 0.42 and $n \ge 8$, then

$$F_n(r, \pi) = (1 + r)^2 (1 - 2r - r^2 + (2n + 4) r^{n+1} + (2n + 2) r^{n+2})$$
$$< (1 + r)^2 (-0.0164 + 0.0113) < 0,$$

whence $R_n < 0.42$, $n \ge 8$. Hence we may restrict our proof of the inequality

$F_n(r, \varphi) \ge F_n(r, \pi)$, $n \ge 12$, to values of r contained in the interval 0 < r < 0.42.

Now 

$$F_n(r, \varphi) - F_n(r, \pi) = 4r^3 (1 + \cos \varphi) - 2 r^{n+1} [(n + 2)(1 + \cos (n + 1)\varphi)$$
$$\quad + \{(2n + 4)(1 - \cos n\varphi) + (n + 1)(1 - \cos (n + 2)\varphi)\}\, r$$
$$\quad + \{(n + 2)(1 + \cos (n - 1)\varphi) + (2n + 2)(1 + \cos (n + 1)\varphi)\}\, r^2$$
$$\quad + (n + 1)(1 - \cos n\varphi)\, r^3].$$

On dividing this difference by the positive quantity $2r^3(1 + \cos \varphi)$, we notice
that all the terms except the first in the resulting expression are of the form

$$-c\, \frac{1 + (-1)^{k-1} \cos k\varphi}{1 + \cos \varphi} = -c\, \frac{1 - \cos k(\pi - \varphi)}{1 - \cos (\pi - \varphi)},$$

c > 0, k = 1, 2, 3, .... It is easily verified that this expression is never less
than $-ck^2$, and that it attains this value for $\varphi = \pi$. Hence the inequality to
be proved is equivalent to

$$\lim_{\varphi \to \pi} \frac{F_n(r, \varphi) - F_n(r, \pi)}{2r^3 (1 + \cos \varphi)} = 2 - r^{n-2} [(n + 2)(n + 1)^2$$
$$\quad + \{(2n + 4) n^2 + (n + 1)(n + 2)^2\}\, r + \{(n + 2)(n - 1)^2 + (2n + 2)(n + 1)^2\}\, r^2 + (n + 1) n^2 r^3] \ge 0,$$

which holds since we can satisfy simultaneously

$$r^{n-2} (n + 2)^3 (1 + r)^3 < 2, \qquad 0 < r < 0.42 \quad \text{and} \quad n \ge 12.$$





Hence for even n, $n \ge 12$, and 0 < r < 0.42,

$$F_n(r, \varphi) \ge F_n(r, \pi) = (1 + r)^2\, G_n(r),$$

whence $R_n$ is the smallest positive root of the algebraic equation $G_n(r) = 0$.

3. Asymptotic Inequalities 

It will be assumed in the two cases which follow that $n \ge 12$.

Case 1. n odd. The following derivation of an upper bound for $R_n$ depends
on the remark that $R_n$ is the only positive root of the equation $G_n(r) = 0$. Since
$G_n(0) = 1 > 0$ and $G_n(r)$ is ever decreasing for positive values of r, we conclude
that if $G_n(r) < 0$ for $r = r_0$, say, then $R_n < r_0$.

We shall suppose that $r = \rho(1 - x)$, where $x = a_n \rho^n - b_n \rho^{2n}$, $a_n = n + 1 + \tfrac{1}{2}\, 2^{1/2}$,
$b_n = (n + 1)(n + 2)\, a_n - \tfrac{1}{4}(2 - 2^{1/2})\, a_n^2$, and show that with this choice
of r, $G_n(r) < 0$. We note that $0 < x < a_n \rho^n < 0.0005$, since $\rho^{12} = 0.0000255\ldots$
and $n^3 \rho^n$ is a decreasing function of n. Therefore r > 0 and for k = 1, 2,
$r^{n+k} > \rho^{n+k} [1 - (n + k)x]$.¹⁰ Hence

$$G_n(r) < 1 - 2r - r^2 - (2n + 4) \rho^{n+1} [1 - (n + 1)x] - (2n + 2) \rho^{n+2} [1 - (n + 2)x]$$
$$= 2(2^{1/2}) \rho x - \rho^2 x^2 - 2(2^{1/2}) \rho^{n+1} [a_n - (n + 1)(n + 2)x]$$
$$= 2(2^{1/2}) \rho (a_n \rho^n - b_n \rho^{2n}) - \rho^2 (a_n^2 \rho^{2n} - 2 a_n b_n \rho^{3n} + b_n^2 \rho^{4n})$$
$$\quad - 2(2^{1/2}) \rho^{n+1} [a_n - (n + 1)(n + 2)\, a_n \rho^n + (n + 1)(n + 2)\, b_n \rho^{2n}]$$
$$= -b_n \rho^{3n+1} [2(2^{1/2})(n + 1)(n + 2) - 2 a_n \rho + b_n \rho^{n+1}] < 0.$$

To derive a lower bound for R n , we notice that if G„(r) > 0 , then r < R n . 
Set r = p(l — x), where x — a n p n — b n p n + c„p 3B , with a n , b n as before, and 
c« = 2a»(« + l) 2 (n + 2) 4 . On expanding (1 — x) n+k as an exponential series 
with (n + k) log (1 — x) as argument, we find r n+k < p n+k [ 1 — (» + k)x + n 2 x 2 ], 
k = 1 , 2. Hence 

Gn(r) > 2 ( 2 *)p® - P V - 2 ( 2 *)p B+ 1 [a n - (n + l)(n + 2)x + a„nV] 

> 2 ( 2 } )p n+ 1 (a„ - Kp n + cj n ) - o 2 p 2B+4 - 2 ( 2 ‘)p B+ 1 [a„ 

— (« + l)(w + 2 )p B (a„ — f> n p" + c„p 2n ) + n 2 o 3 p 2B ] 

- 2 ( 2 ‘)p 3 n+I [c„ - (» + l)(n + 2 ) 6 n + (» + 1 )(» + 2 )c„p" - »V.J > 0 , 

since x < a„p B and c n > 2 (n + l)(n + 2 ) 6 » > 2 n 2 a 3 . 

Case 2. n et>en. To derive a lower-bound for R„ , we notice that (?»(r) = 0 
has two positive roots, the smaller of which is R n , and that (?»(0) = 1 > 0. 
Hence if G n (r) > 0 and G' n (r ) < 0 simultaneously, then r < R n . Let r = 
p(l + x), where x = o B p n + b H p in . A repetition of the argument in the first 
part of Case 1 gives r n+k > p n+i [l + (n + A:)x], k = 1, 2, and that <7„(r) > 0. 
That (?l(r) < 0, for 0 < r < J, is trivial. 


10 Hardy, Littlewood and Pólya, Inequalities, p. 40.





To find an upper bound for R n , we note that if <7„(r) < 0, then R n < r. 
Set r = p(l + x), where x = a n p n + 6„p 2n + c„p 8n . Then x < (n + 2)p“, and 
with a little attention it can be seen that r B+ * = exp[(n + k ) log p(l x)] < 
p B+ ‘[l + (n + k)x + (n + 1)V], k = 1,2. Hence 

0»(r) < — 2(2*)px - P V + 2(2 i )p n+1 [a„ + (n + 1)(» + 2)x + a„(n + 1)V] 

< - 2(2*)p B+1 (a n + b n p n + cj n ) - a 2 p 2 " +2 - 2 a n bj n+2 

+ 2(2 i )p" +1 [a» + (n + l)(n + 2)p n (a„ + 6„p n + c»p 2 ") + a„(n + 1)* 

(n + 2)V"] 

= - (2 } )p $B+I [c„ - 2 b n (n + 1 )(n + 2) - 2c»(n + 1)(» + 2)p B + 

(2 1 )a„6„p] < 0. 


4. A Related Theorem 

The method of proof of the main theorem applies to the following theorem:
Let $f(z)$ represent a power series convergent in the open unit circle $|z| < 1$ and satisfying the condition $|f(z)| \le 1$ in $|z| < 1$. Let $\alpha < 0$, $\beta > 0$, $\alpha + \beta = 1$, and let $r_n$ be the radius of the largest circle $|z| \le r_n$ in which the sections $s_n(z)$, for all power series $f(z)$, satisfy the condition $|\alpha s_0(z) + \beta s_n(z)| \le 1$. Then for odd $n$, $n \ge 1$, and for sufficiently large even $n$, the radii $r_n$ satisfy the algebraic equation

$$1 + (\alpha - \beta)r + (-1)^n 2\beta r^{n+1} = 0.$$

Hence $\lim_{n\to\infty} r_n = \dfrac{1}{\beta - \alpha}$.
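The limit stated in the theorem can be checked numerically. The following is only a minimal sketch, not part of the paper: the function name `section_radius` and the sample values $\alpha = -1$, $\beta = 2$ are assumptions chosen for illustration, and the bisection is restricted to odd $n$, where the left side of the equation is strictly decreasing in $r$ and changes sign on $(0, 1)$.

```python
def section_radius(alpha: float, beta: float, n: int, iters: int = 200) -> float:
    """Root in (0, 1) of 1 + (alpha - beta) r + (-1)^n 2 beta r^(n+1), odd n only.

    Hypothetical helper for illustration; assumes alpha < 0 < beta, alpha + beta = 1.
    """
    if n % 2 == 0:
        raise ValueError("this sketch only brackets the root for odd n")

    def g(r: float) -> float:
        # For odd n the sign factor (-1)^n equals -1.
        return 1.0 + (alpha - beta) * r - 2.0 * beta * r ** (n + 1)

    lo, hi = 0.0, 1.0  # g(0) = 1 > 0 and g(1) = 2 (alpha - beta) < 0
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if g(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)
```

With $\alpha = -1$, $\beta = 2$ the equation for $n = 1$ is $1 - 3r - 4r^2 = 0$, whose positive root is $1/4$, and for larger odd $n$ the computed root approaches $1/(\beta - \alpha) = 1/3$.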

Proof. The radius $r_n$ is the maximum value of $r$ for which the cosine polynomial

$$T_n(r, \phi) = \alpha/2 + \beta(\tfrac12 + r\cos\phi + r^2\cos 2\phi + \cdots + r^n\cos n\phi)$$

remains non-negative, for all real values of $\phi$.$^{11}$ Let

$$\begin{aligned}
F_n(r, \phi) &= 2(1 - 2r\cos\phi + r^2)\,T_n(r, \phi)\\
&= \alpha(1 - 2r\cos\phi + r^2) + \beta[1 - r^2 + 2r^{n+2}\cos n\phi - 2r^{n+1}\cos(n+1)\phi].
\end{aligned}$$

If $r \ge 1$,

F n ^r, ^ g a(l — r) 2 + 0 £l — r 2 — 2r n+1 + cos W ^ 1 

£ - 2/3^1 + cos < 0. 

Hence $r_n < 1$ for all $n$.


11 Loc. cit. (4), p. 568.





RHODA MANNING 


Now

$$\begin{aligned}
F_n(r, \pi) &= \alpha(1 + 2r + r^2) + \beta(1 - r^2 + (-1)^n 2r^{n+1} + (-1)^n 2r^{n+2})\\
&= 1 + 2\alpha r + (\alpha - \beta)r^2 + (-1)^n 2\beta r^{n+1} + (-1)^n 2\beta r^{n+2}\\
&= (1 + r)(1 + (\alpha - \beta)r + (-1)^n 2\beta r^{n+1}).
\end{aligned}$$

If $n$ is odd, then $F_n(r, \pi)$ is an obvious lower bound for $F_n(r, \phi)$, for all real $\phi$.
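The closed form of $F_n$ and its factorization at $\phi = \pi$ can be spot-checked numerically. This is a minimal sketch under stated assumptions: the function names are hypothetical, the sample parameters are arbitrary, and the factorization step relies on $\alpha + \beta = 1$ as in the theorem.

```python
import math

def T(alpha, beta, r, phi, n):
    """Cosine polynomial T_n(r, phi) from the proof."""
    partial = 0.5 + sum(r ** k * math.cos(k * phi) for k in range(1, n + 1))
    return alpha / 2 + beta * partial

def F_closed(alpha, beta, r, phi, n):
    """Closed form of F_n(r, phi) = 2 (1 - 2 r cos(phi) + r^2) T_n(r, phi)."""
    return (alpha * (1 - 2 * r * math.cos(phi) + r * r)
            + beta * (1 - r * r
                      + 2 * r ** (n + 2) * math.cos(n * phi)
                      - 2 * r ** (n + 1) * math.cos((n + 1) * phi)))

def F_at_pi_factored(alpha, beta, r, n):
    """Factored value of F_n(r, pi); valid only when alpha + beta = 1."""
    return (1 + r) * (1 + (alpha - beta) * r + (-1) ** n * 2 * beta * r ** (n + 1))
```

Evaluating both sides at a few sample points (for instance $\alpha = -0.5$, $\beta = 1.5$, $r = 0.3$) for one odd and one even $n$ is a quick way to catch sign slips in the exponents $n + 1$ and $n + 2$.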

Let $n$ be even. We shall show that the inequality

$$F_n(r, \phi) \ge F_n(r, \pi), \qquad r = r_n,$$

holds for all real $\phi$, and for sufficiently large even $n$.

Rewritten, it assumes the form

$$-2\alpha r(1 + \cos\phi) - 2\beta r^{n+1}[(1 + \cos(n+1)\phi) + (1 - \cos n\phi)r] \ge 0.$$

The substitution $\phi = \pi - x$ yields

$$-\alpha(1 - \cos x) - \beta r^n[(1 - \cos(n+1)x) + (1 - \cos nx)r] \ge 0,$$

whence, dividing by the positive quantity $(1 - \cos x)$, and taking the limit as $x \to 0$, we obtain the inequality

$$-\alpha \ge \beta r^n[(n+1)^2 + n^2r],$$

which holds for sufficiently large $n$ and $r < 1$. But $r_n < 1$ for all $n$. Hence $r_n$ is a root of the algebraic equation

$$1 + (\alpha - \beta)r + 2\beta r^{n+1} = 0.$$
Oregon State College, 

Corvallis, Ore. 



Annals of Mathematics
Vol. 43, No. 4, October, 1942


THE TRANSFORMATION T OF CONGRUENCES 

By V. G. Grove 
(Received June 11, 1941) 

1. Introduction 

We propose to study in this paper a certain relationship between congruences in a projective space of three dimensions. Analytical conditions for this relationship, called by us the transformation $T$, were developed by Cook$^1$ in a form slightly different from that used in this paper. Fubini$^2$ also called attention to this relationship somewhat earlier; but neither of these papers showed the relationship between the transformation $T$ and the theory of W-congruences. The present paper is more closely allied with a recent paper by Fubini$^3$ on W-congruences.

Two congruences $\Gamma$ and $\bar\Gamma$ will be said to be in the relation of a transformation $T$, if the lines of the congruences are in one-to-one correspondence such that

1. corresponding lines are not coplanar,

2. the developables of the congruences correspond, and

3. there exist at least three transversal surfaces$^4$ of each congruence whose tangent planes at their points of intersection with the line of that congruence pass through the corresponding line of the other congruence.

The transformation $T$ is of two types, one of which we have called the asymptotic type, and the other the conjugate type. Associated with each of these types there is a one-parameter family, or pencil, of congruences. This pencil seems to be somewhat more general than the pencil$^5$ defined by Fubini. As in the case of Fubini's pencils, we find that if one congruence of the associated pencil is a W-congruence, all congruences of the pencil are W-congruences. Associated with the transformation $T$ are four congruences such that if any three are W-congruences, the other is also.

Let the curves which correspond to the developables of $\Gamma$ and $\bar\Gamma$ be chosen as the parametric curves on the focal surfaces $S_z$, $S_w$, $S_{\bar z}$, $S_{\bar w}$ of $\Gamma$ and $\bar\Gamma$. Then the homogeneous projective coordinates $z_i$, $w_i$, $\bar z_i$, $\bar w_i$, $i = 1, 2, 3, 4$, of the focal points on the lines of the congruences satisfy differential equations of


1 A. J. Cook, Pairs of rectilinear congruences with generators in one-to-one correspondence, Trans. Am. Math. Soc., Vol. 32 (1930), pp. 31-46.

2 G. Fubini, Su alcune classi di congruenze di rette e sulle trasformazioni delle superficie R, Annali di Matematica, (4), Vol. 1 (1923-24), pp. 241-257.

3 G. Fubini, On Bianchi's permutability theorem and the theory of W-congruences, these Annals, Vol. 41 (1940), pp. 620-638.

4 A. J. Cook, loc. cit., says that each of the congruences has the intersector property I with respect to the other congruence.

5 G. Fubini, loc. cit., p. 634.




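the form

$$\begin{aligned}
z_u &= f\bar z + g\bar w + rz + sw, & \bar z_u &= \bar f z + \bar g w + \bar r\bar z + \bar s\bar w,\\
z_v &= mz + nw, & \bar z_v &= \bar m\bar z + \bar n\bar w,\\
w_u &= Nz + Mw, & \bar w_u &= \bar N\bar z + \bar M\bar w,\\
w_v &= G\bar z + F\bar w + Sz + Rw, & \bar w_v &= \bar G z + \bar F w + \bar S\bar z + \bar R\bar w.
\end{aligned}\tag{1}$$

The integrability conditions of system (1) may be written in the form

$$\begin{aligned}
m_u - r_v + nN - sS &= g\bar G,\\
s_v - n_u + s(R - m) + n(r - M) &= -g\bar F,\\
f_v + f(\bar m - m) + g\bar S + sG &= 0,\\
g_v + g(\bar R - m) + sF + f\bar n &= 0;
\end{aligned}\tag{2a}$$

$$\begin{aligned}
M_v - R_u + nN - sS &= \bar g G,\\
S_u - N_v + S(r - M) + N(R - m) &= -\bar f G,\\
F_u + F(\bar M - M) + G\bar s + gS &= 0,\\
G_u + G(\bar r - M) + fS + F\bar N &= 0,
\end{aligned}\tag{2b}$$

and two other sets obtained from these by placing bars above the letters where they do not occur and removing those which do occur.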

Let us denote by $x$, $y$, $z$, etc. the points whose homogeneous projective coordinates are $x_i$, $y_i$, $z_i$, etc. ($i = 1, 2, 3, 4$). Let $S_x$ be a transversal surface of $\bar\Gamma$ generated by the point $x$ whose coordinates $x_i$ are of the form $x = \bar w + \lambda\bar z$. It follows that

$$\begin{aligned}
x_u &= (\bar M + \lambda\bar s)x + \lambda(\bar f z + \bar g w) + L_1\bar z,\\
x_v &= (\bar R + \lambda\bar n)x + \bar G z + \bar F w + L_2\bar z,
\end{aligned}\tag{3}$$

wherein

$$\begin{aligned}
L_1 &= \lambda_u - [\bar s\lambda^2 + (\bar M - \bar r)\lambda - \bar N],\\
L_2 &= \lambda_v - [\bar n\lambda^2 + (\bar R - \bar m)\lambda - \bar S].
\end{aligned}$$

The tangent plane to $S_x$ at $x$ passes through the line $g$ of $\Gamma$ if and only if $L_1 = L_2 = 0$. If one equates the derivatives $\lambda_{uv}$ and $\lambda_{vu}$ computed from $L_1 = 0$ and $L_2 = 0$, one finds, by using the integrability conditions (2), that these latter two equations can have analytic solutions only when the equation

$$\bar g F\lambda^2 - (\bar g G + g\bar G)\lambda + f\bar G = 0 \tag{4}$$

is satisfied. It follows from the third property demanded of two congruences that they be in the relation of a transformation $T$, that the coefficients of (4) vanish. Hence

$$\bar g F = \bar g G + g\bar G = f\bar G = 0. \tag{5}$$





In a similar manner one may show that

$$g\bar F = g\bar G + \bar g G = \bar f G = 0. \tag{6}$$

Hence the congruences $\Gamma$ and $\bar\Gamma$ are in relation $T$ if and only if conditions (5) and (6) are satisfied.

In general the tangent planes at $z$ and $w$ to $S_z$, $S_w$ do not coincide. Hence

$$\bar f\bar F - \bar g\bar G \ne 0, \qquad fF - gG \ne 0. \tag{7}$$

From (5), (6), (7) we find that the transformation $T$ is of two types; we call these types respectively

(i) the asymptotic type if

$$f = \bar f = F = \bar F = g\bar G + \bar g G = 0; \tag{8}$$

and the

(ii) conjugate type if

$$g = \bar g = G = \bar G = 0. \tag{9}$$

2. The Asymptotic Type

Let us consider first the asymptotic type of the transformation $T$. Under the conditions (8), we observe first that the integrability conditions (2) imply that $s = S = \bar s = \bar S = 0$.

Let $\lambda_1$, $\lambda_2$ be two distinct solutions of $L_1 = L_2 = 0$, and suppose that these solutions determine the two transversal surfaces $S_x$, $S_y$ of $\bar\Gamma$. Then from (3) we observe that $x$, $y$, $z$ and $w$ satisfy equations of the form

$$\begin{aligned}
x_u &= ax + bw, & y_u &= a'y + b'w,\\
x_v &= Ax + Bz, & y_v &= A'y + B'z,\\
z_u &= px + qy + rz, & w_u &= Nz + Mw,\\
z_v &= mz + nw, & w_v &= Qx + Py + Rw,
\end{aligned}\tag{10}$$

with

$$bp + b'q = 0, \qquad BQ + B'P = 0, \qquad bb'BB' \ne 0. \tag{11}$$

The integrability conditions of system (10) are

$$\begin{aligned}
a_v + bQ &= A_u + Bp, & a'_v + b'P &= A'_u + B'q,\\
bP &= Bq, & B'p &= b'Q,\\
B_u + Br &= aB, & B'_u + B'r &= a'B',\\
b_v + bR &= Ab, & b'_v + b'R &= A'b',
\end{aligned}\tag{12}$$

$$\begin{aligned}
p_v + Ap &= mp, & P_u + a'P &= MP,\\
q_v + A'q &= mq, & Q_u + aQ &= MQ,\\
r_v + Bp + B'q &= nN + m_u, & R_u + b'P + bQ &= nN + M_v,\\
n_u + Mn &= nr, & N_v + mN &= NR.
\end{aligned}\tag{13}$$





From the first of (2a) and (2b) and the third of (13) we find that $g\bar G + \bar g G = 0$ implies that

$$pB + qB' + b'P + bQ = 0.$$

From (12) and (13) we see that

$$\begin{aligned}
a_v - a'_v &= A_u - A'_u,\\
r_v + M_v + a_v + a'_v &= R_u + m_u + A_u + A'_u,
\end{aligned}$$

and we may verify that, by a transformation of the form

$$x = \lambda x', \quad y = \mu y', \quad z = \nu z', \quad w = \rho w', \tag{14}$$

we may make

$$a = a', \quad A = A', \quad r + M + 2a = R + m + 2A = 0. \tag{15}$$

We shall assume that this transformation has been effected. The conditions (15) are maintained by transformations (14) with

$$\lambda/\mu = \text{const.}, \qquad \lambda\mu\nu\rho = \text{const.}$$

Again from (12) we note that

$$\frac{\partial}{\partial u}\log\frac{B}{B'} = 0, \qquad \frac{\partial}{\partial v}\log\frac{b}{b'} = 0. \tag{16}$$

And hence from (11) and (16) we may write

$$q = pU, \quad Q = PV, \quad b = -b'U, \quad B' = -BV, \tag{17}$$

wherein $U$ and $V$ are respectively functions of $u$ and $v$ alone.

The focal points $\bar z$, $\bar w$ of $\bar\Gamma$ are readily found to be determined by the formulas

$$\bar z = (B'x - By)/D, \quad \bar w = (by - b'x)/D, \quad D = bB' - b'B. \tag{18}$$

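Hence we can recover equations (1) in the form

$$\begin{aligned}
z_u &= pB(1 - UV)\bar w + rz, & \bar z_u &= w + \bar r\bar z,\\
z_v &= mz + nw, & \bar z_v &= \bar m\bar z + \bar n\bar w,\\
w_u &= Nz + Mw, & \bar w_u &= \bar N\bar z + \bar M\bar w,\\
w_v &= b'P(1 - UV)\bar z + Rw, & \bar w_v &= z + \bar R\bar w,
\end{aligned}\tag{19}$$

wherein

$$\begin{aligned}
\bar r &= 2a - r - (\log D)_u, & \bar m &= R,\\
\bar R &= 2A - R - (\log D)_v, & \bar M &= r,\\
\bar n &= \frac{BV_v}{b'(1 - UV)}, & \bar N &= \frac{b'U_u}{B(1 - UV)},
\end{aligned}\tag{20}$$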

£> - VB(UV - 1), A = 

- pP(l - 177). 





3. W-congruences in the Asymptotic Case

The differential equations of the asymptotic curves on $S_z$, $S_w$ are readily found to be respectively

$$\begin{aligned}
pU_u\,du^2 + nP(1 - UV)\,dv^2 &= 0,\\
Np(1 - UV)\,du^2 + PV_v\,dv^2 &= 0.
\end{aligned}\tag{21}$$

Hence $\Gamma$ is a W-congruence if and only if the invariant

$$W = nN - \frac{U_uV_v}{(1 - UV)^2}$$

vanishes.

The differential equations of the asymptotic curves on $S_{\bar z}$, $S_{\bar w}$ are found to be

$$N\,du^2 + \bar n\,dv^2 = 0, \qquad \bar N\,du^2 + n\,dv^2 = 0.$$

It follows that $\bar\Gamma$ is a W-congruence if and only if the invariant

$$\bar W = \bar n\bar N - nN$$

vanishes. But from (20) we see that

$$W + \bar W = 0. \tag{22}$$

It follows therefore that, if one of two congruences in the relation of a transformation $T$ of the asymptotic type is a W-congruence, the other is also.

4. The Transversal Surfaces 

If from (10) one eliminates $z$ and $w$, it will be found that the coordinates of the current points $x$ and $y$ of $S_x$, $S_y$ satisfy the equations

$$\begin{aligned}
x_{uu} &= \theta x_u - \frac{b'UN}{B}x_v + (\ )x,\\
x_{uv} &= Ax_u + ax_v + (\ )x - b'PUy,\\
x_{vv} &= -\frac{Bn}{b'U}x_u + \phi x_v + (\ )x,
\end{aligned}\tag{23}$$

$$\begin{aligned}
y_{uu} &= \theta' y_u - \frac{b'N}{BV}y_v + (\ )y,\\
y_{uv} &= Ay_u + ay_v + (\ )y + b'PVx,\\
y_{vv} &= -\frac{BVn}{b'}y_u + \phi' y_v + (\ )y,
\end{aligned}\tag{24}$$

wherein the omitted coefficients are immaterial for our purposes, and wherein

$$\begin{aligned}
\theta &= a + M + (\log b'U)_u, & \theta' &= a + M + (\log b')_u,\\
\phi &= A + m + (\log B)_v, & \phi' &= A + m + (\log BV)_v.
\end{aligned}\tag{25}$$





It follows from (23) and (24) that the curves on $S_x$, $S_y$ which correspond to the developables of $\Gamma$ and $\bar\Gamma$ form asymptotic nets $N_x$, $N_y$ on those surfaces. Moreover those surfaces are not ruled surfaces.

Denote by $I_x$, $I_y$ the invariants whose vanishing implies that $N_x$ or $N_y$ is isothermally asymptotic. We find that

Is = Iy ~ d^dv log (if) = 4(Au ~ at) ’ 

But the vanishing of this function, as is seen from the first of (12), implies that $pP - qQ$ vanishes. Hence neither $N_x$ nor $N_y$ is isothermally asymptotic.

Suppose $S_\xi$ is a transversal surface of $\bar\Gamma$ distinct from $S_x$, $S_y$. If the coordinates of $\xi$ are defined by

$$\xi = y + \lambda x$$

it follows that the tangent plane to $S_\xi$ at $\xi$ passes through $g$ if and only if $\lambda$ is a constant. We may say that the line $(\xi, w)$ (or $(\xi, z)$) generates a pencil of congruences. They are the asymptotic tangents to the one focal surface $S_\xi$, the locus of $\xi$ being the line $\bar g$. We note that $I_\xi = I_x = I_y$ for every $\lambda$.

If we denote the coordinates of the tangent planes to $S_x$, $S_y$, $S_{\bar z}$, $S_{\bar w}$ respectively by $\xi$, $\eta$, $\omega$, $\zeta$ we find that these functions satisfy the following system of differential equations

$$\begin{aligned}
\xi_u &= -a\xi + q\omega, & \eta_u &= -a\eta + p\omega,\\
\xi_v &= -A\xi + P\zeta, & \eta_v &= -A\eta + Q\zeta,\\
\zeta_u &= b'\xi + b\eta - M\zeta, & \omega_u &= -N\zeta - r\omega,\\
\zeta_v &= -R\zeta - n\omega, & \omega_v &= B'\xi + B\eta - m\omega.
\end{aligned}\tag{26}$$


We have said$^6$ that two nets are in relation $C$ if the developables of the congruence of lines joining corresponding points of the nets intersect the sustaining surfaces of the nets in those nets. In particular two conjugate nets in the relation of a fundamental transformation $F$ are in relation $C$. Two nets in relation $C$ are said to be $K_\sigma$ transforms if

$$\frac{\partial^2}{\partial u\,\partial v}\log\sigma = 0$$

where $\sigma$ is one of the cross ratios of the corresponding points of the nets and the two focal points on the line of the congruence through these points. In particular nets in the relation of a transformation of Koenigs are $K_\sigma$ transforms.

We readily verify that $\sigma = bB'/(b'B)$. From (17) we note that $\sigma = UV$. Hence $N_x$, $N_y$ are $K_\sigma$ transforms in the asymptotic case. Similarly from (26) we see that $N_\xi$, $N_\eta$ are also $K_\sigma$ transforms since in that case $\sigma = UV$.
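The reduction of the cross ratio to $UV$ is pure algebra on the relations (17), and the same substitution gives $D = b'B(UV - 1)$ for the quantity $D$ of (18). A minimal sketch with exact rational arithmetic follows; the function name and the sample numbers are hypothetical, and the values merely stand in for the functions $b'$, $B$, $U$, $V$ evaluated at one point.

```python
from fractions import Fraction

def cross_ratio_and_D(b_prime, B, U, V):
    """Apply relations (17), b = -b'U and B' = -BV, then form sigma and D."""
    b = -b_prime * U
    B_prime = -B * V
    sigma = (b * B_prime) / (b_prime * B)   # cross ratio b B' / (b' B)
    D = b * B_prime - b_prime * B           # D = b B' - b' B as in (18)
    return sigma, D
```

For example, with $b' = 3$, $B = 5$, $U = 2/7$, $V = -4/3$ the function returns $\sigma = UV$ and $D = b'B(UV - 1)$ exactly.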


6 V. G. Grove, Transformations of Nets, Trans. Am. Math. Soc., Vol. 30 (1928), pp. 483-497. Ibid., p. 493.





5. The Focal Surfaces

From (19) we see that the functions $z$ and $w$ satisfy equations of the form

$$\begin{aligned}
z_{uv} &= mz_u + rz_v + (\ )z,\\
w_{uv} &= Rw_u + Mw_v + (\ )w,
\end{aligned}$$

the omitted coefficients being immaterial for our purposes. The nets $N_z$, $N_w$ have equal point invariants if the respective invariants $E_z$, $E_w$ defined by

$$E_z = m_u - r_v, \qquad E_w = M_v - R_u, \tag{27}$$

vanish.

It is readily seen that

$$2(E_w - E_z) = I_x = I_y.$$

Hence not both $N_z$ and $N_w$ can have equal point invariants.

From (21) we find that $N_z$ and $N_w$ are isothermally conjugate if the respective invariants

$$I_z = \frac{\partial^2}{\partial u\,\partial v}\log\frac{pU_u}{nP(1 - UV)}, \qquad I_w = \frac{\partial^2}{\partial u\,\partial v}\log\frac{PV_v}{Np(1 - UV)}, \tag{28}$$

vanish. But we may show that 

(29) 

II 

1 

Hence not both $N_z$, $N_w$ can be isothermally conjugate.


6. The Conjugate Type

The conjugate type of the transformation $T$ is characterized by the conditions $g = \bar g = G = \bar G = 0$. Let $S_x$, $S_y$ be two transversal surfaces of $\bar\Gamma$ whose tangent planes pass through the lines $g$ of $\Gamma$. Then from (3) and by use of a transformation of the form (14) we may show that $x$, $y$, $z$, $w$ may be made to satisfy equations of the form

$$\begin{aligned}
x_u &= bz, & y_u &= b'z,\\
x_v &= Bw, & y_v &= B'w,\\
z_u &= px + qy - Mz + sw, & w_u &= Nz + Mw,\\
z_v &= mz + nw, & w_v &= Qx + Py + Sz - mw,
\end{aligned}\tag{30}$$

wherein

$$Bp + B'q = bQ + b'P = 0, \qquad m_u = M_v. \tag{31}$$





The integrability conditions of system (30) are

$$\begin{aligned}
b_v + bm &= BN, & b'_v + b'm &= B'N,\\
B_u + BM &= bn, & B'_u + B'M &= b'n,\\
P_u + qS &= MP, & p_v + Qs &= mp,\\
Q_u + pS &= MQ, & q_v + Ps &= mq,\\
sS - nN &= 2m_u = 2M_v, & &\\
s_v - n_u &= 2(ms + Mn), & S_u - N_v &= 2(MS + mN).
\end{aligned}\tag{32}$$

System (30) is preserved under all transformations of the form (14) with $\lambda = \text{const.}$, $\mu = \text{const.}$, $\nu\rho = \text{const.}$

The focal points $\bar z$, $\bar w$ on the line $\bar g$ of $\bar\Gamma$ are determined by the formulas

$$\bar z = (B'x - By)/D, \quad \bar w = (by - b'x)/D, \quad D = bB' - b'B. \tag{33}$$

Hence we may recover equations (1) in the form

$$\begin{aligned}
z_u &= f\bar z - Mz + sw, & \bar z_u &= z - [M + (\log D)_u]\bar z - n\bar w,\\
z_v &= mz + nw, & \bar z_v &= \bar m\bar z + \bar n\bar w,\\
w_u &= Nz + Mw, & \bar w_u &= \bar N\bar z + \bar M\bar w,\\
w_v &= F\bar w + Sz - mw, & \bar w_v &= w - N\bar z - [m + (\log D)_v]\bar w,
\end{aligned}\tag{34}$$

wherein

$$\bar n = (BB'_v - B'B_v)/D, \qquad \bar N = (b'b_u - bb'_u)/D.$$

7. W-Congruences in the Conjugate Case

Denote by $\Gamma_{11}$ the congruence of lines $(x, z)$, $\Gamma_{22}$ the congruence of lines $(y, w)$, $\Gamma_{21}$ that formed by $(y, z)$, $\Gamma_{12}$ that formed by $(x, w)$. Let $W_{ij}$ be an invariant whose vanishing implies that $\Gamma_{ij}$ is a W-congruence. Four such functions are:


(35) 


W n = 


Wn = 


d* , q 
du dv 5 

4 m u , 

TF 1S - 8 

ou dv 

d* . p 

dw dt; B' 

■ 4m„, 

W a = " 

ow dt; 


log ^ — 4m u , 
log ® - 4 m u . 


The congruences $\Gamma$, $\bar\Gamma$ are W-congruences if the respective invariants


(36) 


W ~ 
(A 


log A — 4 m u , W 


du dv 

pP - qQ ) 

vanish. 

But using (31) we show easily that 


du dv 


log D — 4 m u , 





Hence 


(38) W + W = W n + W 22 = W 12 + W 21 . 


From the points $x$, $y$, $z$, $w$ there may be formed three different skew quadrilaterals. From (38) we may say that if any three of the sides of these quadrilaterals generate W-congruences so also does the fourth side. If we agree to say that the tangents to a family of asymptotic curves on a surface form a W-congruence, we note that equation (22) is then a special case of (38).

Let $S_\xi$ be a transversal surface of $\bar\Gamma$ whose tangent plane at $\xi$ passes through the line $g$ of $\Gamma$. Then if

$$\xi = x + \lambda y \tag{39}$$

it follows that $\lambda = \text{const.}$ We find readily that

$$\xi_u = (b + \lambda b')z, \qquad \xi_v = (B + \lambda B')w, \tag{40}$$

and if $\Gamma^\lambda_{11}$, $\Gamma^\lambda_{12}$ are the congruences of lines $(\xi, z)$ and $(\xi, w)$ respectively and $W^\lambda_{ij}$ the corresponding invariants $W$, then

$$W^\lambda_{11} = W_{11}, \qquad W^\lambda_{12} = W_{12}. \tag{41}$$

We may say that the congruences $\Gamma^\lambda_{11}$ and $\Gamma^\lambda_{12}$ form pencils. We shall call them the associated pencils. It follows from (41) that if one congruence of an associated pencil is a W-congruence, all congruences of the pencil are W-congruences.

Denote by $\rho_{ij}$ the focal points (other than $x$ or $y$) on the lines of the congruences $\Gamma_{ij}$. We find that these points are defined by

$$\begin{aligned}
\rho_{11} &= Bz - nx, & \rho_{12} &= bw - Nx,\\
\rho_{21} &= B'z - ny, & \rho_{22} &= b'w - Ny.
\end{aligned}\tag{42}$$

It may readily be found that

$$\begin{aligned}
\rho_{11v} &= (B_v + Bm)z - n_vx, & \rho_{12u} &= (b_u + bM)w - N_ux,\\
\rho_{21v} &= (B'_v + B'm)z - n_vy, & \rho_{22u} &= (b'_u + b'M)w - N_uy.
\end{aligned}$$

Hence the developables of $\Gamma_{ij}$ correspond to the developables of $\Gamma$ and $\bar\Gamma$.

Denote by $\rho^\lambda_{ij}$ the focal points other than $\xi$ (or $\eta$) on the lines of the congruences $\Gamma^\lambda_{ij}$. We find that the coordinates of $\rho^\lambda_{ij}$ are defined by the formulas

$$\begin{aligned}
\rho^\lambda_{11} &= (B + \lambda B')z - n\xi,\\
\rho^\lambda_{12} &= (b + \lambda b')w - N\xi.
\end{aligned}\tag{43}$$

Hence

$$\rho^0_{11} = \rho_{11}, \quad \rho^0_{12} = \rho_{12}, \quad \rho^\infty_{11} = \rho_{21}, \quad \rho^\infty_{12} = \rho_{22}. \tag{44}$$

It follows from (43) that each of the focal points of a line of a congruence of a pencil moves along a line as that congruence generates the pencil. In the pencil as defined by Fubini in the paper cited, one focal point is fixed, the other focal point moves along a line.





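8. The Transversal Surfaces

From (30) we may show that the coordinates $x$, $y$ of the current points of $S_x$, $S_y$ satisfy the following differential equations

$$\begin{aligned}
x_{uu} &= \left(\frac{b_u}{b} - M\right)x_u + \frac{bs}{B}x_v + b(px + qy),\\
x_{uv} &= \left(\frac{b_v}{b} + m\right)x_u + \left(\frac{B_u}{B} + M\right)x_v,\\
x_{vv} &= \frac{BS}{b}x_u + \left(\frac{B_v}{B} - m\right)x_v + B(Qx + Py);
\end{aligned}\tag{45}$$

$$\begin{aligned}
y_{uu} &= \left(\frac{b'_u}{b'} - M\right)y_u + \frac{b's}{B'}y_v + b'(qy + px),\\
y_{uv} &= \left(\frac{b'_v}{b'} + m\right)y_u + \left(\frac{B'_u}{B'} + M\right)y_v,\\
y_{vv} &= \frac{B'S}{b'}y_u + \left(\frac{B'_v}{B'} - m\right)y_v + B'(Py + Qx).
\end{aligned}\tag{46}$$

Hence $N_x$, $N_y$ are conjugate nets in the relation of a transformation $F$.

It follows from (45) and (46) that $N_x$ and $N_y$ have equal point invariants if the respective functions vanish:

$$E_x = \frac{\partial^2}{\partial u\,\partial v}\log\frac{b}{B}, \qquad E_y = \frac{\partial^2}{\partial u\,\partial v}\log\frac{b'}{B'}. \tag{47}$$

Again from (45) and (46) we note that the asymptotic curves on $S_x$, $S_y$ are given by the respective equations

$$bq\,du^2 + BP\,dv^2 = 0, \qquad b'p\,du^2 + B'Q\,dv^2 = 0. \tag{48}$$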


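But from (31)

$$\frac{bq}{BP} = \frac{b'p}{B'Q}. \tag{49}$$

Hence the asymptotic curves on $S_x$, $S_y$ correspond.

If we denote the coordinates of the tangent planes to $S_x$, $S_y$, $S_{\bar z}$, $S_{\bar w}$ by $\xi$, $\eta$, $\omega$, $\zeta$ respectively, we may write

$$\xi = (x, z, w), \quad \eta = (y, w, z), \quad \zeta = (x, y, w), \quad \omega = (y, x, z). \tag{50}$$

We find that these functions satisfy the following system of differential equations

$$\begin{aligned}
\xi_u &= q\zeta, & \eta_u &= p\zeta,\\
\xi_v &= P\omega, & \eta_v &= Q\omega,\\
\zeta_u &= b'\xi + b\eta + M\zeta - N\omega, & \omega_u &= -s\zeta - M\omega,\\
\zeta_v &= -m\zeta - S\omega, & \omega_v &= B'\xi + B\eta - n\zeta + m\omega.
\end{aligned}\tag{51}$$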





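The equations of Laplace which $\xi$, $\eta$ satisfy are

$$\begin{aligned}
\xi_{uv} &= \left(\frac{q_v}{q} - m\right)\xi_u + \left(\frac{P_u}{P} - M\right)\xi_v,\\
\eta_{uv} &= \left(\frac{p_v}{p} - m\right)\eta_u + \left(\frac{Q_u}{Q} - M\right)\eta_v.
\end{aligned}\tag{52}$$

It follows therefore that the nets $N_x$, $N_y$ have equal tangential invariants if the respective invariants

$$E_\xi = \frac{\partial^2}{\partial u\,\partial v}\log\frac{q}{P}, \qquad E_\eta = \frac{\partial^2}{\partial u\,\partial v}\log\frac{p}{Q} \tag{53}$$

vanish. From (47), (49), and (53) we see that

$$E_x + E_\xi = E_y + E_\eta.$$

As is seen from (33) the nets $N_x$, $N_y$ are in the relation of a transformation $K$ of Koenigs if and only if

$$bB' + b'B = 0. \tag{54}$$

Moreover from (31) and (54) one may show that

$$pP + qQ = 0. \tag{55}$$

But the condition (55) implies that the tangent planes to $S_x$ and $S_y$ at $x$, $y$ separate the focal planes of the line $g$ of $\Gamma$ harmonically.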

One may show from (32) that the condition (54) implies$^7$ that $E_x = E_y = 0$, and that (55) implies that $E_\xi = E_\eta = 0$. It follows therefore that if the two nets $N_x$, $N_y$ are in the relation of a transformation $K$ they are also in the relation of a transformation $\Omega$.

Michigan State College, 

East Lansing, Mich. 


7 L. P. Eisenhart, Transformations of Surfaces, Princeton, 1923, p. 134. 



Annals of Mathematics
Vol. 43, No. 4, October, 1942


ON THE HOMOTOPY GROUPS OF SPHERES AND ROTATION GROUPS 1 

By George W. Whitehead 
(Received February 11, 1942) 

1. Introduction 

One of the outstanding problems in modern topology is that of classifying the mappings of an $m$-dimensional sphere $S^m$ into a topological space $X$. In terms of the Hurewicz theory of homotopy groups$^2$ this problem may be phrased as follows: to determine the structure of the $m$th homotopy group $\pi_m(X)$. Of particular interest is the case where $X$ itself is an $n$-sphere $S^n$. In this case the results of Hopf,$^3$ Freudenthal,$^4$ and Pontrjagin$^5$ have led to the solution of the problem for $m \le n + 2$. For $m > n + 2$ almost nothing is known concerning the structure of $\pi_m(S^n)$.

That this problem is closely related to the study of homotopy properties of the rotation group $R_n$ of the $n$-sphere has been shown by Pontrjagin,$^5$ who has used the one- and two-dimensional homotopy groups of $R_n$ to compute the groups $\pi_{n+i}(S^n)$ ($i = 1, 2$).

In the present paper we introduce an operation which associates with each mapping $f(S^m \times S^n) \subset S^n$ a mapping $\phi(S^{m+n+1}) \subset S^{n+1}$. This is a generalization of the procedure of Hopf$^6$ for the case $m = n$. This operation is shown to induce a homomorphism of $\pi_m(R_n)$ into $\pi_{m+n+1}(S^{n+1})$, which for $m = 1, 2$ turns out to be an isomorphism. The connection of this homomorphism with one introduced by Freudenthal$^4$ is studied.

In a recent paper Freudenthal$^7$ has announced without proof a very general theorem on extension of mappings, and used this theorem to construct maps of $S^{2n-1}$ on $S^n$ of Hopf invariant 1$^6$ for all even $n$. We shall use the above results to construct a counter-example to Freudenthal's theorem. It is further shown that Freudenthal's construction definitely fails if $n > 2$ and $n \equiv 2 \pmod 4$.

2. Preliminary concepts 

In Euclidean $(r + 1)$-space $\mathfrak{E}^{r+1}$ let $S^r$ denote the unit sphere, i.e., the set of points $x = (x_1, \cdots, x_{r+1}) \in \mathfrak{E}^{r+1}$ with

$$|x|^2 = \sum_{i=1}^{r+1} x_i^2 = 1. \tag{1}$$

1 Presented to the American Mathematical Society, December 30, 1941. 

2 W. Hurewicz, Proc. Akad. Amsterdam 38 (1936), pp. 112-119 

3 H. Hopf, Math. Ann. 104 (1931), pp. 637-666. We shall refer to this paper as H I.

4 H. Freudenthal, Comp. Math. 5 (1937), pp. 299-314. We shall refer to this paper as F I.

5 L. Pontrjagin, C. R. Acad. Sci. URSS 19 (1938), pp. 147-149, 361-363.

6 H. Hopf, Fund. Math. 26 (1936), pp. 427-440. We shall refer to this paper as H II.

7 H. Freudenthal, Proc. Akad. Amsterdam 42 (1939), pp. 139-140. We shall refer to 
this paper as F II. 




Let $E_i^r$ ($i = 1, 2$) be the hemispheres defined by the conditions $x_{r+1} \ge 0$, $x_{r+1} \le 0$, respectively. $E^{r+1}$ denotes the closed $(r + 1)$-cell $|x| \le 1$ bounded by $S^r$. We shall refer to the points $x^1 = (0, 0, \cdots, 1)$ and $x^2 = (0, 0, \cdots, -1)$ as the north and south poles, respectively.

Let $Y$ be a metric space with distance function $\rho(y_1, y_2)$, $y^0$ a fixed point of $Y$. By $Y^{S^r}$ we shall mean the space of all mappings$^8$ $f(S^r) \subset Y$ metrized by

$$\rho(f, g) = \underset{x \in S^r}{\text{l.u.b.}}\ \rho[f(x), g(x)] \qquad (f, g \in Y^{S^r}). \tag{2}$$

Let $x^0$ be the point of $S^r$ with co-ordinates $(1, 0, \cdots, 0)$. Then $Y^{S^r}(x^0, y^0)$ denotes the subspace of $Y^{S^r}$ consisting of those mappings $f(S^r) \subset Y$ such that $f(x^0) = y^0$. Two mappings $f, g \in Y^{S^r}(x^0, y^0)$ are said to be homotopic if they can be joined by an arc in $Y^{S^r}(x^0, y^0)$. The relation of homotopy is reflexive, symmetric, and transitive and divides the space $Y^{S^r}(x^0, y^0)$ into equivalence classes, called homotopy classes. The set of all these homotopy classes we denote by $\pi_r(Y)$. We shall denote the homotopy class of any $f \in Y^{S^r}(x^0, y^0)$ by $\mathbf{f}$.

We define an operation of addition between homotopy classes as follows: let $f_i$ ($i = 1, 2$) $\in Y^{S^r}(x^0, y^0)$. Let $\phi_i$ ($i = 1, 2$) be a mapping of $E_i^r$ on $S^r$ such that (1) $\phi_i(S^{r-1}) = x^0$; (2) $\phi_i(E_i^r - S^{r-1}) \subset S^r$ is a topological map of degree 1. Then we define a mapping $f(S^r) \subset Y$ as follows:

$$f(x) = \begin{cases} f_1[\phi_1(x)] & (x \in E_1^r),\\ f_2[\phi_2(x)] & (x \in E_2^r). \end{cases} \tag{3}$$

It is easily verified that the homotopy class of $f$ depends only on the homotopy classes of $f_1$ and $f_2$. Let

$$\mathbf{f} = \mathbf{f}_1 + \mathbf{f}_2. \tag{4}$$

Hurewicz$^2$ has proved that under the operation of addition so defined the set $\pi_r(Y)$ becomes a group, called the $r$th homotopy group of $Y$. This group is abelian if $r > 1$; in all the cases we consider here it is also abelian if $r = 1$.


3. The homomorphism H

Let Euclidean $(m + n + 2)$-space be represented as the product space $\mathfrak{E}^{m+1} \times \mathfrak{E}^{n+1}$, points $x \in \mathfrak{E}^{m+n+2}$ being represented by co-ordinates $(p, q)$ ($p \in \mathfrak{E}^{m+1}$, $q \in \mathfrak{E}^{n+1}$). Then $S^{m+n+1}$ is defined by

$$|p|^2 + |q|^2 = 1. \tag{5}$$

Let $H_1$ and $H_2$ be the subsets of $S^{m+n+1}$ defined by

$$|p| \le |q|, \tag{6$_1$}$$

$$|p| \ge |q|, \tag{6$_2$}$$


8 All mappings are supposed continuous. 





respectively. Let

$$\psi_1(p, q) = (p/|q|,\ q/|q|) \qquad ((p, q) \in H_1), \tag{7$_1$}$$

$$\psi_2(p, q) = (p/|p|,\ q/|p|) \qquad ((p, q) \in H_2). \tag{7$_2$}$$

Evidently $\psi_1 \mid H_1H_2 = \psi_2 \mid H_1H_2$$^9$ and maps $H_1H_2$ into $S^m \times S^n$. Denote this mapping by $\psi$. Then

Lemma 1. The mappings $\psi_1$, $\psi_2$, and $\psi$ defined above are homeomorphic mappings of $H_1$ on $E^{m+1} \times S^n$, $H_2$ on $S^m \times E^{n+1}$, and $H_1H_2$ on $S^m \times S^n$ respectively.

Let $f$ be a mapping of $S^m \times S^n$ into $S^n$. We associate with $f$ the mapping $H(f) = \phi(S^{m+n+1}) \subset S^{n+1}$ as follows: $\phi$ maps the great circle joining the point $(0, q)$ to the point $(p, q)$ on the great circle joining the north pole $z^1$ of $S^{n+1}$ to the point $f[\psi^{-1}(p, q)]$, and maps the great circle joining $(p, 0)$ to $(p, q)$ on the great circle joining $z^2$ to $f[\psi^{-1}(p, q)]$. Evidently $\phi(H_1) \subset E_1^{n+1}$, $\phi(H_2) \subset E_2^{n+1}$, while $\phi = f\psi^{-1}$ on $H_1H_2$. The functions defining the mapping $\phi$ are given by

$$\begin{aligned}
\phi_i(p, q) &= 2|p| \cdot |q| \cdot f_i(p/|p|,\ q/|q|) \qquad (|p| \cdot |q| \ne 0),\\
\phi_i(0, q) &= \phi_i(p, 0) = 0 \qquad (i = 1, \cdots, n + 1);\\
\phi_{n+2}(p, q) &= |q|^2 - |p|^2.
\end{aligned}\tag{8}$$
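Formulas (8) can be tried out in the lowest-dimensional case $m = n = 1$. The sketch below is only an illustration under stated assumptions: points of $S^1$ are written as unit complex numbers, the mapping $f(p, q) = pq$ (rotation of $q$ by $p$) is chosen as an example, and the function name `hopf_construction` is hypothetical. The image of a point of $S^3$ should then lie on the unit sphere $S^2$, with $(0, q)$ going to the north pole and $(p, 0)$ to the south pole.

```python
import cmath

def hopf_construction(p: complex, q: complex):
    """phi(p, q) from (8) for m = n = 1 with the assumed map f(p, q) = p * q.

    p and q are complex numbers with |p|^2 + |q|^2 = 1, so (p, q) lies on S^3.
    Returns the three real coordinates (phi_1, phi_2, phi_3).
    """
    ap, aq = abs(p), abs(q)
    height = aq * aq - ap * ap            # phi_{n+2} = |q|^2 - |p|^2
    if ap == 0.0 or aq == 0.0:
        return (0.0, 0.0, height)         # phi_i(0, q) = phi_i(p, 0) = 0
    w = (p / ap) * (q / aq)               # f(p/|p|, q/|q|), a point of S^1
    scale = 2.0 * ap * aq
    return (scale * w.real, scale * w.imag, height)
```

The squared length of the result is $4|p|^2|q|^2 + (|q|^2 - |p|^2)^2 = (|p|^2 + |q|^2)^2 = 1$, which is what the check on a sample point confirms.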


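We use this operation to construct a mapping $H = H_{m,n}$ of $\pi_m(R_n)$ into $\pi_{m+n+1}(S^{n+1})$ as follows: let $e \in R_n$ denote the identity mapping of $S^n$ on itself, and let $f \in R_n^{S^m}(p^0, e)$. If $p \in S^m$, $q \in S^n$, let $f^*(p, q)$ denote the point of $S^n$ into which $q$ is carried by the rotation $f(p)$. Let $\phi = H(f^*)$. Then it is easy to verify that $\phi \in (S^{n+1})^{S^{m+n+1}}(x^0, z^2)$, where $x^0 = (p^0, 0)$ and $z^2$ is the south pole of $S^{n+1}$. Let $H(\mathbf{f}) = \boldsymbol{\phi}$. Evidently $\mathbf{f} = \mathbf{g}$ implies $H(\mathbf{f}) = H(\mathbf{g})$, so that $H$ is a well-defined mapping of $\pi_m(R_n)$ into $\pi_{m+n+1}(S^{n+1})$. We have further

Theorem 1. $H$ is a homomorphic mapping of $\pi_m(R_n)$ into $\pi_{m+n+1}(S^{n+1})$.

For let $\mathbf{f}, \mathbf{g} \in \pi_m(R_n)$, and let $h$ be the constant mapping $h(p) = e$ ($p \in S^m$). Then $\mathbf{h} = 0$. Hence $\mathbf{f} + \mathbf{h} = \mathbf{f}$, $\mathbf{h} + \mathbf{g} = \mathbf{g}$, so that $H(\mathbf{f} + \mathbf{h}) = H(\mathbf{f})$, $H(\mathbf{h} + \mathbf{g}) = H(\mathbf{g})$. It is therefore sufficient to prove that

$$H(\mathbf{f} + \mathbf{h}) + H(\mathbf{h} + \mathbf{g}) = H(\mathbf{f} + \mathbf{g}). \tag{9}$$

Let $f'$, $g'$ be mappings of $S^m$ into $R_n$ defined by

$$f'(p) = \begin{cases} f[\phi_1(p)] & (p \in E_1^m),\\ h[\phi_2(p)] & (p \in E_2^m); \end{cases} \tag{10$_1$}$$

$$g'(p) = \begin{cases} h[\phi_1(p)] & (p \in E_1^m),\\ g[\phi_2(p)] & (p \in E_2^m). \end{cases} \tag{10$_2$}$$

Then $\mathbf{f}' = \mathbf{f} + \mathbf{h}$, $\mathbf{g}' = \mathbf{h} + \mathbf{g}$. Let $F = H(f'^*)$, $G = H(g'^*)$.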


9 If $f(X) \subset Y$ and $A$ is a closed subset of $X$, $f \mid A$ denotes the mapping of $A$ into $Y$ obtained by restricting the range of definition of $f$ to the set $A$.





Let Vi denote the vertical projection of E r ? +n+1 on 2? m+n+1 (i = 1, 2). Then 
vi(x) = x for x t S n+n . Let F 0 = F \ E? +n+1 , H" = F | E? +n+ \ H' = 
G | £T +n+1 , Go ~ G | E% +n+1 . Then it is easily verified that H'v r 1 = H"i rj -1 . 
Call this mapping H 0 . Evidently F 0 (x) = G„(x) = H 0 (x) ( x t S m+n ). 

Let H t (0 ^ t ^ 1) be a homotopy of H 0 to x° keeping x° fixed. Then 14 


there exist homotopies F t , G t (0 ^ t ^ 1) of F 0 , G 0 
F,(x) = G,(x) = H,(x) (x e S m+n ). Let 

respectively, such that 

(Hi) 

F,(x) = 

HW*)] 

(x t E? +n+1 ), 
(z«E? +n+l ); 

(11*) 

HiMx)] 

G'{x) 

Gtix) 

(x e £T +n+1 ), 
(x e E? +n+l ). 

Evidently F| ■ 
Let 

= F, G( = G. 


(12) 

, P*{x) 

H(x) = 

<?.(*) 

(x e ET +n+1 ), 

(ie£? +B+1 ). 


Then Hj = H(f + g), while H[ = F( + = F + G = H(f + h) + H(f + g). u 

But H£ = Hi, which proves the theorem. 


4. Relations between the homomorphisms F, G, and H

Let $S^{m+n}$ be the equator of $S^{m+n+1}$, $S^n$ the equator of $S^{n+1}$, and let $f$ be a mapping of $S^{m+n}$ into $S^n$. We associate with the mapping $f$ a mapping $F(f) = \psi(S^{m+n+1}) \subset S^{n+1}$ as follows: $\psi$ maps the great circle joining the north pole $x^1$ of $S^{m+n+1}$ to the point $x \in S^{m+n}$ on the great circle joining $z^1$ to $f(x)$, and maps the great circle joining $x^2$ to $x$ on the great circle joining $z^2$ to $f(x)$. Evidently $\psi(E_1^{m+n+1}) \subset E_1^{n+1}$, $\psi(E_2^{m+n+1}) \subset E_2^{n+1}$, while $\psi = f$ on $S^{m+n}$. If $f \in (S^n)^{S^{m+n}}(x^0, y^0)$, then $F(f) \in (S^{n+1})^{S^{m+n+1}}(x^0, y^0)$; moreover, $f$ homotopic to $g$ implies $F(f)$ homotopic to $F(g)$. Thus $F$ induces a mapping $F$ of $\pi_{m+n}(S^n)$ into $\pi_{m+n+1}(S^{n+1})$, which was shown by Freudenthal$^4$ to be a homomorphism.

Let $R_{n-1}$ be the closed subgroup of $R_n$ consisting of those rotations which leave the north pole fixed. Evidently $R_{n-1}$ is isomorphic with the group of rotations of $S^{n-1}$. Since $R_{n-1} \subset R_n$, there is a natural homomorphism $G$ of $\pi_m(R_{n-1})$ into $\pi_m(R_n)$.

Theorem 2. The homomorphisms $F$, $G$, and $H$ are related by

$$FH_{m,n-1} = H_{m,n}G. \tag{13}$$

10 K. Borsuk, Fund. Math. 28 (1937), p. 101.

11 This follows from the definition of addition in $\pi_{m+n+1}(S^{n+1})$ given by S. Eilenberg (Ann. of Math. 41 (1940), p. 235), which is easily shown to be equivalent to the one given here.





For let $\mathbf{f} \in \pi_m(R_{n-1})$, $g = F[H_{m,n-1}(f)]$, $g' = H_{m,n}[G(f)]$. It is then easily verified that $g = g'$ on $S^{m+n}$. Moreover $g'(E_1^{m+n+1}) \subset E_1^{n+1}$, $g'(E_2^{m+n+1}) \subset E_2^{n+1}$. Hence for no $x$ is $g'(x) = -g(x)$. It follows that $g$ and $g'$ are homotopic, so that $\mathbf{g} = \mathbf{g}'$.

Let $\phi$ be a mapping of $S^{n-1}$ into $R_{n-1}$ defined as follows: if $x \in S^{n-1}$, $x'$ is the point in the great circle joining $x^1$ to $x$ whose angular distance from $x^1$ is twice that from $x^1$ to $x$. Then $\phi(x)$ is that rotation which carries $x^1$ into $x'$ and leaves each point in the $(n - 2)$-sphere orthogonal to $x^1$ and $x$ fixed. Let $\mathbf{h} = H_{n-1,n-1}(\boldsymbol{\phi})$. Then it can easily be shown$^{12}$ that if $n$ is even $h$ has Hopf invariant 2. We have further:

Theorem 3. The kernel of the homomorphism $F[\pi_{2n-1}(S^n)] \subset \pi_{2n}(S^{n+1})$ ($n$ even) is the subgroup of $\pi_{2n-1}(S^n)$ generated by $\mathbf{h}$.

The author has recently shown$^{13}$ that $G(\boldsymbol{\phi}) = 0$; in fact, the kernel of the homomorphism $G$ is the subgroup of $\pi_{n-1}(R_{n-1})$ generated by $\boldsymbol{\phi}$. It follows from Theorem 2 that $F[H_{n-1,n-1}(\boldsymbol{\phi})] = F(\mathbf{h}) = 0$. Let $\mathbf{g} \in \pi_{2n-1}(S^n)$, and suppose that $F(\mathbf{g}) = 0$. Then the Hopf invariant of $g$ is even,$^{14}$ say $2k$. Let $\mathbf{f} = k\mathbf{h}$. Then $F(\mathbf{f} - \mathbf{g}) = 0$, and $\mathbf{f} - \mathbf{g}$ has Hopf invariant zero. Hence$^{15}$ $\mathbf{f} - \mathbf{g} = 0$, i.e., $\mathbf{g} = \mathbf{f} = k\mathbf{h}$.

Theorem 4. $H_{m,n}$ maps $\pi_m(R_n)$ isomorphically for $m = 1, 2$. $H_{m,n}$ maps $\pi_m(R_n)$ on $\pi_{m+n+1}(S^{n+1})$ for $m = 1$ and for $m = 2$, $n > 1$.

Let $h(S^1) \subset R_1$ be defined by

$$h(x) = \begin{pmatrix} x_1 & -x_2\\ x_2 & x_1 \end{pmatrix}.$$

Then $h$ maps $S^1$ homeomorphically on $R_1$, and $\mathbf{h}$ is a generator of the free cyclic group $\pi_1(R_1)$. But $H_{1,1}(h)$ maps $S^3$ on $S^2$ with Hopf invariant 1$^{16}$ and generates the group $\pi_3(S^2)$. It follows from Theorems 2 and 3 that $H_{1,n}$ maps $\pi_1(R_n)$ isomorphically on $\pi_{n+2}(S^{n+1})$ for $n > 1$.

Since $\pi_2(R_n) = 0$, it follows that $H_{2,n}$ is an isomorphism. But $\pi_{n+3}(S^{n+1}) = 0$ for $n > 1$,$^5$ and hence $H_{2,n}$ maps $\pi_2(R_n)$ on $\pi_{n+3}(S^{n+1})$. This completes the proof of the theorem.

5. Freudenthal's theorem

Freudenthal has recently announced$^7$ without proof a very general theorem on extension of mappings, and used this theorem to construct maps of $S^{2n-1}$ on $S^n$ with Hopf invariant 1 for all even $n$.$^{17}$ In this section the foregoing results are used to construct a counter-example to Freudenthal's theorem, and to show that the above-mentioned construction fails if $n > 2$ and $n \equiv 2 \pmod 4$.

12 Cf. H II, p. 431.

13 Ann. of Math. 43 (1942), Theorem 5.

14 F I, Satz III.

15 F I, Satz II, 2.

16 H I, p. 664.

17 F II, p. 140.





Let points z of Euclidean 2n-space be represented by complex co-ordinates
$(z_1, \cdots, z_n)$. Then $S^{2n-1}$ is represented by the equation $\sum z_i \bar z_i = 1$.

Let $P_{n-1}$ denote complex projective (n − 1)-space. Then there is a natural
mapping $\phi(S^{2n-1}) \subset P_{n-1}$ defined by mapping each point $z \in S^{2n-1}$ into the point
of $P_{n-1}$ with the same coordinates. This is evidently a fibre map in the sense
of Hurewicz and Steenrod,$^{18}$ the fibres being great circles. This mapping
$\phi$ can be extended to a mapping $\psi(E^{2n}) \subset P_n$, where
$\psi(z_1, \cdots, z_n) = (z_1, \cdots, z_n, (1 - \sum z_i \bar z_i)^{1/2})$. It is easily verified that $\psi$ is a
homeomorphism on $E^{2n} - S^{2n-1}$ and $\psi = \phi$ on $S^{2n-1}$.

Let X be a topological space, f a mapping of $P_{n-1}$ into X. Then

Theorem 5. The mapping $f(P_{n-1}) \subset X$ can be extended to a mapping
$f^*(P_n) \subset X$ if and only if the mapping $f\phi(S^{2n-1}) \subset X$ is inessential.

For if $f\phi$ is inessential, there is a mapping $F(E^{2n}) \subset X$ such that $F = f\phi$ on
$S^{2n-1}$. Let $f^* = F\psi^{-1}$. Then $f^*$ is the required extension. Conversely, if $f^*$
is an extension of f, let $F = f^*\psi$. Then F maps $E^{2n}$ into X and $F = f\phi$ on
$S^{2n-1}$. Hence $f\phi$ is inessential.

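Let $g(S^1) \subset R_{2n-1}$ be defined by

$$g(x) = \begin{pmatrix}
x_1 & x_2 & 0 & 0 & \cdots & 0 & 0 \\
-x_2 & x_1 & 0 & 0 & \cdots & 0 & 0 \\
0 & 0 & x_1 & x_2 & \cdots & 0 & 0 \\
0 & 0 & -x_2 & x_1 & \cdots & 0 & 0 \\
\vdots & \vdots & \vdots & \vdots & & \vdots & \vdots \\
0 & 0 & 0 & 0 & \cdots & x_1 & x_2 \\
0 & 0 & 0 & 0 & \cdots & -x_2 & x_1
\end{pmatrix}.$$

Then g is essential or inessential according as n is odd or even. For if n = 1,
g is a generator of $\pi_1(R_1)$, so that g is essential. If n = 2, we have $g(S^1) \subset Q_3$,
where $Q_3$ is the quaternion subgroup of $R_3$. But $\pi_1(Q_3) = \pi_1(S^3) = 0$. Hence
g = 0 in $Q_3 \subset R_3$, and g is inessential. The proof is completed by induction.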

Let h = H(g). Then it follows from Theorem 4 that $h(S^{2n+1}) \subset S^{2n}$ is essential
if n is odd and inessential if n is even. Moreover, it can be directly verified
that there is a mapping $h'(P_n) \subset S^{2n}$ such that $h = h'\phi$, and that $h'$ has degree 1.
An application of Theorem 5 gives

Theorem 6. If n is even, the mapping $h'(P_n) \subset S^{2n}$ can be extended over
$P_{n+1}$. If n is odd, it cannot be so extended.

The theorem of Freudenthal's referred to above can be phrased as follows:$^{19}$
Let K be a complex, f a normal mapping$^{20}$ of $K^q$ into $S^q$. Suppose that f can be
extended over $K^{q+1}$. Then f can be extended over

Let K be a triangulation of $P_{n+1}$, so that $P_n$ becomes a closed subcomplex L
of K. Then $L \subset K^{2n}$. Let $h'$ be the mapping of L into $S^{2n}$ of degree one

18 W. Hurewicz and N. E. Steenrod, Proc. Nat. Acad. 27 (1941), pp. 60-64. 

19 F II, p. 140.

8 «I.e.,/(X<r-i) - a? 8 . 




described above. Then$^{21}$ $h'$ can be deformed into a normal map $h''$; moreover,
$h''$ can be extended over K if and only if the same is true of $h'$. Let $H^r(K - L)$
denote the r-th cohomology group of K − L with integral coefficients. Then
$H^r(K - L) = 0$ for r < 2n + 2, while $H^{2n+2}(K - L)$ is a free cyclic group.
In particular, $H^{2n+1}(K - L) = 0$. It follows from a theorem of Whitney$^{22}$
that $h''$ can be extended over $K^{2n+1}$. But $h''$ cannot be extended over $K^{2n+2}$
for n odd.

Freudenthal's construction of maps of $S^{4n-1}$ on $S^{2n}$ is based on an application
of his theorem to the case $K = P_{2n}$, $f(K^{2n}) \subset S^{2n}$, where $f(P_n) \subset S^{2n}$ is of
degree one. The argument above shows that this construction breaks down if
n is odd and > 1; for f cannot even be extended over the subspace $P_{n+1}$ of $P_{2n}$.

Purdue University 

21 H. Whitney, Duke Journal 3 (1937), p. 53.

22 Loc. cit., Theorem 2.



Annals of Mathematics
Vol. 43, No. 4, October, 1942


LINEAR p-ADIC GROUPS AND THEIR LIE ALGEBRAS 

By Robert Hooke 
(Received December 18, 1941) 

1. Introduction 

The theory of real or complex Lie groups necessarily treats only those
topological groups which are locally connected. It is the object of this paper to
develop methods of Lie theory for the opposite case of totally disconnected
groups.

We shall consider those groups into which can be introduced local analytic 
coordinates from a p-adic field K, calling these p-adic Lie groups over K. The 
concept of the associated Lie algebra over K will be used at once to obtain the 
usual properties of Lie groups. Bearing in mind the results of Ado 1 on im¬ 
bedding any Lie algebra of characteristic zero in a Lie algebra of matrices, we 
shall restrict ourselves to the study of Lie algebras of matrices over K and the 
p-adic Lie groups contained in the full linear group over K. 

Although the coordinates in these groups may be defined only in a certain
neighborhood of the identity, all the groups which will occur will be entire
groups. It is necessary, however, to identify two subgroups whose intersection
is open in each, as is done in the theory of real local Lie groups. It will then be
shown in sections 6 and 7 that the usual one-to-one correspondence exists
between the subgroups of a Lie group and the subalgebras of its Lie algebra.

The last sections are devoted to certain special groups and their Lie algebras, 
and in particular to the determination of groups whose Lie algebras are the 
various “non-exceptional” normal simple Lie algebras over K , which have been 
classified by Jacobson. 

The author wishes to express here his appreciation of the assistance and
encouragement of Professor C. Chevalley in the preparation of this paper.

2. Notation and Preliminary Theorems 

We shall first list, without proof, a few necessary theorems from p-adic analy¬ 
sis. Unless otherwise noted, these theorems may be found in the papers of 
Chabauty, [4], and Chevalley, [5]. 

Let R be the field of rational numbers and p be a fixed prime. There is
determined by p a valuation v in R which is defined by

$v(p) = \rho$, $0 < \rho < 1$, $\rho$ a real number.

(For this notation and elementary results, the reader is referred to Albert, [2].)
$R_p$ will denote the complete p-adic number field determined from R by the
valuation v. This valuation has the properties:

(a) $v(xy) = v(x)v(y)$,

(b) $v(x + y) \le \max v(x), v(y)$.
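
Properties (a) and (b) can be checked on rational numbers. The sketch below assumes the normalization $\rho = 1/p$, which the text does not fix (it requires only $0 < \rho < 1$); the names `ord_p` and `v` are illustrative.

```python
from fractions import Fraction

def ord_p(x, p):
    """Exponent m such that x = p**m * (p-adic unit); x must be nonzero."""
    x = Fraction(x)
    if x == 0:
        raise ValueError("ord_p(0) is infinite")
    m, num, den = 0, x.numerator, x.denominator
    while num % p == 0:
        num //= p
        m += 1
    while den % p == 0:
        den //= p
        m -= 1
    return m

def v(x, p, rho=None):
    """Valuation v(x) = rho**ord_p(x), with v(0) = 0.

    rho defaults to 1/p; this default is an assumption of the sketch.
    """
    if rho is None:
        rho = Fraction(1, p)
    x = Fraction(x)
    return Fraction(0) if x == 0 else rho ** ord_p(x, p)

x, y, p = Fraction(50, 3), Fraction(5, 7), 5
print(v(x, p), v(y, p))                          # valuations of x and y
print(v(x * y, p) == v(x, p) * v(y, p))          # property (a)
print(v(x + y, p) <= max(v(x, p), v(y, p)))      # property (b)
```

Exact rational arithmetic is used so that the comparisons are not affected by rounding.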


1 See Ado, [1]. The numbers in square brackets will refer to papers in the bibliography. 


If x is an element of $R_p$, then v(x) is defined and is equal to $\rho^m$ where m is
some rational integer. If m is not negative, that is, if $v(x) \le 1$, then x is called
an integer of $R_p$. The integers of $R_p$ form a ring $E_{R_p}$ all of whose ideals are
powers of the prime ideal (p).

Now let K be any finite algebraic extension of $R_p$. The integers of K are
defined as those elements whose irreducible equations over $R_p$ have coefficients
which are all integers of $R_p$. The valuation v has a unique extension to a
valuation of K, and the integers of K are those elements k such that $v(k) \le 1$.

If we put d(x , y) = v(x — y) in K , then K becomes a metric space which is 
complete, locally compact, and totally disconnected. The symbol K n will denote 
the direct product space of K by itself to n factors and will be called p-adic 
n-space over K. 

A series $\sum a_n$ with terms in K converges if and only if $\lim_{n \to \infty} v(a_n) = 0$.
An analytic function defined on $K^n$ is by definition the sum in its region of
convergence of a power series of the type

$$\sum a_{h_1 h_2 \cdots h_n}\, x_1^{h_1} x_2^{h_2} \cdots x_n^{h_n}$$

which converges for all points $(x_1, x_2, \cdots, x_n)$ in some neighborhood of the
origin in $K^n$. Such a power series can be differentiated term by term to give a
new series convergent in the same region. It has the properties of the
absolutely convergent series of complex analysis.

We conclude this summary with a list of theorems from p-adic analysis. 
These will be referred to throughout the paper by the numbers given them. 

(1) Every integer of R p is a limit of rational integers . 

(2) Let $f(x_1, x_2, \cdots, x_n)$ be an analytic function defined in a neighborhood D of
the origin in $K^n$. Let $f_i(y_1, y_2, \cdots, y_m)$, $(i = 1, 2, \cdots, n)$ be n analytic functions
defined in a neighborhood $D'$ of the origin in $K^m$ such that $f_i(0, 0, \cdots, 0) = 0$,
$(i = 1, 2, \cdots, n)$. Then if in f we substitute for each $x_i$ the series $f_i$, we get a
series $f'(y_1, y_2, \cdots, y_m)$ which converges for all points y in $D'$ which are such
that the point with coordinates $f_i(y_1, y_2, \cdots, y_m)$ is in D.

(3) Let $f_1, f_2, \cdots, f_h$ be h functions of h + m variables $u_1, u_2, \cdots, u_h$;
$x_1, x_2, \cdots, x_m$ such that

$$f_i(0, 0, \cdots, 0) = 0, \quad i = 1, 2, \cdots, h,$$

and such that

$$\frac{D(f_1, f_2, \cdots, f_h)}{D(u_1, u_2, \cdots, u_h)} \ne 0$$

when the $u_i$ and the $x_i$ are all zero. Then the equations

$$f_i(u_1, u_2, \cdots, u_h;\ x_1, x_2, \cdots, x_m) = 0, \quad i = 1, 2, \cdots, h,$$

define the $u_i$ as analytic functions of the $x_i$,

$$u_i = F_i(x_1, x_2, \cdots, x_m), \quad i = 1, 2, \cdots, h,$$

where

$$F_i(0, 0, \cdots, 0) = 0, \quad i = 1, 2, \cdots, h.$$

(4) (Cf. Lutz, [14].) A system of differential equations

$$dy_i/dt = f_i(t, y_1, y_2, \cdots, y_n), \quad i = 1, 2, \cdots, n,$$

where the $f_i$ are analytic functions, has, in some neighborhood of $t = y_i = 0$, one
and only one solution in the form

$$y_i = g_i(t), \quad i = 1, 2, \cdots, n,$$

where the $g_i$ are analytic functions with the initial conditions

$$g_i(0) = 0, \quad i = 1, 2, \cdots, n.$$

(5) If x is an element of K and $v(x) < \rho^{1/(p-1)}$, the series

$$\exp x = 1 + x + x^2/2! + \cdots + x^n/n! + \cdots$$

converges, and $v((\exp x) - 1) = v(x)$. We have, as in ordinary analysis,
$\exp(x + y) = (\exp x)(\exp y)$, when all of these exist.

(6) If x is an element of K, the series

$$\log x = (x - 1) - (x - 1)^2/2 + \cdots + (-1)^{n-1}(x - 1)^n/n + \cdots$$

converges when $v(x - 1) < 1$.

(7) If $v(x) < \rho^{1/(p-1)}$, log (exp x) exists and is equal to x. If $v(x - 1) < \rho^{1/(p-1)}$,
exp (log x) exists and is equal to x. To prove the second statement, we need
only show that $v(\log x) < \rho^{1/(p-1)}$. We have

$$v(\log x) \le \max (v_n), \text{ where } v_n = v[(x - 1)^n/n].$$

If $v(n) = \rho^a$, then $n \ge p^a$, and we have

$$a \le p^{a-1} + p^{a-2} + \cdots + p + 1 = (p^a - 1)/(p - 1) \le (n - 1)/(p - 1).$$

Hence $v_n < \rho^{1/(p-1)}$. Q.E.D.

(8) If an analytic function f(t) is equal to zero for a sequence of values of t
approaching zero and f(0) = 0, then f(t) is identically zero. (The proof is as in
ordinary analysis.)
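
The convergence condition in (5) and the estimate used in (7) can be illustrated with exponents of p. The sketch below works over the rational integers and writes e for the exponent m in $v(x) = \rho^m$, so that the hypothesis of (5) reads e > 1/(p − 1). It relies on Legendre's formula for the power of p dividing n!, a standard fact not stated in the text; the function names are illustrative.

```python
def ord_p(n, p):
    """Exponent of the prime p in the positive integer n."""
    m = 0
    while n % p == 0:
        n //= p
        m += 1
    return m

def ord_p_factorial(n, p):
    """Legendre's formula: exponent of p in n! is the sum of floor(n / p**k)."""
    total, q = 0, p
    while q <= n:
        total += n // q
        q *= p
    return total

def term_order(n, e, p):
    """Exponent of p in x**n / n! when the exponent of p in x is e."""
    return n * e - ord_p_factorial(n, p)

# The estimate in (7): if v(n) = rho**a, then a <= (n - 1)/(p - 1).
for p in (2, 3, 5, 7):
    for n in range(1, 500):
        assert ord_p(n, p) * (p - 1) <= n - 1

# p = 3, e = 1 > 1/(p - 1): the exponents grow, so the terms tend to zero.
print([term_order(n, 1, 3) for n in (1, 3, 9, 27, 81)])

# p = 2, e = 1 = 1/(p - 1): the hypothesis of (5) fails; for n a power of 2
# the exponent stays equal to 1, so the terms do not tend to zero.
print([term_order(2 ** k, 1, 2) for k in range(1, 6)])
```

The second case shows why the inequality in (5) is strict: at the boundary value the general term of the exponential series does not approach zero.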


3. Matrices in a p-adic Field 

Given a field K, we shall denote by $K_n$ the full matric algebra of n-rowed square
matrices over K, and by $K_{nl}$ the Lie algebra obtained from $K_n$ by defining the
commutator by

$$[x, y] = xy - yx.$$

This will be called a pure commutator of degree 2 in x and y. If $c_m$ is a pure
commutator of degree m in x and y, then by definition, $[c_m, x]$ and $[c_m, y]$ are
pure commutators of degree m + 1 in x and y. The full linear group of order n
over K we shall denote simply by $G_K$, since n will be fixed throughout.





A sequence of matrices is said to converge, if for each fixed pair i, j, the ele¬ 
ments in the i th row and j th column of these matrices form a convergent sequence. 
The limit matrix is the matrix whose elements are the limits of these sequences. 

Given a matrix $A = \|a_{ij}\|$ in $K_n$, there can be defined a weak type of
valuation V in $K_n$ by putting $V(A) = \max v(a_{ij})$. This valuation satisfies the
conditions

(a') $V(A + B) \le \max V(A), V(B)$,

(b') $V(tA) = v(t)V(A)$, t in K,

(c') $V(AB) \le V(A)V(B)$.
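
Conditions (a'), (b'), (c') can be verified on small rational matrices. The sketch below assumes p = 5 and the normalization $\rho = 1/p$ (neither is fixed by the text), and the helper names are illustrative.

```python
from fractions import Fraction

P = 5                   # the prime; an arbitrary choice for this sketch
RHO = Fraction(1, P)    # assumed normalization v(p) = rho = 1/p

def v(x):
    """Valuation of a rational number: RHO**ord_p(x), with v(0) = 0."""
    x = Fraction(x)
    if x == 0:
        return Fraction(0)
    m, num, den = 0, x.numerator, x.denominator
    while num % P == 0:
        num //= P
        m += 1
    while den % P == 0:
        den //= P
        m -= 1
    return RHO ** m

def V(A):
    """Matrix valuation: the maximum of v over all entries."""
    return max(v(a) for row in A for a in row)

def mat_add(A, B):
    return [[a + b for a, b in zip(ra, rb)] for ra, rb in zip(A, B)]

def mat_mul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]

def mat_scale(t, A):
    return [[t * a for a in row] for row in A]

A = [[Fraction(5), Fraction(1, 5)], [Fraction(25), Fraction(10)]]
B = [[Fraction(5), Fraction(0)], [Fraction(1), Fraction(5, 2)]]
t = Fraction(5, 3)

print(V(mat_add(A, B)) <= max(V(A), V(B)))   # (a')
print(V(mat_scale(t, A)) == v(t) * V(A))     # (b')
print(V(mat_mul(A, B)) <= V(A) * V(B))       # (c')
```

Only the inequality holds in (c'), which is why V is called a weak type of valuation.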

Now a sequence of matrices $A_m = \|a_{ij}^{(m)}\|$ with elements in a p-adic field K
converges if and only if

$$\lim_{m \to \infty} v(a_{ij}^{(m)} - a_{ij}^{(m+1)}) = 0 \text{ for all } i, j.$$

We have, however,

$$v(a_{ij}^{(m)} - a_{ij}^{(m+1)}) \le V(A_m - A_{m+1})$$

for each i and j, and there exists for each m a pair of integers i and j such that
the equality sign holds. We thus get the same condition for the convergence
of a sequence of matrices as for a sequence of elements in K, using now the
valuation V. It can be seen, therefore, that in dealing with sequences and
series of matrices, we get automatically the same theorems for commutative
matrices with the valuation V that are proved for elements in K with the
valuation v, provided these theorems are proved using only the fact that the valuation
v satisfies the conditions (a'), (b'), (c').

In particular, we have the following theorems. 

Theorem 1. If X is in $K_n$ and $V(X) < \rho^{1/(p-1)}$, the series

$$\exp tX = I + tX + \cdots + t^n X^n/n! + \cdots \quad (I \text{ the identity matrix})$$

converges for $v(t) \le 1$ and we have $V(\exp tX - I) = V(tX)$. Also for any matrix
Y there exists a real number r > 0 such that exp tY converges for $v(t) \le r$.

Theorem 2. If $V(A - I) < 1$, the series

$$\log A = (A - I) - (A - I)^2/2 + \cdots + (-1)^{n-1}(A - I)^n/n + \cdots$$

converges. If $V(X) < \rho^{1/(p-1)}$, then log (exp X) is defined and equal to X. Also if
$V(A - I) < \rho^{1/(p-1)}$, then exp (log A) is defined and equal to A.

If X and Y are commutative matrices, it can be shown in the usual way that 
exp (X + Y) = (exp X)(exp Y) when these exist. We shall need the following 
generalization of this fact for non-commutative matrices: 

Theorem 3. Let X and Y be matrices in $K_n$ such that

$$V(X),\ V(Y) < \rho^{1/(p-1)}.$$

i) There is a matrix Z defined by exp Z = (exp X)(exp Y).

ii) $Z = X + Y + \lim_{m \to \infty} f_m$, where the $f_m$ are linear combinations of higher
commutators of X and Y with rational coefficients.

Proof. i) By Theorem 1, $V((\exp X) - I) = V(X)$ and $V((\exp Y) - I) =
V(Y)$. Hence

$$V[(\exp X)(\exp Y) - I] = V[(\exp X - I)(\exp Y - I) + (\exp X - I) + (\exp Y - I)] \le \max V(X), V(Y).$$

By Theorem 2, therefore, Z = log [(exp X)(exp Y)] exists and exp Z =
(exp X)(exp Y).

ii) If X and Y are real matrices, it has been shown by Hausdorff, [7], that

$$Z = X + Y + \sum_{r=2}^{\infty} \sum_{s=1}^{s_r} a_{rs} f_{rs}(X, Y),$$

where $f_{rs}$ is a pure commutator of degree r in X and Y and each $a_{rs}$ is a rational
number. Each $s_r$ is finite.

It follows that

$$Z = X + Y + \lim_{m \to \infty} \sum_{r=2}^{m} \sum_{s=1}^{s_r} a_{rs} f_{rs}(X, Y).$$

Each $f_{rs}$ can be expressed as a polynomial which is homogeneous of degree r
in X and Y. Let us now put

$$F_r(X, Y) = \sum_{s=1}^{s_r} a_{rs} f_{rs}(X, Y).$$

Then $Z = X + Y + \lim_{m \to \infty} \sum_{r=2}^{m} F_r(X, Y)$, where each $F_r$ is a homogeneous
polynomial of degree r in X and Y.

Expressing this series as $n^2$ series in the elements of the matrices therein, we
have

$$Z_{ij} = X_{ij} + Y_{ij} + \lim_{m \to \infty} \sum_{r=2}^{m} F^r_{ij}(X_{11}, \cdots, X_{nn};\ Y_{11}, \cdots, Y_{nn}), \quad i, j = 1, 2, \cdots, n.$$

It is also true, however, that for sufficiently small values of the elements, the
series

$$Z_{ij} = \{\log [(\exp X)(\exp Y)]\}_{ij}, \quad i, j = 1, 2, \cdots, n,$$

converge. These may be written

$$Z_{ij} = X_{ij} + Y_{ij} + \lim_{m \to \infty} \sum_{r=2}^{m} P^r_{ij}(X_{11}, \cdots, X_{nn};\ Y_{11}, \cdots, Y_{nn}), \quad i, j = 1, 2, \cdots, n,$$

where $P^r$ is the sum of terms of degree r in the expansion. We now have the
equivalent of two power series expansions for $Z_{ij}$ in terms of the $n^2$ real variables
$X_{ij}$, $Y_{ij}$. The two series must therefore be identical term by term.

If now X and Y are p-adic matrices, it follows from i) that

$$Z_{ij} = X_{ij} + Y_{ij} + \lim_{m \to \infty} \sum_{r=2}^{m} P^r_{ij}, \quad i, j = 1, 2, \cdots, n,$$

and so

$$Z_{ij} = X_{ij} + Y_{ij} + \lim_{m \to \infty} \sum_{r=2}^{m} F^r_{ij}, \quad i, j = 1, 2, \cdots, n.$$

Returning to the matric expressions, we have

$$Z = X + Y + \lim_{m \to \infty} \sum_{r=2}^{m} F_r(X, Y) = X + Y + \lim_{m \to \infty} \sum_{r=2}^{m} \sum_{s=1}^{s_r} a_{rs} f_{rs}(X, Y),$$

since for any finite value of m these are identical. Q.E.D.
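
The first commutator term of the series in Theorem 3 can be seen in a case where the series terminates. For strictly upper triangular 3-rowed matrices every product of three factors vanishes, so all pure commutators of degree 3 or more are zero and the series reduces to Z = X + Y + ½[X, Y]. The coefficient ½ of the degree 2 term is the classical one from Hausdorff's formula and is not stated in the text above. The sketch below checks exp Z = (exp X)(exp Y) exactly over the rational numbers; the matrices and helper names are illustrative.

```python
from fractions import Fraction

def mat_mul(A, B):
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def mat_add(A, B):
    return [[a + b for a, b in zip(ra, rb)] for ra, rb in zip(A, B)]

def mat_scale(c, A):
    return [[c * a for a in row] for row in A]

def identity(n):
    return [[Fraction(int(i == j)) for j in range(n)] for i in range(n)]

def exp_nilpotent(A):
    """exp A for a nilpotent n-rowed matrix: the series stops after A**(n-1)."""
    n = len(A)
    result, term = identity(n), identity(n)
    for k in range(1, n):
        term = mat_scale(Fraction(1, k), mat_mul(term, A))  # A**k / k!
        result = mat_add(result, term)
    return result

def commutator(A, B):
    return mat_add(mat_mul(A, B), mat_scale(-1, mat_mul(B, A)))

F = Fraction
X = [[F(0), F(1), F(2)], [F(0), F(0), F(3)], [F(0), F(0), F(0)]]
Y = [[F(0), F(4), F(5)], [F(0), F(0), F(6)], [F(0), F(0), F(0)]]

Z = mat_add(mat_add(X, Y), mat_scale(F(1, 2), commutator(X, Y)))
product = mat_mul(exp_nilpotent(X), exp_nilpotent(Y))

print(exp_nilpotent(Z) == product)              # the commutator term is needed
print(exp_nilpotent(mat_add(X, Y)) == product)  # X + Y alone does not suffice
```

Since X and Y do not commute here, the second comparison fails, which is the point of the generalization in Theorem 3.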

4. p-adic Lie Groups 

We make the following definition as in the theory of real and complex Lie 
groups: 

Definition. A p-adic Lie group G is a topological group equipped with a
homeomorphic mapping of a neighborhood N of its identity element onto a
neighborhood of the origin of $K^m$ which satisfies the condition: If X, Y, Z are in N,
XY = Z, and X is mapped on $(x_1, x_2, \cdots, x_m)$ in $K^m$, etc., then

$$z_i = f_i(x_1, x_2, \cdots, x_m;\ y_1, y_2, \cdots, y_m), \quad i = 1, 2, \cdots, m,$$

where the $f_i$ are analytic functions. G is then said to have local analytic coordinates.

The group $G_K$ can be shown to be a p-adic Lie group. We introduce a metric
topology by defining the distance between two elements A and B as V(A − B).
Then if we put each matrix A in the form $A = I + \|a_{ij}\|$, we can define the $n^2$
p-adic numbers $a_{ij}$ as the coordinates of A. There clearly exists a real number
q > 0 such that for any set of $a_{ij}$ in K satisfying the inequality $v(a_{ij}) \le q$, the
matrix whose coordinates are these numbers is non-singular and in $G_K$. It
follows that there exists a neighborhood of I in $G_K$ homeomorphic to a
neighborhood of the origin in $K^{n^2}$. Since multiplication is a polynomial operation
in this group, these coordinates are analytic. Since K is locally compact,
so is $G_K$.

A complete system of neighborhoods of I is furnished by the set of spheres
$S_r$ consisting of those matrices whose coordinates satisfy for some fixed real
number r the inequality

$$v(a_{ij}) \le r.$$

Since v is a discrete valuation, these spheres are both open and closed.



For r < 1, $S_r$ is always a subgroup. From the nature of the valuation, the
product of two elements in $S_r$ is again in $S_r$. The existence of inverses follows
from the fact that the determinant of any matrix in $S_r$ has value 1. These
spheres form an infinite descending chain of open and closed subgroups whose
intersection is I, so $G_K$ is totally disconnected. It is the existence of these open
subgroups which creates most of the difference between the theory of p-adic
Lie groups and that of real Lie groups.
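
The subgroup property of the spheres can be checked on 2-rowed rational matrices. The sketch below takes p = 3 and the sphere of matrices A for which every entry of A − I is divisible by p, which corresponds to $r = \rho$; these choices and the helper names are assumptions made for illustration.

```python
from fractions import Fraction

def ord_p(x, p):
    """Exponent of p in the nonzero rational number x."""
    x = Fraction(x)
    if x == 0:
        raise ValueError("ord_p(0) is infinite")
    m, num, den = 0, x.numerator, x.denominator
    while num % p == 0:
        num //= p
        m += 1
    while den % p == 0:
        den //= p
        m -= 1
    return m

def in_sphere(A, p, k):
    """True if every entry a_ij of A - I satisfies v(a_ij) <= rho**k."""
    for i, row in enumerate(A):
        for j, a in enumerate(row):
            d = Fraction(a) - (1 if i == j else 0)
            if d != 0 and ord_p(d, p) < k:
                return False
    return True

def mul2(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def det2(A):
    return Fraction(A[0][0] * A[1][1] - A[0][1] * A[1][0])

def inv2(A):
    d = det2(A)
    return [[A[1][1] / d, -A[0][1] / d], [-A[1][0] / d, A[0][0] / d]]

p, k = 3, 1
A = [[4, 6], [3, -2]]      # A - I has all entries divisible by 3
B = [[1, 3], [9, 10]]      # likewise for B - I

print(in_sphere(A, p, k), in_sphere(B, p, k))
print(in_sphere(mul2(A, B), p, k))    # closed under products
print(ord_p(det2(A), p) == 0)         # the determinant has value 1
print(in_sphere(inv2(A), p, k))       # closed under inverses
```

The determinant is a unit because it is congruent to 1 modulo p, which is the fact the text uses for the existence of inverses.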

5. The Lie Algebras of $G_K$ and its Subgroups

The Lie algebra (infinitesimal group) of a p-adic Lie group may be defined 
exactly as in the case of real Lie groups (cf. Pontrjagin, [15]). Many relations 
here may be proved exactly as in the ordinary case, and so their proofs will be 
omitted. We shall depart from the usual procedure, however, by using the Lie 
algebra to obtain conditions for the subgroups of a Lie group to be themselves 
Lie groups. 

An analytic curve in $G_K$ is an analytic function

$$f(t) = I + X_1 t + X_2 t^2 + \cdots$$

where the $X_i$ are matrices in $K_n$. The series converges to a non-singular matrix,
that is, to an element of $G_K$, for all t such that v(t) is less than some fixed real
number q. The expressions $f_i(t)$ denote the coordinates of f(t), and the tangent
(at I) of f(t) is the matrix $X_1$, as usual.

f(t) is a one-parameter subgroup if $f(t_1 + t_2) = f(t_1)f(t_2)$. The analytic curve
exp tX for any matrix X is such a subgroup. The one-parameter subgroups
here differ from those of ordinary Lie theory in the following respect: If q > 0
is any real number, then those values of t for which $v(t) \le q$ and for which f(t)
is defined give values of f(t) which form an entire group, not merely a local group.
The fact that products exist in this group follows from the fact that v satisfies the
condition (b).

From Theorems 1 and 2 it is seen that there exists a neighborhood of I in
$G_K$ in which for every element A, the matrix $X_A = \log A$ is defined and $\exp tX_A$
is a one-parameter subgroup which is defined for $v(t) \le 1$ and passes through A
for t = 1. All functions of the form exp tX are one-parameter subgroups in
their region of convergence. A converse to this statement is contained in the
following theorem.

Theorem 4. In $G_K$ there exists a neighborhood of the identity in which there
lies one and only one one-parameter subgroup with a given tangent.

Using the uniqueness theorem for differential equations, (4), this can be
proved exactly as in the theory of real Lie groups (Pontrjagin, [15], pp. 185-187).
It follows that there exists a sphere S about I in which the only
one-parameter subgroups are the exponential functions. Given any subgroup H of
$G_K$ and any sphere $S_r$ contained in S we shall denote by $H_r$ the subgroup $H \cap S_r$.

Definition. The Lie algebra L of a subgroup H of $G_K$ is the set of tangents
to analytic curves lying in some $H_r$. This is clearly independent of the choice
of the number r.



The commutator of two elements X, Y in the Lie algebra of $G_K$ is defined as
in Pontrjagin, [15], p. 238, and it can be shown that [X, Y] = XY − YX. The
Lie algebra of $G_K$ is then the Lie algebra $K_{nl}$, since any X in $K_n$ is the tangent
to an analytic curve exp tX in $G_K$. From the definition it follows as usual that
the set L associated with a subgroup of $G_K$ is a subalgebra of $K_{nl}$; it also follows
from the definition and Theorem 3 that it is an ideal (invariant subalgebra) if
and only if $H_r$ is an invariant subgroup.

6. Analytic Subgroups and Subalgebras 

Definition. Two subgroups $H'$ and $H''$ of $G_K$ are equivalent if there exists a
sphere $S_r$ around I such that $H'_r = H''_r$.

It can be shown that this is the same identification that is used in the theory
of real local Lie groups, where $H'$ and $H''$ are identified if their intersection is
open in each. It is the object of this section to use this equivalence relation
in obtaining a one-to-one correspondence between classes of equivalent
subgroups of $G_K$ and subalgebras of $K_{nl}$.

It has been shown that every subgroup of $G_K$ has a Lie algebra which is a
subalgebra of $K_{nl}$. Conversely, it will now be shown that if L is any subalgebra
of $K_{nl}$, there is a closed subgroup of $G_K$ whose Lie algebra is L. Let H be the
totality of elements of $G_K$ of the form exp X, where X is an element of L, and
such that $V(\exp X - I) < \rho^{1/(p-1)}$. By Theorem 1 we must have $V(X) <
\rho^{1/(p-1)}$. If X and Y are in L and $V(X), V(Y) < \rho^{1/(p-1)}$, we have from Theorem 3
that (exp X)(exp Y) = exp Z; here $V(\exp Z - I) < \rho^{1/(p-1)}$ and Z is in L
because L is locally compact and Z is a limit of elements of L. H is therefore
closed under multiplication. H must clearly contain the identity and the
inverses of all its elements, since exp 0 = I and exp (−X) is the inverse of exp X.
Hence H is a group. H is closed since it is a homeomorphic map of a bounded
and closed subset of the locally compact space L.

Definition. A subgroup H of G K is analytic if it is equivalent to a subgroup 
constructed from a subalgebra as above . 

Theorem 5. If H is an analytic subgroup of $G_K$ and f(t) is any analytic curve
in H with tangent $a_1$, there exists in H a one-parameter subgroup with tangent $a_1$.

Proof. Let L be the subalgebra from which H has been constructed, and
let $S_r$ be a sphere contained in S. Let

$$f(t) = I + a_1 t + a_2 t^2 + \cdots$$

be the given analytic curve. Let $f(t_1)$ be some point on this curve in $H_r$. We
can then define

$$X_1 = \log f(t_1)$$

and the one-parameter subgroup

$$g_1(u) = \exp (uX_1/t_1)$$

passes through $f(t_1)$ for $u = t_1$ and lies in $H_r$ for all values of u such
that $V[g_1(u) - I] \le r$. The matrix $X_1$ is in L since the group H is defined by
L and $\exp X_1$ is in H.

The tangent to the curve $g_1(u)$ is

$$X'_1 = X_1/t_1 = (1/t_1) \log f(t_1) = (1/t_1)[(a_1 t_1 + a_2 t_1^2 + \cdots) - (a_1 t_1 + a_2 t_1^2 + \cdots)^2/2 + \cdots] = a_1 + t_1 \epsilon$$

where $\epsilon$ approaches zero with $t_1$. Let us now consider a sequence of parameters
$t_1, t_2, \cdots$ which approach zero. The corresponding one-parameter subgroups
$g_1(u), g_2(u), \cdots$ have tangents $X'_1 = a_1 + t_1 \epsilon$, $X'_2 = a_1 + t_2 \epsilon$, $\cdots$

H is closed, and so for any fixed value of u, $\lim_{n \to \infty} g_n(u)$ is a point in H. The
set of these points for all values of u under consideration can be shown to form
a one-parameter subgroup g(u) which is clearly not merely the identity. From
Theorem 4, g(u) must be an exponential function, so it is clear that its tangent
must be the limit of the tangents $X'_n$, which is $a_1$. The element $a_1$ must be
in L, since L is locally compact. Q.E.D.

It follows from this theorem that if H is an analytic subgroup defined from
a subalgebra L, then the Lie algebra of H is L. We thus obtain a one-to-one
correspondence between the subalgebras of $K_{nl}$ and the classes of equivalent
analytic subgroups of $G_K$.

Theorem 6. Let H be an analytic subgroup of $G_K$ with Lie algebra L of order m.
There exists in $G_K$ a system D of local analytic coordinates $a''_i$ in a neighborhood
$S_r$ of I such that an element A of $G_K$ is in $H_r$ if and only if

$$a''_i = 0, \quad i = m + 1, \cdots, n^2.$$

The system D arises from the original system by an analytic transformation, and so
by (2), we may say that H is defined by analytic functions.

Proof. Let X be any matrix in $K_{nl}$ such that exp X exists in $G_K$. We define
the functions

$$h_i(X) = h_i(x_1, x_2, \cdots, x_{n^2}) = (\exp X)_i, \quad i = 1, 2, \cdots, n^2,$$

where the $x_i$ are the elements of the matrix X in $K_n$ and the $(\exp X)_i$ are the
coordinates of this element in $G_K$. We have then $h_i(0, 0, \cdots, 0) = 0$ and
$(\partial h_i/\partial x_j)_0 = \delta_{ij}$, so the Jacobian of these functions is not zero at the origin. It
follows from (3) that the equations

$$a_i = h_i(a'_1, a'_2, \cdots, a'_{n^2}), \quad i = 1, 2, \cdots, n^2,$$

can be solved for $a'_i$ in terms of the original coordinates $a_1, a_2, \cdots, a_{n^2}$. By
(2), therefore, the numbers $a'_i$ furnish a new local analytic coordinate system
in $G_K$.

What we have done is to assign to each element A of $G_K$ near I the elements
of log A as its coordinates. Now given a subalgebra L of $K_{nl}$, we can change
the basis of $K_{nl}$ so that a matrix of $K_{nl}$ is in L if and only if its last $n^2 - m$
coordinates are all zero. Such a transformation is algebraic, and analytic,
and has a non-vanishing Jacobian at the origin. Relative to this basis, log A
has $n^2$ new elements, and the last $n^2 - m$ of these are zero if and only if log A
is in L. We assign these as coordinates in $G_K$ and the conditions of the theorem
are satisfied. Q.E.D.

It follows from this theorem that an analytic subgroup H of G K has a local 
analytic coordinate system of dimension m and so is a p-adic Lie group. 

Theorem 7. Let H be an analytic subgroup of $G_K$ with Lie algebra L. Let
$X^1, X^2, \cdots, X^m$ be a basis for L and $f_i(t)$ be analytic curves in H with tangents
$X^i$ respectively $(i = 1, 2, \cdots, m)$. Then there exists a neighborhood of I in H
all of whose elements may be put in the form

$$f_1(t_1) \cdots f_m(t_m).$$

Proof. Let us define the functions

$$F_i(t_1, t_2, \cdots, t_m) = [f_1(t_1) f_2(t_2) \cdots f_m(t_m)]_i, \quad i = 1, 2, \cdots, m,$$

i denoting the coordinate of the expression in brackets relative to a coordinate
system as described in the last theorem. These give an analytic mapping of a
neighborhood of the origin of $K^m$ into a neighborhood of I in H. From the
independence of the tangents to the $f_i$ it can be shown that the Jacobian of these
functions does not vanish at the origin. Hence we can solve these equations
for the $t_i$ in terms of the $F_i$ and so by the last theorem the mapping defined by
these functions covers an entire neighborhood of I in H. Q.E.D.

7. Analyticity of Subgroups 

We have seen that an analytic subgroup H of $G_K$ is closed and defined by
analytic functions. It is now desirable to determine whether these two
conditions are sufficient for analyticity. It will be shown that if $K = R_p$, any closed
subgroup H is analytic and so is a p-adic Lie group. If $K \ne R_p$, however,
this fact does not hold, as in the case of complex Lie groups. The subset of
elements with coordinates in $R_p$ is a closed subgroup but is obviously not analytic.
We must therefore divide our argument into two parts.

Let H be a closed subgroup of $G_K$ and let L be its Lie algebra. Let $H'$ be an
analytic subgroup corresponding to L and $H'_r$, $H_r$ be intersections of $H'$ and H
respectively with some $S_r$ contained in S.

Lemma. Let A be any element in $H_r$ and let $X_A = \log A$. If $K = R_p$, $H_r$
contains $\exp tX_A$ for all t in K such that $v(t) \le 1$. If $K \ne R_p$, but H is defined
by analytic functions, the same holds.

Proof. i) Suppose $K = R_p$. If n is any rational integer,

$$\exp nX_A = (\exp X_A)^n$$

and so the expression on the left is in $H_r$, since $\exp X_A$ is in $H_r$. By (1),
however, if t is any element of $R_p$ such that $v(t) \le 1$, it is a limit of a sequence of
rational integers $t_m$. Hence

$$\exp tX_A = \lim_{m \to \infty} \exp t_m X_A$$

and is in $H_r$ since H is closed.

ii) Suppose $K \ne R_p$ but that $H_r$ is defined by analytic functions, that is,
there exist analytic functions $F_i(x_1, x_2, \cdots, x_{n^2})$ such that a point A in $G_K$
with coordinates $(a_1, a_2, \cdots, a_{n^2})$ is in $H_r$ if and only if $F_i(a_1, a_2, \cdots, a_{n^2}) = 0$
for all i. By the above argument, $\exp tX_A$ is in $H_r$ for all t in $R_p$ such that
$v(t) \le 1$. Hence we have

$$F_i[(\exp tX_A)_1, (\exp tX_A)_2, \cdots, (\exp tX_A)_{n^2}] = 0$$

for all i, and t in $R_p$. By (2) the $F_i$ are analytic functions of t, and by (8)
they must be zero for all t in K. Q.E.D.

Theorem 8. Let H satisfy the conditions in the lemma. Then H is analytic.

Proof. It follows from the lemma that $H'_r$ contains $H_r$. By Theorem 7,
a neighborhood of I in $H'_r$ can be defined by analytic curves in $H_r$. This
neighborhood is completely in $H_r$, so H is equivalent to the analytic
subgroup $H'$. Q.E.D.

This theorem completes the setting up of the one-to-one correspondence
between subgroups of $G_K$ and subalgebras of $K_{nl}$.

8. Some Special Groups 

Before discussing the special groups, it will be necessary to prove a theorem 
which occurs in the ordinary theory of Lie groups. 

Theorem 9. Let H be an analytic subgroup of $G_K$ with subalgebra L. Let
$H_1$ be an invariant subgroup of H with subalgebra $L_1$, an ideal in L. Let $\bar H$ be
an analytic subgroup with Lie algebra $\bar L$ and such that there exists a continuous
homomorphism of H onto $\bar H$ with $H_1$ being the set of elements mapped on the
identity I. Then $\bar L = L/L_1$.

Proof: We must first show that any continuous homomorphic map of a
one-parameter subgroup is an analytic curve. It is sufficient to prove that any
continuous homomorphic map of the ring $E_K$ of integers of K into $G_K$ is analytic.
We need only prove this for $E_{R_p}$ since $E_K$ is a direct product of a finite number
of the $E_{R_p}$. Let $t \to f(t)$ be such a mapping into some $S_r \subset S$ in $G_K$. We
know that there exists an X such that

$$f(1) = \exp X,$$

and so

$$f(n) = \exp nX$$

for any rational integer n, since f is a homomorphic mapping. Since f is
continuous, we have, by (1),

$$f(t) = \exp tX$$

for any t in $E_{R_p}$. This mapping is analytic.

It follows that if f(t) is a one-parameter subgroup in $H_r$, its map is an
analytic curve and one-parameter subgroup in $\bar H_r$ and all one-parameter subgroups
in $\bar H_r$ are so obtained. This establishes a homomorphism between L and $\bar L$.
Clearly, $L_1$ is the set mapped onto the zero element and so $\bar L = L/L_1$. Q.E.D.

Given a Lie algebra L in $K_{nl}$, we know that there is an infinite number of
analytic subgroups of $G_K$ whose Lie algebra is L. We are interested in finding,
for a given L, an analytic subgroup H, defined by algebraic conditions, whose
Lie algebra is L.

Let $\mathfrak{A}$ be an associative algebra of order n over K and $D(\mathfrak{A})$ be the Lie algebra
of derivations of $\mathfrak{A}$. Let H be the group of automorphisms of $\mathfrak{A}$. H and $D(\mathfrak{A})$
may be imbedded in $G_K$ and $K_{nl}$ respectively.

Theorem 10. The Lie algebra of H is $D(\mathfrak{A})$.

Proof. Let X be an element of D(2l) such that exp X is defined. Let a 
and b be any two elements of 21. Then 

[(exp tX)-a][(ex p tX)-b] = X (l/pl(w — p)l)(X p a)(X n_p 6)1 

n-o Lp-o J 

= Z <7»! I" E Cn. P (X p a)(X"- ,, 6)l 

n—0 L.p—0 J 

= E t n /n\ X n (ab ) 2 

n-0 

= (exp tX)ab. 

Hence exp tX is an automorphism in 21 for every t for which it is defined. 

Conversely, let $A$ be an element of $H$ near $I$ so that $A = \exp X$ for some matrix $X$. $H$ is analytic, being algebraically defined, so $\exp tX$ is in $H$ and hence is an automorphism in $\mathfrak{A}$ for all values of $t$ for which it is defined. We have, therefore,

$$(\exp tX)\cdot ab = [(\exp tX)\cdot a][(\exp tX)\cdot b].$$

Expanding these series in powers of $t$ and multiplying out,

$$ab + t(X\cdot ab) + t\epsilon_1 = ab + t\,a(X\cdot b) + t\,(X\cdot a)b + t\epsilon_2$$

and

$$X\cdot ab + \epsilon_1 = a(X\cdot b) + (X\cdot a)b + \epsilon_2.$$

The quantities $\epsilon_1$, $\epsilon_2$ approach zero with $t$. Taking the limit of both sides as $t$ approaches zero, we find $X$ is a derivation. Q.E.D.
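The first half of the proof can be checked on a small case. The sketch below is an illustration only, not part of the paper: it assumes the algebra of 2 x 2 rational matrices and the hypothetical inner derivation $D(a) = Na - aN$ with $N$ nilpotent, so that $D^3 = 0$ and the exponential series is a finite sum that can be evaluated exactly.

```python
from fractions import Fraction

def matmul(a, b):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def add(a, b):
    """Entrywise sum of two 2x2 matrices."""
    return [[a[i][j] + b[i][j] for j in range(2)] for i in range(2)]

def scale(c, a):
    """Scalar multiple c * a."""
    return [[c * a[i][j] for j in range(2)] for i in range(2)]

# Hypothetical choice for illustration: N is nilpotent, hence D below is a
# nilpotent derivation of the matrix algebra with D^3 = 0.
N = [[Fraction(0), Fraction(1)], [Fraction(0), Fraction(0)]]

def D(a):
    """The derivation D(a) = N a - a N."""
    return add(matmul(N, a), scale(-1, matmul(a, N)))

def exp_tD(t, a):
    """(exp tD) a = a + t D(a) + (t^2 / 2) D^2(a); higher terms vanish."""
    t = Fraction(t)
    d1 = D(a)
    d2 = D(d1)
    return add(add(a, scale(t, d1)), scale(t * t / 2, d2))

a = [[Fraction(1), Fraction(2)], [Fraction(3), Fraction(4)]]
b = [[Fraction(0), Fraction(1)], [Fraction(5), Fraction(-2)]]
t = Fraction(3, 7)

# Leibniz rule: D is a derivation.
print(D(matmul(a, b)) == add(matmul(D(a), b), matmul(a, D(b))))
# exp tD respects products, as in the proof of Theorem 10.
print(exp_tD(t, matmul(a, b)) == matmul(exp_tD(t, a), exp_tD(t, b)))
```

Exact rational arithmetic is used so that the two sides can be compared with `==` rather than up to rounding.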

Let $\mathfrak{A}$ be an associative algebra over $K$ and $J$ be an involution (involutorial anti-automorphism).⁸ An element $a$ in $\mathfrak{A}$ is called $J$-skew if $a^J = -a$. It is called $J$-orthogonal if $aa^J = a^J a = k \cdot 1$, where $k$ is an element of $K$. The set $\mathfrak{S}_J$ of $J$-skew elements is a Lie algebra over $K$ and the set $\mathfrak{G}_J$ of $J$-orthogonal elements is a multiplicative group.


⁷ For this step, see Jacobson, [10], p. 207.

⁸ For the required results on involutions, see Jacobson, [9], and Albert, [3], ch. X.





Theorem 11. The Lie algebra of $\mathfrak{G}_J$ is $\mathfrak{S} = \mathfrak{S}_J \oplus K$.

Proof. The elements of $\mathfrak{S}$ are in the form $z = a + k$, where $a$ is in $\mathfrak{S}_J$ and $k$ is in $K$. When $\exp z$ exists, it is in $\mathfrak{A}$, since $\mathfrak{A}$ has a finite basis and is closed. Hence $(\exp z)^J$ is defined. Any finite number of terms of the series $\exp z^J$ is equal to the corresponding number of terms of $(\exp z)^J$, so $\exp z^J$ exists and is equal to $(\exp z)^J$. Hence when $\exp(a + k)$ exists,

$$[\exp(a + k)][\exp(a + k)]^J = [\exp(a + k)][\exp(-a + k)] = \exp 2k \in K,$$

and so $\exp(a + k)$ is in $\mathfrak{G}_J$.

Conversely, if $(\exp tz)(\exp tz)^J = (\exp tz)^J(\exp tz) = k \in K$ for all $t$ in some neighborhood of zero, then $zz^J = z^J z$ and $(\exp z)(\exp z)^J = (\exp z)(\exp z^J) = \exp(z + z^J) = k$. If $z$ is sufficiently small, $\log k$ exists and $z + z^J = \log k = k'$.

It is known that any $z$ in $\mathfrak{A}$ can be written uniquely as

$$z = b + d \quad\text{where}\quad b^J = -b,\ d^J = d.$$

Hence $z + z^J = b + d + b^J + d^J = 2d = k'$. It follows that $d = k'/2$. Hence $z$ must be in the form $a + k$ and so the Lie algebra of $\mathfrak{G}_J$ is $\mathfrak{S}$. Q.E.D.

9. Groups of Simple Lie Algebras 

A simple p-adic Lie group is defined as usual as a group with no invariant subgroup which is not discrete or equivalent to the whole group. These, of course, are the groups with the simple Lie algebras.

A simple Lie algebra over any field $K$ of characteristic zero has been shown by Landherr, [13], to be normal simple over its extended center, which is a finite algebraic extension of $K$. We shall, therefore, restrict ourselves to the p-adic Lie groups whose Lie algebras are the “non-exceptional” normal simple Lie algebras over $K$. The normal simple associative algebras over a p-adic field $K$ have been classified by Hasse, [6]. The Lie algebras which are normal simple over any field $K$ of characteristic zero are shown by Jacobson, [12], to arise, except for the finite number of exceptional cases, from associative algebras over $K$ in one of the following ways:

1) (Type $A_I$) Let $\mathfrak{A}$ be a normal simple associative algebra over $K$, and let $\mathfrak{A}_l$ be the Lie algebra obtained in the usual way. The derived algebra $\mathfrak{A}_l'$ is normal simple.

2) (Types B, C, D) Let $\mathfrak{A}$ be a normal simple associative algebra over $K$ with an involution $J$ of first kind. Then $\mathfrak{S}_J$ is a normal simple Lie algebra.

3) (Type $A_{II}$) Let $\mathfrak{A}$ be a simple associative algebra over $K$ with center $\mathfrak{Z} = K(q)$, where $q^2$, but not $q$, is in $K$; $J$ is an involution of second kind. The derived algebra $\mathfrak{S}_J'$ is a normal simple Lie algebra.

Using the results of Hasse, Jacobson has classified the normal simple Lie algebras over $K$ and determined their automorphism groups. Any simple Lie algebra over $K$ may be imbedded in $K_{nl}$ and its automorphism group in $G_K$. The automorphism groups are analytic, being algebraically defined.





We wish to find for each normal simple Lie algebra $L$ an analytic group $H$ in $G_K$ whose Lie algebra is $L$. Any other group in $G_K$ with the same Lie algebra is equivalent to $H$.

We must consider separately the three cases:

1) The group $H$ of automorphisms of $\mathfrak{A}$ has a Lie algebra isomorphic to $\mathfrak{A}_l'$ in Case 1.

Proof. Since $\mathfrak{A}$ is the enveloping algebra of $\mathfrak{A}_l'$, and is simple, we have

$$\mathfrak{A}_l = C \oplus \mathfrak{A}_l'$$

where $C$ is abelian and hence must be the center of $\mathfrak{A}_l$ (cf. Jacobson, [8]). $C$ is then the center of $\mathfrak{A}$ and so is isomorphic to $K$. Hence

$$\mathfrak{A}_l' \cong \mathfrak{A}_l/K.$$

It is known, however (Jacobson, [10]), that when $\mathfrak{A}$ is normal simple,

$$D(\mathfrak{A}) \cong \mathfrak{A}_l/K.$$

By Theorem 10, therefore, the Lie algebra of $H$ is isomorphic to $\mathfrak{A}_l'$. Q.E.D.

2) The group $H$ of automorphisms of $\mathfrak{S}_J$ has Lie algebra $\mathfrak{S}_J$.

Proof. Let $K$ represent the abelian Lie algebra of scalar matrices. Let $\mathfrak{S} = \mathfrak{S}_J \oplus K$ and let $\mathfrak{G}_J$ be the group of $J$-orthogonal elements of $\mathfrak{A}$.

We have shown that the Lie algebra of $\mathfrak{G}_J$ is $\mathfrak{S}$. Now $\mathfrak{G}_J$ is continuously homomorphic to $H$ with $K_0$ being the set of elements mapped onto $I$ (cf. Jacobson, [9]; $K_0$ is the multiplicative group of $K$). It is easily seen that the Lie algebra of $K_0$ is $K$, so by Theorems 9 and 11, the Lie algebra of $H$ is isomorphic to $\mathfrak{S}/K \cong \mathfrak{S}_J$. Q.E.D.

3) Let $H$ be the group of automorphisms of $\mathfrak{S}_J'$ induced by inner automorphisms of $\mathfrak{A}$. The Lie algebra of $H$ is $\mathfrak{S}_J'$.

Proof. The enveloping algebra of $\mathfrak{S}_J'$ is $\mathfrak{A}$ (cf. Jacobson, [11], p. 182). We have, therefore, as in Case 1),

$$\mathfrak{S}_J = C \oplus \mathfrak{S}_J',$$

where $C$ is the center of the Lie algebra $\mathfrak{S}_J$. The elements of $C$ must commute with those of $\mathfrak{S}_J'$ in the associative multiplication, and hence with all of $\mathfrak{A}$, so

$$C = \mathfrak{S}_J \cap \mathfrak{Z},$$

since $\mathfrak{Z}$ is the center of $\mathfrak{A}$.

We have proved that the Lie algebra of $\mathfrak{G}_J$ is $\mathfrak{S} = \mathfrak{S}_J \oplus K$, and we have

$$\mathfrak{S} = \mathfrak{S}_J' \oplus (\mathfrak{S}_J \cap \mathfrak{Z}) \oplus K.$$

It is known (Albert, [3]) that

$$\mathfrak{Z} = K \oplus (\mathfrak{S}_J \cap \mathfrak{Z}).$$

We have, therefore, $\mathfrak{S} = \mathfrak{S}_J' \oplus \mathfrak{Z}$. It has been shown by Jacobson, [11] (p. 185), that $\mathfrak{G}_J$ is homomorphic to $H$. This can easily be seen to be continuous and the group $\mathfrak{Z}_0$ is the group mapped on the identity. It follows that $H$ has Lie algebra $\mathfrak{S}_J'$. Q.E.D.

Although we have treated these cases separately, the result is the same in each case. Let $L$ be the Lie algebra arising from $\mathfrak{A}$ in one of these three ways. From the results of Jacobson, [12] (pp. 339, 340), and from the fact that all automorphisms of a normal simple algebra are inner, the group $H$ of automorphisms in $L$ induced by inner automorphisms in $\mathfrak{A}$ has Lie algebra $L$ in each case.

Princeton University 


Bibliography 

1. Ado, I. Bull. de la Soc. Physico-Math. de Kazan, 6 (1935).

2. Albert, A. Modern Higher Algebra, U. of Chicago Press, 1937.

3. Albert, A. Structure of Algebras, Am. Math. Soc. Colloquium Publications, vol. XXIV, New York, 1939.

4. Chabauty, C. Sur les équations diophantiennes, etc., Annali di Matematica, 17 (1938), pp. 127-168.

5. Chevalley, C. Sur la théorie du corps de classes, etc., Journal of the Faculty of Science, Tokyo, 2 (1933), pp. 365-474.

6. Hasse, H. Über p-adische Schiefkörper, Math. Annalen, 104 (1931), pp. 495-534.

7. Hausdorff, F. Die symbolische Exponentialformel, etc., Ber. der Ges. der Wiss. zu Leipzig, 58 (1906), pp. 19-48.

8. Jacobson, N. Rational Methods in the Theory of Lie Algebras, these Annals, 36 (1935), pp. 875-881.

9. Jacobson, N. A Class of Normal Simple Lie Algebras of Characteristic Zero, these Annals, 38 (1937), pp. 508-517.

10. Jacobson, N. Abstract Derivation and Lie Algebras, Trans. Am. Math. Soc., 42 (1937), pp. 206-224.

11. Jacobson, N. Simple Lie Algebras of Type A, these Annals, 39 (1938), pp. 181-188.

12. Jacobson, N. Simple Lie Algebras over a Field of Characteristic Zero, Duke Math. J., 4 (1938), pp. 534-551.

13. Landherr, W. Über einfache Liesche Ringe, Hamb. Abhandlungen, 11 (1935), pp. 41-64.

14. Lutz, E. Sur l'équation $y^2 = x^3 - Ax - B$, etc., J. für die reine und angew. Math., 177 (1937), pp. 238-247.

15. Pontrjagin, L. Topological Groups, Princeton, 1939.



Annals of Mathematics
Vol. 43, No. 4, October, 1942


ON THE MODULAR REPRESENTATIONS OF THE SYMMETRIC 

GROUP 

By R. M. Thrall and C. J. Nesbitt 
(Received December 11, 1941) 

1. Introduction 

The purpose of the present paper is to determine all modular (matrix) representations of the symmetric group, $\mathfrak{S}_m$, of degree $m$, where $m < 2p$. The elements of the representing matrices are to be chosen from any field, $\mathfrak{k}$, of characteristic $p$. Every indecomposable (modular) representation of $\mathfrak{S}_m$ is equivalent to a rational one (i.e. to one in which $\mathfrak{k}$ is the prime field) so the nature of the field, $\mathfrak{k}$, is of no particular interest in what follows.

In the last several years considerable progress has been made in the theory of modular representations of finite groups.¹ This theory is especially well worked out in case the order of the group is divisible by only the first power of $p$; hence our requirement $m < 2p$. (Actually we treat here only the cases $p \le m < 2p$ since for $m < p$ the ordinary theory applies, leaving no problem.)

Any representation of a group (or of an algebra) is completely characterized by its indecomposable constituents. In general there are an infinite number of inequivalent indecomposable modular representations of a finite group. One of the main results of the present paper is a proof that the symmetric group of degree less than $2p$ has only a finite number of inequivalent indecomposable representations. In sections 2-4 we determine the structure of the regular representation of $\mathfrak{S}_m$ (or of its $\mathfrak{k}$ group ring $\mathfrak{R}_m$). In section 5 we show how any indecomposable representation can be built up from “elementary modules” (see section 3 for definition and references) of the group ring.

In section 6 we state Nakayama’s results² on the modular representations of $\mathfrak{S}_m$ and add a discussion of the behavior of representations of $\mathfrak{S}_m$ when considered only for elements of $\mathfrak{S}_{m-1}$.

The final three sections are devoted to specific determination of all representations of $\mathfrak{S}_p$, with indications of generalization to $\mathfrak{S}_{p+1}$, $\mathfrak{S}_{p+2}$.

2. Preliminaries 

Let $m = p + l$, $l < p$, and let

$$\tag{1} m = \alpha_1 + 2\alpha_2 + \cdots + m\alpha_m$$

be a partition of $m$. Corresponding to this partition $(\alpha)$ there is a class $C(\alpha)$ of conjugate elements of $\mathfrak{S}_m$ such that if $s \in C(\alpha)$, then $s$ is a product of $\alpha_1$ 1-cycles, $\alpha_2$ 2-cycles, $\cdots$, $\alpha_m$ $m$-cycles. The number of conjugate classes, and hence the number of ordinary irreducible representations, is $P_m$, the number of partitions of $m$.

¹ See the bibliography at the end of the paper.
² See [8], especially part II.

Since $m = p + l$, $\alpha_p$ in (1) may be 0 or 1. The class $C(\alpha)$ is $p$-singular (that is, the order of the elements of $C(\alpha)$ is divisible by $p$) if and only if $\alpha_p = 1$. Then the number of $p$-singular classes is equal to $P_l$. It follows that the number of modular irreducible representations, which is equal to the number of $p$-regular classes, is $P_m - P_l$.³
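The count above can be reproduced by brute force for small cases. The following is a minimal sketch, not taken from the paper: the helper names `partitions` and `class_counts` are hypothetical, and the test "contains a part equal to $p$" for $p$-singularity is valid only under the paper's standing assumption $p \le m < 2p$.

```python
def partitions(n, max_part=None):
    """Yield the partitions of n as non-increasing tuples of parts."""
    if max_part is None:
        max_part = n
    if n == 0:
        yield ()
        return
    for first in range(min(n, max_part), 0, -1):
        for rest in partitions(n - first, first):
            yield (first,) + rest

def class_counts(m, p):
    """Return (P_m, number of p-singular classes, number of p-regular classes).

    Assumes p <= m < 2p, so a class is p-singular exactly when its cycle
    type contains a p-cycle.
    """
    all_parts = list(partitions(m))
    singular = [lam for lam in all_parts if p in lam]
    return len(all_parts), len(singular), len(all_parts) - len(singular)

# m = 7 = 5 + 2: the p-singular count equals P_2, the number of partitions of l = 2.
print(class_counts(7, 5))
print(len(list(partitions(2))))
```

For each admissible pair $(m, p)$ the middle entry should agree with $P_l$ and the last entry with $P_m - P_l$.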

Corresponding to the decomposition of the group ring $\mathfrak{R}_m$ into a direct sum of directly indecomposable invariant subalgebras there exists a classification of the ordinary irreducible and the modular irreducible representations, and their characters, into blocks.⁴ A block $\mathfrak{B}_\tau$ is said to be of type $\beta$ if all the ordinary irreducible representations which belong to $\mathfrak{B}_\tau$ have degrees $\equiv 0 \pmod{p^{\beta}}$, but at least one of these degrees $\not\equiv 0 \pmod{p^{\beta+1}}$. In our case we have just blocks of type 0 or lowest kind, and blocks of highest kind (here of type 1).

Theorem II of [1] states that the number $t_0$ of blocks of lowest kind is equal to the number of $p$-regular classes of conjugate elements where the number of elements in the class is prime to $p$. The number of elements in the class $C(\alpha)$ is $m!/n(\alpha)$ where $n(\alpha) = \alpha_1!\,\alpha_2!\,2^{\alpha_2} \cdots \alpha_m!\,m^{\alpha_m}$ is the order of the normalizer of any element $s$ in $C(\alpha)$. To determine $t_0$ for our case, we count the partitions of $m$ where $\alpha_p = 0$ (that is, $C(\alpha)$ is $p$-regular) and $m!/n(\alpha) \not\equiv 0 \pmod p$. Since for $i > 1$, $\alpha_i$ is less than $p$, and $\alpha_p$ is here 0, then $n(\alpha) \equiv 0 \pmod p$ only if $\alpha_1 \ge p$. One easily sees a 1-1 correspondence between the partitions of $m$ with $\alpha_1 \ge p$, and the partitions of $l$. We thus obtain $t_0 = P_l$.
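As a hedged numerical check of $t_0 = P_l$, the sketch below enumerates cycle types, computes $n(\alpha)$ from the formula in the text, and counts the $p$-regular classes whose size $m!/n(\alpha)$ is prime to $p$. The function names are hypothetical, and the shortcut "no part equal to $p$" for $p$-regularity again assumes $p \le m < 2p$.

```python
from collections import Counter
from math import factorial

def partitions(n, max_part=None):
    """Yield the partitions of n as non-increasing tuples of parts."""
    if max_part is None:
        max_part = n
    if n == 0:
        yield ()
        return
    for first in range(min(n, max_part), 0, -1):
        for rest in partitions(n - first, first):
            yield (first,) + rest

def normalizer_order(lam):
    """n(alpha) = product over i of alpha_i! * i**alpha_i for cycle type lam."""
    order = 1
    for i, alpha_i in Counter(lam).items():
        order *= factorial(alpha_i) * i ** alpha_i
    return order

def blocks_of_lowest_kind(m, p):
    """Count p-regular classes whose size m!/n(alpha) is prime to p."""
    count = 0
    for lam in partitions(m):
        if p in lam:  # p-singular class (valid test for p <= m < 2p)
            continue
        if (factorial(m) // normalizer_order(lam)) % p != 0:
            count += 1
    return count

print(blocks_of_lowest_kind(7, 5))  # compare with P_2
```

The classes that survive are exactly those with at least $p$ fixed points, which is the correspondence with the partitions of $l$ described above.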

We denote by $x_\tau$, $y_\tau$ the numbers of ordinary and of modular irreducible characters which belong to a block $\mathfrak{B}_\tau$. If $\mathfrak{B}_\tau$ is of highest kind $x_\tau = y_\tau = 1$; for $\mathfrak{B}_\tau$ of lowest kind $x_\tau \le y_\tau + 1$.⁵ But, by the above, $\sum x_\tau - \sum y_\tau = P_m - (P_m - P_l) = P_l$, and there are $P_l$ blocks of lowest kind, so the only possibility is that for each block of lowest kind $x_\tau = y_\tau + 1$. We shall show below that $x_\tau = p$ for blocks of lowest kind.

In the theory of modular representations of groups two sets of numbers play leading roles: the decomposition numbers describe the splitting of the ordinary irreducible representations (when taken in the modular sense) into modular irreducible constituents; the Cartan invariants give the multiplicities of the modular irreducible representations as constituents of the indecomposable parts of the modular regular representation. If $D_\tau$, $C_\tau$ denote the matrices of decomposition and Cartan numbers for the block $\mathfrak{B}_\tau$, then a main theorem is that⁶

$$\tag{2} C_\tau = D_\tau' D_\tau.$$


³ Cf. Brauer-Nesbitt [1], §8 for proof that the number of $p$-regular classes is equal to the number of modular irreducible representations.

⁴ Brauer-Nesbitt [1], §9, and Nakayama [9], Theorem 5.

⁵ Brauer-Nesbitt [1], §19, Theorem 5.

⁶ Brauer-Nesbitt [1], §§4, 5 and 9.


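From Brauer’s work (in particular, see Theorem 14 of [2]) we have in our case that for a block of lowest kind

$$\tag{3} D_\tau = \begin{pmatrix}
1 &   &        &        &   \\
1 & 1 &        &        &   \\
  & 1 & 1      &        &   \\
  &   & \ddots & \ddots &   \\
  &   &        & 1      & 1 \\
  &   &        &        & 1
\end{pmatrix}.$$

Here $D_\tau$ has $x_\tau$ rows, $y_\tau$ columns. Then $C_\tau$ has $y_\tau$ rows, $y_\tau$ columns, and from (2), (3)

$$\tag{4} C_\tau = \begin{pmatrix}
2 & 1 &        &        &        &   \\
1 & 2 & 1      &        &        &   \\
  & 1 & 2      & 1      &        &   \\
  &   & \ddots & \ddots & \ddots &   \\
  &   &        & 1      & 2      & 1 \\
  &   &        &        & 1      & 2
\end{pmatrix}.$$

It follows from (4) that $\det C_\tau$ is $y_\tau + 1 = x_\tau$.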
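The passage from (3) to (4) and the value of the determinant can be verified mechanically. This is a small sketch under the assumption that (3) is the $(y+1) \times y$ matrix with ones on the diagonal and the subdiagonal; the function names are hypothetical.

```python
from fractions import Fraction

def decomposition_matrix(y):
    """(y + 1) x y matrix of form (3): row i has ones in columns i - 1 and i."""
    return [[1 if j in (i - 1, i) else 0 for j in range(y)]
            for i in range(y + 1)]

def cartan_matrix(d):
    """C = D' D, as in relation (2)."""
    rows, cols = len(d), len(d[0])
    return [[sum(d[k][i] * d[k][j] for k in range(rows)) for j in range(cols)]
            for i in range(cols)]

def determinant(matrix):
    """Exact determinant by Gaussian elimination over the rationals."""
    a = [[Fraction(v) for v in row] for row in matrix]
    n = len(a)
    det = Fraction(1)
    for col in range(n):
        pivot = next((r for r in range(col, n) if a[r][col] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != col:
            a[col], a[pivot] = a[pivot], a[col]
            det = -det
        det *= a[col][col]
        for r in range(col + 1, n):
            factor = a[r][col] / a[col][col]
            for c in range(col, n):
                a[r][c] -= factor * a[col][c]
    return det

for y in range(1, 7):
    c = cartan_matrix(decomposition_matrix(y))
    print(y, determinant(c))  # the determinant should equal y + 1
```

With $y_\tau = p - 1$ this gives $\det C_\tau = p$, the value used in the next paragraph.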

We consider now the set $M$ of $n(\alpha)$’s corresponding to the $p$-regular classes $C(\alpha)$, and form the product $\prod n(\alpha)$. For each of the $P_l$ partitions $(\alpha)$ with $\alpha_p = 0$, $\alpha_1 \ge p$, $n(\alpha)$ is divisible by $p$, and all other $n(\alpha)$ of the set $M$ are $\not\equiv 0 \pmod p$. Then $p^{P_l}$ is the highest power of $p$ which divides $\prod n(\alpha)$. By Theorem I of [3] the determinant of the complete matrix $C$ of Cartan invariants is then equal to $p^{P_l}$, that is,

$$\det C = \prod_\tau \det C_\tau = p^{P_l}.$$

But for blocks $\mathfrak{B}_\tau$ of highest kind $\det C_\tau = 1$, and for blocks of lowest kind $\det C_\tau = x_\tau$, and there are $P_l$ blocks of lowest kind; hence for each such block $x_\tau = p$. To each block of lowest kind there belong $p$ ordinary irreducible representations and $p - 1$ modular irreducible representations.

It follows also that there are $P_m - pP_l$ blocks of highest kind and that the total number of blocks is $P_m - (p - 1)P_l$.

3. Loewy form 

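Let $\mathfrak{A}$ denote any matrix representation of $\mathfrak{R}_m$. We consider the algebra $\mathfrak{A}$ as a system of linear transformations of a vector space $\mathfrak{V}$ of suitable dimension. Let $\mathfrak{N}, \mathfrak{N}^2, \cdots, \mathfrak{N}^t = (0)$ denote the powers of the radical of $\mathfrak{A}$. We form the upper Loewy series of $\mathfrak{V}$, namely⁷

$$\tag{5} \mathfrak{V} \supset \mathfrak{N}\mathfrak{V} \supset \cdots \supset \mathfrak{N}^{t-1}\mathfrak{V} \supset 0.$$

Here $\mathfrak{N}^{\rho-1}\mathfrak{V}/\mathfrak{N}^{\rho}\mathfrak{V}$ is the unique maximal completely reducible factor group that may be obtained from $\mathfrak{N}^{\rho-1}\mathfrak{V}$ $(\rho = 1, 2, \cdots, t)$. If we adapt the coordinate system in $\mathfrak{V}$ to the series (5), then $\mathfrak{A}$ appears in upper Loewy form,

$$\tag{6} \mathfrak{A} \sim \begin{pmatrix}
\mathfrak{L}_1(\mathfrak{A}) &                              &        &                              \\
*                            & \mathfrak{L}_2(\mathfrak{A}) &        &                              \\
\vdots                       &                              & \ddots &                              \\
*                            & *                            & \cdots & \mathfrak{L}_t(\mathfrak{A})
\end{pmatrix}$$

where the $\mathfrak{L}_i(\mathfrak{A})$ are the upper Loewy constituents of $\mathfrak{A}$, and are completely reducible. $t$ may be called the Loewy length of $\mathfrak{A}$.⁸

⁷ For a discussion of Loewy series see §§2, 5 of [4]. That (5) is the upper series for $\mathfrak{V}$ follows by proving Theorem 12.3A of [4] for vector spaces having $\mathfrak{A}$ as operator system rather than for ideals of $\mathfrak{A}$.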

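We consider now the Loewy form of the indecomposable parts of the regular representation $\mathfrak{R}$ of $\mathfrak{R}_m$. Let $\mathfrak{F}_1, \mathfrak{F}_2, \cdots, \mathfrak{F}_k$ denote the modular irreducible representations of $\mathfrak{R}_m$. It follows from [6] (see, in particular, Theorems 2, 3, and 8) that we may denote the indecomposable parts of $\mathfrak{R}$ by $\mathfrak{U}_1, \cdots, \mathfrak{U}_k$ where, if $\mathfrak{U}_\kappa$ is written in the form (6), $\mathfrak{L}_1(\mathfrak{U}_\kappa) = \mathfrak{L}_t(\mathfrak{U}_\kappa) = \mathfrak{F}_\kappa$:

$$\tag{7} \mathfrak{U}_\kappa = \begin{pmatrix}
\mathfrak{F}_\kappa &        &                     \\
*                   & \ddots &                     \\
*                   & *      & \mathfrak{F}_\kappa
\end{pmatrix}.$$

We have considered here only the upper Loewy forms. We might similarly have discussed the lower Loewy forms. For the indecomposable parts $\mathfrak{U}_\kappa$ of $\mathfrak{R}_m$ (see below) the upper Loewy forms coincide with the lower Loewy forms.

In the following we shall use the notation $c_{\kappa\lambda}$ to denote the multiplicity of $\mathfrak{F}_\lambda$ as constituent of $\mathfrak{U}_\kappa$; $c_{\kappa\lambda}$ is a Cartan invariant.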

4. Elementary modules 

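Let $\mathfrak{R}$ denote the modular regular representation of the group ring $\mathfrak{R}_m$ of $\mathfrak{S}_m$. It is well known that $\mathfrak{R}$ is a faithful representation of $\mathfrak{R}_m$. Then instead of considering elements of $\mathfrak{R}_m$ we shall for the time being speak of the corresponding elements of $\mathfrak{R}$. We assume $\mathfrak{R}$ to be in reduced form, that is

$$\mathfrak{R} = \begin{pmatrix}
\mathfrak{R}_{11} &                   &        &                   \\
\mathfrak{R}_{21} & \mathfrak{R}_{22} &        &                   \\
\vdots            &                   & \ddots &                   \\
\mathfrak{R}_{s1} & \cdots            &        & \mathfrak{R}_{ss}
\end{pmatrix}$$

where the $\mathfrak{R}_{ii}$ denote irreducible constituents of $\mathfrak{R}$. We may further assume that $\mathfrak{R} = \mathfrak{R}^* + \mathfrak{N}$, where $\mathfrak{R}^*$ is the semi-simple algebra obtained from $\mathfrak{R}$ by replacing the $\mathfrak{R}_{ij}$ with $i > j$ by 0, and where $\mathfrak{N}$, the radical of $\mathfrak{R}$, has 0 in place of the $\mathfrak{R}_{ii}$ in the main diagonal. Let, as before, $\mathfrak{F}_1, \cdots, \mathfrak{F}_k$ denote the distinct modular irreducible representations of $\mathfrak{R}_m$, and $f_1, f_2, \cdots, f_k$ their degrees. We mean by $\mathfrak{R}^*_\kappa$ the simple subalgebra of $\mathfrak{R}^*$ which has 0 in place of all $\mathfrak{R}_{ii}$ except those which are equivalent to $\mathfrak{F}_\kappa$. Let $e_\kappa(ij)$ $(i, j = 1, 2, \cdots, f_\kappa)$ be a set of matrix units for $\mathfrak{R}^*_\kappa$. We denote the unit element of the simple algebra $\mathfrak{R}^*_\kappa$ by $e_\kappa$. An element $a$ of $\mathfrak{R}$ we say is of type $(\kappa, \lambda)$ if $e_\kappa a e_\lambda = a$.

⁸ Cf. Brauer, [4], §5.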

In [6] it was shown that a system of primitive elements

$$\tag{8} b_1, b_2, \cdots, b_c, \qquad c = \sum_{\kappa,\lambda} c_{\kappa\lambda},$$

could be chosen so that these and their right and left products with suitable $e_\kappa(ij)$ gave a basis for $\mathfrak{R}$. More fully, for each type $(\kappa, \lambda)$ there are $c_{\kappa\lambda}$ elements $b$ in the set (8) of that type. If $b_\rho$ is of type $(\kappa, \lambda)$ then we take all the elements

$$\tag{9} e_\kappa(i1)\, b_\rho\, e_\lambda(1j) \qquad (i = 1, 2, \cdots, f_\kappa;\ j = 1, 2, \cdots, f_\lambda)$$

for part of our basis of $\mathfrak{R}$. Doing this for each $b_\rho$ we obtain what has been called the Cartan basis of $\mathfrak{R}$. This Cartan basis can be so arranged that the regular representation with respect to this new basis is split into indecomposable and irreducible parts.

The Cartan basis system is the starting point for the definition recently given by W. M. Scott of elementary modules.⁹ An element $a$ of $\mathfrak{R}$, expressed in terms of the Cartan basis elements, will have the form

$$a = \sum h^{\rho}_{ij}(a)\, e_\kappa(i1)\, b_\rho\, e_\lambda(1j).$$

Scott arranges the coefficients $h^{\rho}_{ij}(a)$, for a fixed $\rho$, in a matrix $H^{\rho}(a) = \| h^{\rho}_{ij}(a) \|$. The Abelian additive group generated by the matrices $H^{\rho}(a)$, $a \in \mathfrak{R}$, Scott has called an elementary module. The significance of these for us here is that the representations of $\mathfrak{R}_m$, in particular, the regular representations, may be expressed in simple form by these elementary modules.

We have two cases to consider:

(1) Blocks of highest kind. Let $\mathfrak{B}_\tau$ be a block of highest kind and $\mathfrak{F}_\lambda$ be the unique modular irreducible representation belonging to $\mathfrak{B}_\tau$. Then $c_{\lambda\mu} = 0$ for $\lambda \ne \mu$, $c_{\lambda\lambda} = 1$, and the elements $e_\lambda(ij)$ $(i, j = 1, 2, \cdots, f_\lambda)$ may be taken as the Cartan basis for $\mathfrak{B}_\tau$. The corresponding elementary module is just $\mathfrak{F}_\lambda$. Further, $\mathfrak{F}_\lambda = \mathfrak{U}_\lambda$, where $\mathfrak{U}_\lambda$ is the unique indecomposable part of $\mathfrak{R}$ corresponding to $\mathfrak{F}_\lambda$.

(2) Blocks of lowest kind. Let $\mathfrak{B}_\tau$ be a block of lowest kind, and let us choose our enumeration of the modular irreducible representations so that $\mathfrak{F}_1, \mathfrak{F}_2, \cdots, \mathfrak{F}_{p-1}$ belong to $\mathfrak{B}_\tau$, and further so that the matrix $C_\tau$ of Cartan invariants for $\mathfrak{B}_\tau$ is in the form (4).

Then for $\kappa \ne 1$ or $p - 1$, $c_{\kappa\kappa} = 2$, $c_{\kappa-1,\kappa} = 1 = c_{\kappa+1,\kappa}$, and all other $c_{\lambda\kappa}$ are zero. Corresponding to these invariants there exist primitive elements $e_\kappa(11)$ and $b_{\kappa\kappa}$ of type $(\kappa, \kappa)$, $b_{\kappa-1,\kappa}$ of type $(\kappa - 1, \kappa)$ and $b_{\kappa+1,\kappa}$ of type $(\kappa + 1, \kappa)$.

⁹ Scott, [10].


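The Cartan basis elements

$$\tag{10} \begin{array}{lll}
\text{(a)} & e_\kappa(j1) & j = 1, 2, \cdots, f_\kappa \\
\text{(b)} & e_{\kappa-1}(k1)\, b_{\kappa-1,\kappa} & k = 1, 2, \cdots, f_{\kappa-1} \\
           & e_{\kappa+1}(l1)\, b_{\kappa+1,\kappa} & l = 1, 2, \cdots, f_{\kappa+1} \\
\text{(c)} & e_\kappa(m1)\, b_{\kappa\kappa} & m = 1, 2, \cdots, f_\kappa
\end{array}$$

form a basis for an indecomposable $\mathfrak{R}$-left ideal $\mathfrak{l}_\kappa = \mathfrak{R} e_\kappa(11)$ which is a summand in the expression of $\mathfrak{R}$ as a direct sum of indecomposable left ideals. $\mathfrak{l}_\kappa$ may be considered as the representation space of the indecomposable part $\mathfrak{U}_\kappa$ of $\mathfrak{R}$. The form (7) of $\mathfrak{U}_\kappa$ shows that the Loewy length $t$ of $\mathfrak{U}_\kappa$ is $\ge 3$. It cannot be greater than 3 for then $\mathfrak{N}^3 \mathfrak{l}_\kappa \ne 0$, that is, there would have to exist primitive elements of type $(\kappa, \kappa)$ belonging to $\mathfrak{N}^3$. Then $t$ must equal 3, and again from (7) it follows that $\mathfrak{N}^2 \mathfrak{l}_\kappa$ is generated by the elements (c), $b_{\kappa\kappa}$ belongs to $\mathfrak{N}^2$, and that $b_{\kappa-1,\kappa}$, $b_{\kappa+1,\kappa}$ belong to $\mathfrak{N}$. We may assume the primitive elements so chosen that $b_{\kappa,\kappa-1} b_{\kappa-1,\kappa} = b_{\kappa,\kappa+1} b_{\kappa+1,\kappa} = b_{\kappa\kappa}$ (here $b_{\kappa,\kappa-1}$, $b_{\kappa,\kappa+1}$ correspond to the Cartan invariants $c_{\kappa,\kappa-1} = c_{\kappa,\kappa+1} = 1$). We denote by $\mathfrak{F}_\kappa$, $\mathfrak{H}^{\kappa-1}_{\kappa}$, $\mathfrak{H}^{\kappa+1}_{\kappa}$, $\mathfrak{H}^{\kappa}_{\kappa}$ the elementary modules corresponding to $e_\kappa(11)$, $b_{\kappa-1,\kappa}$, $b_{\kappa+1,\kappa}$ and $b_{\kappa\kappa}$. Then following the method of [6], §3, taking in our basis of $\mathfrak{l}_\kappa$ first the elements (a), then those of (b) and lastly those of (c) we calculate $\mathfrak{U}_\kappa$ to be

$$\tag{11} \mathfrak{U}_\kappa = \begin{pmatrix}
\mathfrak{F}_\kappa              &                                  &                                  &                     \\
\mathfrak{H}^{\kappa-1}_{\kappa} & \mathfrak{F}_{\kappa-1}          &                                  &                     \\
\mathfrak{H}^{\kappa+1}_{\kappa} & 0                                & \mathfrak{F}_{\kappa+1}          &                     \\
\mathfrak{H}^{\kappa}_{\kappa}   & \mathfrak{H}^{\kappa}_{\kappa-1} & \mathfrak{H}^{\kappa}_{\kappa+1} & \mathfrak{F}_\kappa
\end{pmatrix}.$$

We similarly can compute

$$\mathfrak{U}_1 = \begin{pmatrix}
\mathfrak{F}_1       &                      &                \\
\mathfrak{H}^{2}_{1} & \mathfrak{F}_2       &                \\
\mathfrak{H}^{1}_{1} & \mathfrak{H}^{1}_{2} & \mathfrak{F}_1
\end{pmatrix}, \qquad
\mathfrak{U}_{p-1} = \begin{pmatrix}
\mathfrak{F}_{p-1}           &                              &                    \\
\mathfrak{H}^{p-2}_{p-1}     & \mathfrak{F}_{p-2}           &                    \\
\mathfrak{H}^{p-1}_{p-1}     & \mathfrak{H}^{p-1}_{p-2}     & \mathfrak{F}_{p-1}
\end{pmatrix}.$$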

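5. Indecomposable representations of $\mathfrak{R}_m$

We set out now to determine all indecomposable representations of $\mathfrak{R}_m$. A first clue is that the Loewy length $l(\mathfrak{M})$ of any representation $\mathfrak{M}$ of $\mathfrak{R}_m$ is $\le 3$. This follows from our above result that $l(\mathfrak{U}_\kappa) = 1$ or 3 according as $\mathfrak{U}_\kappa$ belongs to a block of highest or of lowest kind, and Theorems 6.6C, 11.5B of [3].

A second simplification comes from observing that $\mathfrak{M}$ may be taken in upper Loewy form and at the same time have its simple parts expressed in terms of the elementary modules. Let us picture $\mathfrak{M}$ in reduced form

$$\mathfrak{M} = \begin{pmatrix}
\mathfrak{M}_{11} &                   &        \\
\mathfrak{M}_{21} & \mathfrak{M}_{22} &        \\
\cdots            & \cdots            & \ddots
\end{pmatrix}$$

and denote by $\mathfrak{M}^*$ the subalgebra of $\mathfrak{M}$ obtained by replacing in $\mathfrak{M}$ the parts $\mathfrak{M}_{ij}$, $j < i$, by 0. It follows from Scott’s results¹⁰ that if in the representation $\mathfrak{M}$ the semisimple algebra $\mathfrak{R}^*$ of $\mathfrak{R}$ is mapped into $\mathfrak{M}^*$, then the simple parts $\mathfrak{M}_{ij}$ are expressible as linear combinations of the elementary modules. Let $\mathfrak{V}$ be the representation space of $\mathfrak{M}$, and suppose the Loewy length of $\mathfrak{M}$ is 3. We take the Loewy series $\mathfrak{V} \supset \mathfrak{N}\mathfrak{V} \supset \mathfrak{N}^2\mathfrak{V} \supset 0$. Here $\mathfrak{N}^2\mathfrak{V}$ may be considered as an $\mathfrak{R}^*$ space, and as such is a direct sum of irreducible $\mathfrak{R}^*$ spaces, $\mathfrak{N}^2\mathfrak{V} = \mathfrak{V}^{(2)}_1 \oplus \cdots \oplus \mathfrak{V}^{(2)}_{r_2}$. Further $\mathfrak{N}\mathfrak{V}$ as an $\mathfrak{R}^*$ space is a direct sum of $\mathfrak{N}^2\mathfrak{V}$ and a complementary space which may also be written as a direct sum of irreducible $\mathfrak{R}^*$ spaces. By continuing the argument, we obtain that as an $\mathfrak{R}^*$ space

$$\mathfrak{V} = \mathfrak{V}^{(0)}_1 \oplus \cdots \oplus \mathfrak{V}^{(0)}_{r_0} \oplus \mathfrak{V}^{(1)}_1 \oplus \cdots \oplus \mathfrak{V}^{(1)}_{r_1} \oplus \mathfrak{V}^{(2)}_1 \oplus \cdots \oplus \mathfrak{V}^{(2)}_{r_2}.$$

Adapting the co-ordinate system to this decomposition of $\mathfrak{V}$, we obtain $\mathfrak{R}^*$ mapped on $\mathfrak{M}^*$ in $\mathfrak{M}$, and simultaneously have $\mathfrak{M}$ in Loewy form.

Let now $\mathfrak{M}$ be a modular indecomposable representation of $\mathfrak{R}_m$ and let $\mathfrak{R}_m = \mathfrak{T}_1 \oplus \mathfrak{T}_2 \oplus \cdots \oplus \mathfrak{T}_s$ denote the decomposition of $\mathfrak{R}_m$ into indecomposable two-sided ideals. Since the representation space may be written

$$\mathfrak{V} = \mathfrak{R}_m\mathfrak{V} = \mathfrak{T}_1\mathfrak{V} \oplus \mathfrak{T}_2\mathfrak{V} \oplus \cdots \oplus \mathfrak{T}_s\mathfrak{V}$$

and $\mathfrak{M}$ is indecomposable, we find that only one $\mathfrak{T}_\sigma\mathfrak{V}$, say $\mathfrak{T}_i\mathfrak{V}$, can be different from zero. This implies that only $\mathfrak{T}_i$ is mapped on something different from zero in the representation $\mathfrak{M}$, and that $\mathfrak{M}$ contains only modular irreducible representations belonging to the block corresponding to $\mathfrak{T}_i$.

We use the Loewy length $l(\mathfrak{M})$ of $\mathfrak{M}$ to distinguish three cases.

1) $l(\mathfrak{M}) = 1$. Then $\mathfrak{M}$ is both completely reducible and indecomposable, and so $\mathfrak{M}$ must be equivalent to a modular irreducible representation $\mathfrak{F}_\kappa$. We observe that if $\mathfrak{M}$ contains a modular irreducible constituent belonging to a block of highest kind then $l(\mathfrak{M}) = 1$.¹¹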

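2) $l(\mathfrak{M}) = 2$. Then the modular irreducible constituents of $\mathfrak{M}$ must belong to a block $\mathfrak{B}_\tau$ of lowest kind. Let $\mathfrak{F}_1, \mathfrak{H}^{1}_{1}, \mathfrak{H}^{2}_{1}, \mathfrak{H}^{1}_{2}, \mathfrak{F}_2, \cdots, \mathfrak{F}_{p-1}, \mathfrak{H}^{p-1}_{p-1}$ denote the elementary modules of $\mathfrak{B}_\tau$; here $\mathfrak{F}_1, \cdots, \mathfrak{F}_{p-1}$ are the modular irreducible representations of $\mathfrak{B}_\tau$, the $\mathfrak{H}^{\kappa+1}_{\kappa}$, $\mathfrak{H}^{\kappa}_{\kappa+1}$ are the elementary modules of $\mathfrak{B}_\tau$ which belong to the first power $\mathfrak{N}$ of the radical, and the $\mathfrak{H}^{\kappa}_{\kappa}$ are those which belong to $\mathfrak{N}^2$. As here the Loewy series for $\mathfrak{M}$ is $\mathfrak{V} \supset \mathfrak{N}\mathfrak{V} \supset (0)$, the $\mathfrak{H}^{\kappa}_{\kappa}$ will not appear in $\mathfrak{M}$. Our notation for the elementary modules is based on the form (4) of the matrix $C_\tau$ of Cartan invariants. From the above considerations, we may suppose that $\mathfrak{M}$ in Loewy form is written (denoting the unit matrix of degree $h$ by $E_h$)

$$\tag{12} \mathfrak{M} = \begin{pmatrix}
E_{h_1} \times \mathfrak{F}_1     &                                   &        &                               &                               &        \\
                                  & E_{h_3} \times \mathfrak{F}_3     &        &                               &                               &        \\
                                  &                                   & \ddots &                               &                               &        \\
A^{2}_{1} \times \mathfrak{H}^{2}_{1} & A^{2}_{3} \times \mathfrak{H}^{2}_{3} &        & E_{h_2} \times \mathfrak{F}_2 &                               &        \\
                                  & A^{4}_{3} \times \mathfrak{H}^{4}_{3} & \ddots &                               & E_{h_4} \times \mathfrak{F}_4 &        \\
                                  &                                   & \ddots &                               &                               & \ddots
\end{pmatrix},$$

the last entries being $A^{p-1}_{p-2} \times \mathfrak{H}^{p-1}_{p-2}$ and $E_{h_{p-1}} \times \mathfrak{F}_{p-1}$, or in a similar form with the even $\mathfrak{F}_\kappa$ in the top Loewy constituent and the odd $\mathfrak{F}_\kappa$ in the bottom constituent. If we assumed that both even and odd constituents $\mathfrak{F}_\kappa$ appear in the same Loewy constituent, then by permutation of the rows and columns in $\mathfrak{M}$ a decomposition of $\mathfrak{M}$ would be obtained.

¹⁰ Cf. Scott, [10].

¹¹ For a modular irreducible representation belonging to a block of highest kind is also an indecomposable part of the regular representation. Then apply Remark 3 of [7].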

From Schur’s lemma it follows that a matrix $P$ which commutes with $\mathfrak{M}$ has the form

$$\tag{13} P = \begin{pmatrix}
P_1 \times E_{f_1} &                    &        &                            \\
                   & P_3 \times E_{f_3} &        &                            \\
                   &                    & \ddots &                            \\
                   &                    &        & P_{p-1} \times E_{f_{p-1}}
\end{pmatrix}$$

where $P_i$ is a square matrix of $h_i$ rows. In addition, in order that $P$ commute with $\mathfrak{M}$ the following relations must be satisfied:

$$\tag{14} \begin{aligned}
A^{2}_{1} P_1 &= P_2 A^{2}_{1} \\
A^{2}_{3} P_3 &= P_2 A^{2}_{3} \\
&\ \,\vdots \\
A^{p-1}_{p-2} P_{p-2} &= P_{p-1} A^{p-1}_{p-2}.
\end{aligned}$$

Here $A^{\rho}_{\sigma}$ has $h_\rho$ rows, $h_\sigma$ columns.

We assume first that all $h_i$ $(i = 1, 2, \cdots, p - 1)$ are different from 0. Then the relations (14), together with the theorem¹² that a matrix $P$ commuting with an indecomposable representation $\mathfrak{M}$ can have just one distinct characteristic root, are sufficient to show that each $h_i = 1$.

In outline the argument is this. We take $P$ with $P_2 = P_3 = \cdots = P_{p-1} = 0$; then the only relation (14) remaining is

$$\tag{15} A^{2}_{1} P_1 = 0.$$

If now the rank of $A^{2}_{1}$ were less than $h_1$ we could find a $P_1$ with characteristic root $\ne 0$ to satisfy (15), contrary to the Schur-Brauer theorem. Then the rank of $A^{2}_{1}$ is $h_1$, and so $h_1 \le h_2$. Now taking $P_3 = P_4 = \cdots = P_{p-1} = 0$, and choosing $P_1$ to satisfy $A^{2}_{1} P_1 = P_2 A^{2}_{1}$, the remaining relation (14) is $P_2 A^{2}_{3} = 0$ and by the same argument as before the rank of $A^{2}_{3}$ is $h_2$, and $h_2 \le h_3$. Next, we take $P_4 = P_5 = \cdots = P_{p-1} = 0$, choose $P_1$, $P_2$ to satisfy the first two relations (14), and consider $A^{4}_{3} P_3 = 0$. We obtain in this manner that

$$h_1 \le h_2 \le h_3 \le \cdots \le h_{p-1}.$$

If, however, we had started at the other end of the relations (14), we would obtain

$$h_{p-1} \le h_{p-2} \le \cdots \le h_1$$

so that $h_1 = h_2 = \cdots = h_{p-1} = c$, say, and further the $A^{\rho}_{\sigma}$ are all of rank $c$ and so are non-singular. It follows that the relations (14) are satisfied when $P_1$ is arbitrary, $P_2 = A^{2}_{1} P_1 (A^{2}_{1})^{-1}$, $P_3 = (A^{2}_{3})^{-1} P_2 A^{2}_{3}$, etc. Then $P_1$ must be of degree 1 (for otherwise $P_1$ could have two different characteristic roots) and so $c = 1$.

¹² Cf. Brauer-Schur, [11].

The second step in this argument requires some elaboration. Since we have seen $A^{2}_{1}$ is of rank $h_1$, we may find non-singular matrices $X$ and $Y$ such that

$$X A^{2}_{1} Y = \begin{pmatrix} E_{h_1} \\ 0 \end{pmatrix} = \bar{A}^{2}_{1},$$

and setting $\bar{P}_1 = Y^{-1} P_1 Y$, $\bar{P}_2 = X P_2 X^{-1}$, we have from

$$\tag{16} \bar{A}^{2}_{1} \bar{P}_1 = \bar{P}_2 \bar{A}^{2}_{1}$$

that $\bar{P}_2$ has form

$$\bar{P}_2 = \begin{pmatrix} \bar{P}_1 & \bar{P}_{12} \\ 0 & \bar{P}_{22} \end{pmatrix}.$$

We take $P_3 = P_4 = \cdots = P_{p-1} = 0$, assume that the rank of $A^{2}_{3}$ is less than $h_2$, and seek $\bar{P}_1$, $\bar{P}_2$ such that (16) is satisfied, and

$$\bar{P}_2 \bar{A}^{2}_{3} = 0$$

where $\bar{A}^{2}_{3} = X A^{2}_{3}$. Under our assumption that the rank of $A^{2}_{3}$ is $< h_2$, we can find a vector $x = (0, 0, \cdots, 0, x_i, x_{i+1}, \cdots, x_{h_2})$, $x_i \ne 0$, which is annihilated by $\bar{A}^{2}_{3}$. Choose $\bar{P}_2$ to have the vector $x$ in its $i$-th row, and zeros in the other rows. Then trace $\bar{P}_2 = x_i$, so that $\bar{P}_2$ has a characteristic root $\ne 0$. Further, for this choice of $\bar{P}_2$, we can always find $\bar{P}_1$ so that (16) is satisfied. Thus our assumption that $A^{2}_{3}$ has rank less than $h_2$ would lead to a matrix $P$ which commutes with $\mathfrak{M}$ and has $x_i$ and 0 for characteristic roots, which gives a contradiction.

For the case that not all $\mathfrak{F}_\kappa$ belonging to the block $\mathfrak{B}_\tau$ appear in the indecomposable representation $\mathfrak{M}$, a study of the relations (14) shows that the $\mathfrak{F}_\kappa$ which do appear have multiplicity 1 and form a sequence

$$\mathfrak{F}_\kappa, \mathfrak{F}_{\kappa+1}, \cdots, \mathfrak{F}_{\kappa+r}.$$





3) $l(\mathfrak{M}) = 3$. Here again the modular irreducible constituents of $\mathfrak{M}$ must belong to a block of lowest kind. We have also here that $\mathfrak{N}^2\mathfrak{V} \ne 0$. Then there is a Cartan basis element $b_{\kappa\kappa} \in \mathfrak{N}^2$ such that $b_{\kappa\kappa}\mathfrak{V} \ne 0$; suppose that $b_{\kappa\kappa}x \ne 0$, $x \in \mathfrak{V}$. Since $b_{\kappa\kappa} = b_{\kappa,\kappa-1} b_{\kappa-1,\kappa}$ we have that $b_{\kappa-1,\kappa}x \ne 0$, similarly $b_{\kappa+1,\kappa}x \ne 0$, and from $b_{\kappa\kappa} = b_{\kappa\kappa} e_\kappa(11)$ we have also that $e_\kappa(11)x \ne 0$. Let, as above, $\mathfrak{l}_\kappa$ denote the indecomposable left ideal generated by the elements (10). Then it may be shown that $\mathfrak{l}_\kappa$ contains no left annihilators of $x$ other than 0, so that

$$a \to ax, \qquad a \in \mathfrak{l}_\kappa$$

is an isomorphic mapping of $\mathfrak{l}_\kappa$ upon the invariant subspace $\mathfrak{l}_\kappa x$ of $\mathfrak{V}$. It follows that $\mathfrak{M}$ contains a constituent equivalent to the indecomposable $\mathfrak{U}_\kappa$ of $\mathfrak{R}$ which corresponds to $\mathfrak{l}_\kappa$. This implies that the indecomposable representation $\mathfrak{M}$ is equal to $\mathfrak{U}_\kappa$.¹³

We gather these results in

Theorem I. An indecomposable representation $\mathfrak{M}$ of $\mathfrak{R}_m$ has Loewy length $l(\mathfrak{M}) \le 3$. If $l(\mathfrak{M}) = 1$, $\mathfrak{M}$ is equivalent to a modular irreducible part of $\mathfrak{R}_m$. For $l(\mathfrak{M}) = 2$, $\mathfrak{M}$ has the form (12) with $h_i = 1$ for a sequence of consecutive values of $i$, and the remaining $h_i = 0$, or $\mathfrak{M}$ is of the similar form but with the positions of the odd and the even constituents reversed. When $l(\mathfrak{M}) = 3$, $\mathfrak{M}$ is equivalent to an indecomposable part of the regular representation of $\mathfrak{R}_m$.

6. Nakayama’s results 

We have seen above that the number of blocks of lowest kind is $P_l$, and that the number of blocks of highest kind is $P_m - pP_l$. Nakayama ([8] II) has shown how to relate blocks and partitions more precisely. In the present section we state his results and give a slight extension of them.

Associated with the partition $(\lambda)$: $\lambda_1 + \cdots + \lambda_k = m$ $(\lambda_1 \ge \cdots \ge \lambda_k > 0)$ of $m$ is a diagram $T$ consisting of $k$ rows of fields, $\lambda_i$ fields in the $i$-th row, the $j$-th elements of the rows being arranged in a column. The field in the $i$-th row and $j$-th column will be denoted by $(i, j)$. By the $(i, j)$-hook $H = H(i, j)$ of $T$ we mean the set of fields $(i, \nu)$ with $\nu \ge j$ together with the fields $(\nu, j)$ with $\nu > i$. We call the number, $h$, of fields in $H$ its length. If just $r$ rows of $T$ contain elements of $H$ we call $r$ the height of $H$. By $T - H$ we mean the diagram $T'$ obtained from $T$ by deleting the fields of $H$ and then moving each field $(i', j')$ with $i' > i$, $j' > j$ one row up and one column to the left.

We next divide the partitions $(\lambda)$ of $m$ into classes according to the following rules. If the diagram, $T$, of $(\lambda)$ has no hook of length $p$, then $(\lambda)$ is put in a class by itself, called a class of highest kind. If $T$ has a hook, $H$, of length $p$ and height $r$, we denote $(\lambda)$ by $\lambda^{r}(\mu)$ where $(\mu)$ is the partition whose diagram is $T - H$. For $r = 1, \cdots, p$ there is exactly one partition $\lambda^{r}(\mu)$ for each partition $(\mu)$ of $l$ $(= m - p)$. The $p$ partitions $\lambda^{r}(\mu)$ defined by $(\mu)$ are put into a class which we call a class of lowest kind. We can now state Nakayama’s results.
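The classification by hooks can be carried out by direct enumeration. The sketch below is illustrative only: the function names are hypothetical, rows and columns are indexed from 0, and `remove_hook` uses the standard row-length description of $T - H$ (rows $i, \dots, i + r - 2$ take the length of the next row minus one, and row $i + r - 1$ is cut to length $j$), which I take to be equivalent to the deleting-and-shifting rule in the text.

```python
def partitions(n, max_part=None):
    """Yield the partitions of n as non-increasing tuples of parts."""
    if max_part is None:
        max_part = n
    if n == 0:
        yield ()
        return
    for first in range(min(n, max_part), 0, -1):
        for rest in partitions(n - first, first):
            yield (first,) + rest

def hooks(lam):
    """Yield (i, j, length, height) for the hook of every field (i, j)."""
    for i, row in enumerate(lam):
        for j in range(row):
            arm = row - j - 1                           # fields to the right
            leg = sum(1 for r in lam[i + 1:] if r > j)  # fields below
            yield i, j, arm + leg + 1, leg + 1

def remove_hook(lam, i, j):
    """Row lengths of the diagram T - H for the (i, j)-hook H."""
    leg = sum(1 for r in lam[i + 1:] if r > j)
    new = list(lam[:i])
    new += [lam[r + 1] - 1 for r in range(i, i + leg)]
    new.append(j)
    new += list(lam[i + leg + 1:])
    return tuple(part for part in new if part > 0)

def classes(m, p):
    """Split the partitions of m into classes of highest and lowest kind.

    Returns (highest, lowest), where lowest maps mu to {height r: lambda^r(mu)}.
    Assumes p <= m < 2p, so a diagram has at most one hook of length p.
    """
    highest, lowest = [], {}
    for lam in partitions(m):
        p_hooks = [(i, j, r) for i, j, h, r in hooks(lam) if h == p]
        if not p_hooks:
            highest.append(lam)
            continue
        i, j, r = p_hooks[0]
        lowest.setdefault(remove_hook(lam, i, j), {})[r] = lam
    return highest, lowest

highest, lowest = classes(4, 3)
print(highest)  # classes of highest kind for m = 4, p = 3
print(lowest)   # the single class of lowest kind, indexed by hook height
```

For each $(\mu)$ the inner dictionary should have exactly the keys $1, \cdots, p$, and the number of classes of highest kind should be $P_m - pP_l$.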


¹³ Cf. Remark 3, Nakayama-Nesbitt, [7].


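Theorem II. Let $\mathfrak{A}(\lambda)$ denote the ordinary irreducible representation of $\mathfrak{S}_m$ defined by $(\lambda)$ and let $\mathfrak{F}(\lambda)$ be the modular representation induced by $\mathfrak{A}(\lambda)$. If $(\lambda)$ belongs to a class of highest kind, then $\mathfrak{F}(\lambda)$ is irreducible and constitutes a block of highest kind. The $p$ representations $\mathfrak{A}(\lambda^r(\mu))$, $r = 1, \dots, p$, are the ordinary irreducible representations belonging to a block of lowest kind, which we accordingly denote by $\mathfrak{B}(\mu)$. We can enumerate the modular irreducible representations $\mathfrak{F}_1(\mu), \dots, \mathfrak{F}_{p-1}(\mu)$ belonging to $\mathfrak{B}(\mu)$ so that

$\mathfrak{F}(\lambda^1(\mu)) \leftrightarrow \mathfrak{F}_1(\mu)$, $\mathfrak{F}(\lambda^2(\mu)) \leftrightarrow \mathfrak{F}_1(\mu) + \mathfrak{F}_2(\mu)$, $\dots$,

$\mathfrak{F}(\lambda^{p-1}(\mu)) \leftrightarrow \mathfrak{F}_{p-2}(\mu) + \mathfrak{F}_{p-1}(\mu)$, $\mathfrak{F}(\lambda^p(\mu)) \leftrightarrow \mathfrak{F}_{p-1}(\mu)$

($\leftrightarrow$ denotes “has same irreducible constituents as” not “is equivalent to”).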
Let $(\mu) = (\mu_1, \dots, \mu_k)$ (where $\mu_1 \ge \mu_2 \ge \cdots \ge \mu_k > 0 = \mu_{k+1} = \cdots$) be a partition of $l$. If $\mu_i > \mu_{i+1}$ we denote by $(\mu \mid i)$ the partition $(\mu_1, \dots, \mu_i - 1, \dots, \mu_k)$ of $l - 1$. In the course of proving the above main theorem Nakayama showed that

07) (x'(m)I 0 = x y (M | 0 

except when (a) i = 1 and My = My+i or (b) i = j — 1 and My = My -1 • (In other 
words successive removal of hooks is almost a commutative process). This fact 
enables us to prove the following corollary$^{14}$ to the main theorem. If we consider a representation $\mathfrak{F}$ of $\mathfrak{S}_m$ only for permutations omitting the letter $m$ we get a representation of $\mathfrak{S}_{m-1}$ which we denote by $\mathfrak{F}^*$.

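Corollary I.

(18) $\mathfrak{F}_j(\mu)^* \leftrightarrow \sum_i \mathfrak{F}_j(\mu \mid i) + \delta_{\mu_j \mu_{j+1}} \mathfrak{F}(\mu_j + p - j,\ \mu_1 + 1, \dots, \mu_{j-1} + 1,\ \mu_{j+1}, \dots, \mu_k)$

The sum is over all $i$ for which $\mu_i > \mu_{i+1}$.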

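Proof. It is well known$^{15}$ that $\mathfrak{A}(\lambda)^* \leftrightarrow \sum_i \mathfrak{A}(\lambda \mid i)$ where the sum extends over all $i$ for which $\lambda_i > \lambda_{i+1}$. Applying Theorem II and equation (17) to the induced modular representation $\mathfrak{F}(\lambda)^*$ for $(\lambda) = \lambda^\rho(\mu)$ we get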

(19) 5p-i(m I i) + Sp(m I i) 

+ ^MpMp+i5(^ P (m) I 1) + ^Mp- 1 m p i5(X P (m) I P - 1), P = 1, • • • p. 

(The sum is over all $i$ for which $\mu_i > \mu_{i+1}$.) We have also from the main theorem that

(20) $\mathfrak{F}_j(\mu) \leftrightarrow \mathfrak{F}(\lambda^j(\mu)) - \mathfrak{F}(\lambda^{j-1}(\mu)) + \cdots + (-1)^{j-1} \mathfrak{F}(\lambda^1(\mu))$

The corollary now follows at once by starring both sides of (20), substituting from (19) in each term on the right hand side and simplifying.


14 Aside from this corollary everything in the present section is a restatement of Nakayama’s results.

15 [13], p. 215.



Observe that the terms on the right hand side of (18) all come from different blocks and so we can strengthen the corollary by the additional statement that $\mathfrak{F}_j(\mu)^*$ is a completely reducible representation of $\mathfrak{S}_{m-1}$.

7. Definitions and notation 

In the next section we will consider the decomposition of representations of $\mathfrak{S}_m$ when considered as representations of various subgroups $\mathfrak{S}_{m-\nu}$. To facilitate this we introduce the following calculus of partitions.

Let $(\lambda)$ be a partition of $m$ (as in §6 above) and let $(i) = (i_1, \dots, i_\nu)$ be a sequence consisting of $\mu_1$ 1's, $\mu_2$ 2's, $\dots$, $\mu_k$ $k$'s. By $(\lambda \mid i) = (\lambda \mid i_1 \cdots i_\nu)$ we mean the partition$^{16}$ $(\lambda_1 - \mu_1, \dots, \lambda_k - \mu_k)$ of $m - \nu$. The sequence $i_1, \dots, i_\nu$ is said to be $(\lambda)$-proper if for each $v$ from 1 to $\nu$ the partition $(\lambda \mid i_1 \cdots i_v)$ of $m - v$ has non-negative terms in non-increasing order. Otherwise $(i)$ is called $(\lambda)$-improper.

In this and the following sections we shall understand by $\mathfrak{A}(\lambda): s \to A(\lambda \,\|\, s)$ Young’s rational semi-normal form$^{17}$ of the irreducible representations of $\mathfrak{S}_m$ defined by $(\lambda)$.

If the sequence $(i)$ is $(\lambda)$-proper we define $\mathfrak{A}(\lambda \mid i)$ to be the corresponding representation $s \to A(\lambda \mid i \,\|\, s)$ of $\mathfrak{S}_{m-\nu}$. If $(i)$ is $(\lambda)$-improper we shall mean by $\mathfrak{A}(\lambda \mid i)$ a zero rowed square matrix. An important property of the semi-normal form is that $\mathfrak{A}(\lambda)$ when considered as a representation of $\mathfrak{S}_{m-\nu}$ (the symmetric group on the first $m - \nu$ letters of $\mathfrak{S}_m$) is already in completely reduced form with the $\mathfrak{A}(\lambda \mid i)$ as diagonal constituents. In other words each $(\lambda)$-proper sequence $(i)$ of length $\nu$ contributes an irreducible constituent. If $(i)$ and $(i')$ are two $(\lambda)$-proper sequences of length $\nu$, then $\mathfrak{A}(\lambda \mid i)$ appears above $\mathfrak{A}(\lambda \mid i')$ if the first nonvanishing difference $i_1 - i'_1, \dots, i_\nu - i'_\nu$ is positive.
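The notion of a proper sequence and the ordering of the diagonal constituents can be checked mechanically. The sketch below is an illustration only: `restrict`, `is_proper` and `proper_sequences` are hypothetical names, and a sequence is a tuple of 1-indexed row numbers.

```python
from itertools import product


def restrict(lam, seq):
    """Subtract one from row i of lam for every entry i of seq."""
    parts = list(lam)
    for row in seq:
        parts[row - 1] -= 1
    return tuple(parts)


def is_proper(lam, seq):
    """True if every initial segment of seq leaves non-negative, non-increasing terms."""
    for v in range(1, len(seq) + 1):
        parts = restrict(lam, seq[:v])
        if min(parts) < 0 or any(a < b for a, b in zip(parts, parts[1:])):
            return False
    return True


def proper_sequences(lam, length):
    """Proper sequences of the given length, in the order of the diagonal constituents."""
    rows = range(1, len(lam) + 1)
    good = [seq for seq in product(rows, repeat=length) if is_proper(lam, seq)]
    # A sequence comes first when the first non-vanishing difference is positive,
    # which is descending lexicographic order.
    return sorted(good, reverse=True)
```

For the hook partition (3, 1, 1), that is $p = 5$ and $\rho = 2$ in the notation $(p - \rho, 1^\rho)$, this yields the four sequences (3, 2), (3, 1), (1, 3), (1, 1) in that order.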

We recall that the rows of $\mathfrak{A}(\lambda)$ are in 1-1 correspondence with the regular diagrams belonging to $(\lambda)$. The representation $\mathfrak{A}(\lambda \mid i)$ occupies the (consecutive) rows whose corresponding diagrams have $m$ in the $i_1$th row, $m - 1$ in the $i_2$th row, $\dots$, $m - \nu + 1$ in the $i_\nu$th row. By $A(\lambda \mid i \mid j \,\|\, s)$ we mean the rectangular submatrix of $A(\lambda \,\|\, s)$ whose rows are those of $\mathfrak{A}(\lambda \mid i)$ and whose columns are those of $\mathfrak{A}(\lambda \mid j)$. By the $\nu$th refinement of $\mathfrak{A}(\lambda)$ we mean that each $A(\lambda \,\|\, s)$ is to be considered as a matrix whose elements are the submatrices $A(\lambda \mid i \mid j \,\|\, s)$.

8. The representations of $\mathfrak{S}_p$

For $m = p$ every $A(\lambda \,\|\, s)$ is $p$-integral. There is just one block $\mathfrak{B} = \mathfrak{B}(0)$ of lowest kind. From Theorem II it follows that $\mathfrak{A}(\lambda)$ belongs to $\mathfrak{B}$ if and only if the diagram of $(\lambda)$ is a hook, i.e. one of the partitions $(p - \rho, 1^\rho)$, $\rho = 0, \dots, p - 1$.


16 Note that $\lambda_j \ge \mu_j$ is not required since for some purposes partitions with negative summands are useful.
17 [14] pp. 217-88 or [12].





Let (X) = (p — p, 10 where 0 < p < p — 1. There are just four (X)-proper 
sequences of length 2. These are (in order) (i 1 ) = (p + 1, p), (i 2 ) = (p + 1,1), 
(t*) = (1, p + 1), (t 4 ) = (1, 1). Note that the diagram of (X 1i”) is a hook of 
length p — 2. We denote by (X0 the partition (p — p — 1, l p-1 )-of p — 2, 
and by gr p the degree of the representation &(X0 of © P _ 2 . In this notation we 
have H(X | i 1 ) = S£(X P_1 ), 3I(X | t 2 ) = 3I(X | i*) = 2l(X p ), and SI(X | i 4 ) = 3I(X P+1 ). 

For elements of <Sp- 2 the non diagonal parts of the second refinement of Sl(X) 
all vanish and along the main diagonal we have 2l(X p-1 )> 2l(X' 1 ), 2t(X p ), 2I(X P+1 ) 
in the order named. Let t r denote the transposition (r — 1, r). Then 
A(\ | i* | i) = 0 for j * k unless (j, k) = (1, 2), (2, 1), (3, 4) or (4, 3);. 
and A(X | i‘ j i k \t P ) = 0 for j ^ A: unless (j, k ) = (2, 3) or (3, 2), the non zero 
parts being scalar; say A (X | i 1 \ i k \t p ) — ajkEjk where E jk is the identity matrix 
of suitable degree. For the a jk we have a n = —1, <*22 = — l/(p — 1), <* 2 a = 
(p 2 — 2p)/(p — l) 2 , <*32 = 1, <*33 = 1/(P — 1), <*44 = !• 

Now consider the induced modular representation 5(X). Since 2l(X) is p-in- 
tegral we are justified in keeping the same refinement notation, i.e. we write 
F(\\s) = || F(X | i 1 1 || where each submatrix of /'’(XjJs) is obtained by con¬ 

sidering the corresponding submatrix of A (X$s) mod p. For s e ©„_i we havfe 

0 0 

F(X | p + 1 $s) 

0 0 

F(X$s) = 

0 0 

F(X| 1J«) 

0 0 

where F(\ | p + l$s) occupies the first two sets of rows and columns and 
F(X | IJs) the last two sets. F(X | p + 1) is (as the notation implies) /1(X | p + 1) 
taken mod p. 

Furthermore, we have 

-E n 0 0 0 

0 Ea 0 0 

0 Ea — Eta 0 

0 0 0 Eu 

Since © p is generated by ® p _i and t v , the forms of these matrices show just 
how to write down the irreducible constituents (3>(0), 5 P +i(0)) of 5(X). (For 
p = 0 and p = p — 1 the situation differs only in that certain indicated parts 
of the refinement do not appear, and of course there are no representations 
3r„(—1) and $J P (p)). Summarizing we see that {$ P (0X8) — {$(p — p, l p-1 $8) if 

—JE7 0 

8 e and F p (0%t p ) = o ” jBjj w ^ ere is of degree g" -1 and En of 
degree g". 

This completes the determination of the irreducible representations of $\mathfrak{S}_p$.


F(x$g = 





But we can get still more information from the above process. For by Theorem I 
it follows that the lower left hand corner (last two sets of rows first two sets of 
columns) of g(A) must be a numerical multiple of $£ +1 (0). We may, and do, 
suppose that the Cartan basis for 9t p was so chosen that this numerical multiple 
(which is obviously not zero) is unity. Then we have /fj +1 (0$s) = 0 if s c © p _i 

and H p p +1 (0^t p ) = (E, 2 of degree g"). 

To calculate .f>£+i(0) we replace the above used form of Young’s semi-normal 
representation by the one generated by the transposes of the matrices -A(X$< r ) 

0 0 '1 

r = 2, • • • , p; and we get ff£+i(0$< p ) = „ j| (i.e. the transpose of 

.C/23 U 11 

Hf'mtp)), and H' +1 (0ls) = 0 if s « S p _i. 

The final step in our program of determining all representations of ©„ is the 
calculation for the elementary modules ^jJ(O) of unmixed type. This calcula¬ 
tion is based upon the form (formula (11) above) of U p (0) and the following two 
facts: (1) <r = 1 and (2) t p commutes with every element of © p _ 2 . It follows 
at once from t\ = 1, H p +1 (0\t r ) = // p+ i(0$< r ) = 0 that H p (Oft r ) = 0 for r < p. 
Since © p _i is generated by U , • • • , f P -i this shows that H p (0fs) = 0 if s « <5 p -i . 
It remains therefore only to calculate H p (0\t p ). 

Considered as a representation of @ p _ 2 , U„(0) takes the form 

5(x p_1 ) 

S(x p ) 

5(x'- 2 ) 

SCx'- 1 ) 

S(x p ) 

sex'- 1 ) 

S(x') 

Since t p commutes with 91 p _ 2 it now follows from Schur’s Lemma that H p (0\t p ) = 
oi E 0 

1 11 r 7 > (E n and E 22 as above). Now apply the condition t 2 p = 1 and 

0 Ct2 

we get ai = — a 2 = 1/2. 

9. Further specific results 

The above methods can be applied without serious additional complications (save in notation) to obtain similar results for $\mathfrak{S}_{p+1}$ and $\mathfrak{S}_{p+2}$. For $\mathfrak{S}_{p+1}$ there is again just one block of lowest kind and the ordinary representations belonging to it are $p$-integral (i.e. when put in rational semi-normal form). For $\mathfrak{S}_{p+2}$ there are two blocks of lowest kind, and although some of the ordinary semi-normal representations belonging to these blocks fail to be $p$-integral, they become so after a simple transformation which does not irreparably upset the refinement into submatrices. But for $\mathfrak{S}_{p+l}$ with $l > 2$ no such simple transformation into $p$-integral form has yet been found.

One might hope to get further information by starting with one of the known integral forms for the irreducible representations of $\mathfrak{S}_m$, rather than with Young’s semi-normal form. The drawback to such a procedure is that these integral forms are not adapted for descent to the subgroups $\mathfrak{S}_{m-\nu}$ of $\mathfrak{S}_m$. So when they are taken mod $p$ their decomposition seems to be almost (if not fully) as difficult as that of the regular representation, and so there is no particular point in using them.

Institute for Advanced Study and
University of Michigan


Bibliography 

1. R. Brauer and C. Nesbitt, On the Modular Characters of Groups, these Annals, (2), vol. 42, (1941), pp. 556-590.

2. R. Brauer, Investigations on Group Characters, these Annals, (4), vol. 42, (1941), pp. 936-958.

3. R. Brauer, On the Cartan Invariants of Groups of Finite Order, these Annals, (1), vol. 42, (1941), pp. 53-61.

4. R. Brauer, On Sets of Matrices with Coefficients in a Division Ring, Trans. Amer. Math. Soc. (3), vol. 49, (1941), pp. 502-548.

5. R. Brauer, On the Representation of Groups of Finite Order, Proceedings of the National Academy of Sciences, vol. 25, (1939), pp. 290-295.

6. C. Nesbitt, On the Regular Representations of Algebras, these Annals, (3), vol. 39, (1938), pp. 634-658.

7. T. Nakayama and C. Nesbitt, Note on Symmetric Algebras, these Annals, (3), vol. 39, (1938), pp. 659-668.

8. T. Nakayama, On Some Modular Properties of Irreducible Representations of a Symmetric Group I, Japanese Journal of Math., vol. XVII, (1940), pp. 89-108; II, Japanese Journal of Math., vol. XVII, (1941), pp. 411-423.

9. T. Nakayama, Some Studies on Regular Representations and Modular Representations, these Annals, (2), vol. 39, (1938), pp. 361-369.

10. W. M. Scott, On Matrix Algebras Over an Algebraically Closed Field, these Annals (2), vol. 43, (1942), pp. 147-160.

11. R. Brauer and I. Schur, Zum Irreduzibilitätsbegriff in der Gruppen linearer homogener Substitutionen, Sitzungsber. Preuss. Akad., 1930, p. 209, §2.

12. R. M. Thrall, On Young’s Semi-normal Representation of the Symmetric Group, Duke Journal, vol. 8, (1941), pp. 611-624.

13. H. Weyl, The Classical Groups, Princeton, 1939.

14. A. Young, Quantitative Substitutional Analysis, Part VI, Proceedings of the London Mathematical Society (2), vol. 34, (1932), pp. 196-230.



Annals of Mathematics 
Vol. 43, No. 4, October, 1942


ON THE DECOMPOSITION OF MODULAR TENSORS (I) 

By R. M. Thrall 
(Received December 11, 1941) 

1. Introduction 

This paper is a companion to the preceding one 1 [5], so we number formulas, 
theorems, etc., consecutively from those in that paper, and preserve the same 
notation unless a change is specifically indicated. 

Let $\mathfrak{V}_1$ be a vector space over a field $\mathfrak{k}$ of characteristic $p$. We are interested in the decomposition of the space $\mathfrak{V}_m$ of all tensors of rank $m$, relative to the Kronecker $m$th power representation $\Pi_m$ of the group $\mathfrak{G}$ of all non-singular linear transformations of $\mathfrak{V}_1$ into itself. In this paper we determine the structure of $\mathfrak{V}_m$ subject to the two limitations: I. $m < 2p$; II. $\mathfrak{k}$ has at least $m$ elements. The first limitation is due to the incomplete state of the theory of modular representations of the symmetric group. The second limitation is less serious, although the decomposition is actually different if $\mathfrak{k}$ has less than $m$ elements. We hope to treat this case in a later paper.

The principal results about $\mathfrak{V}_m$ are contained in Theorems III and VII, together with formula (38). The problem is attacked by exhibiting the enveloping algebra of $\Pi_m$ as the commutator algebra of a certain permutation representation of the symmetric group of degree $m$. A main tool in this process is application of Remark I, below, which states that the order of the commutator algebra of a group of permutation matrices is independent of the underlying field, i.e. is the same in characteristic 0 as in characteristic $p$.

2. The commutator algebra of a monomial group 

An $n$-rowed matrix is called monomial if it has exactly one non-zero element in each row and column. A permutation matrix is a monomial matrix in which each of the $n$ non-zero elements is 1. A diagonal matrix is a monomial matrix whose non-zero terms all lie on the main diagonal.

Let $\mathfrak{A}: s \to A(s) = \|a_{ij}(s)\|$ be a monomial $\mathfrak{k}$-representation of degree $n$ of a group $\mathfrak{G}$; $\mathfrak{k}$ being any field. (We call $\mathfrak{A}$ monomial when each $A(s)$ is monomial.) The set $\mathfrak{B}$ of all $\mathfrak{k}$-matrices $B$ such that

(21) $A(s)B = BA(s)$ for all $s$ in $\mathfrak{G}$

is called the commutator algebra of the representation $\mathfrak{A}$. We are interested in determining the nature and order of $\mathfrak{B}$.

We first treat the case in which 21 is a permutation representation. Suppose 
that 


1 Brackets refer to the bibliography at the end of the paper. 

(22) $A(s):\quad x_1 \to x_{1(s)},\ \dots,\ x_n \to x_{n(s)}$;

then (21) in the form $A(s)BA(s)^{-1} = B$ is equivalent to the equations

(23) $b_{i(s)j(s)} = b_{ij}$, $i, j = 1, \dots, n$, for all $s$ in $\mathfrak{G}$.

We divide the index pairs (i, j) into systems of transitivity according to the 
equivalence relation: 

(24) $(i, j) \sim (i', j') \leftrightarrow$ there is an $s$ in $\mathfrak{G}$ for which $i' = i(s)$, $j' = j(s)$.

It is clear from (23) that $B$ is a commutator of $\mathfrak{A}$ if and only if

(25) $b_{ij} = b_{i'j'}$ whenever $(i, j) \sim (i', j')$.

Lemma I. The order of the commutator algebra of a permutation representation 
(of a finite group) is equal to the number of systems of transitivity in the Kronecker 
square of the representation. 

Proof. We can consider the $n^2$ numbers $x_i y_j$ as coordinates of the general vector in the space on which $\mathfrak{A} \times \mathfrak{A}$ operates. Then it follows from (22) that $A(s) \times A(s)$ sends the column matrix $\|x_i y_j\|$ into the column matrix $\|x_{i(s)} y_{j(s)}\|$. Thus the systems of transitivity under $\mathfrak{A} \times \mathfrak{A}$ are just the same as the systems of transitivity of the index pairs $(i, j)$ defined by (24) above. With a system of transitivity we associate the matrix $B$ having $b_{ij} = 1$ if $(i, j)$ is in the given system and $b_{ij} = 0$ otherwise. The matrices thus constructed are clearly linearly independent; and it follows from (25) that they constitute a basis for $\mathfrak{B}$.
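Lemma I lends itself to a direct check on small groups. In the sketch below, which is an illustration under stated assumptions rather than part of the proof, a permutation is a tuple `g` with `g[i]` the image of index `i` (indices start at 0), the group is given by hypothetical generators, and `pair_orbits`, `basis_matrix` and `commutes` are names introduced only here.

```python
from itertools import product


def pair_orbits(generators, n):
    """Systems of transitivity of the index pairs (i, j) under the generated group."""
    seen, orbits = set(), []
    for start in product(range(n), repeat=2):
        if start in seen:
            continue
        orbit, frontier = {start}, [start]
        while frontier:
            i, j = frontier.pop()
            for g in generators:
                image = (g[i], g[j])
                if image not in orbit:
                    orbit.add(image)
                    frontier.append(image)
        seen |= orbit
        orbits.append(orbit)
    return orbits


def basis_matrix(orbit, n):
    """Matrix with b_ij = 1 on the given system of transitivity and 0 elsewhere."""
    return [[1 if (i, j) in orbit else 0 for j in range(n)] for i in range(n)]


def commutes(g, B):
    """Check the condition b_{i(s) j(s)} = b_{ij} for the permutation g."""
    n = len(g)
    return all(B[g[i]][g[j]] == B[i][j] for i in range(n) for j in range(n))
```

For the symmetric group on three letters, generated by a transposition and a 3-cycle, there are two systems (the diagonal pairs and the off-diagonal pairs), so the commutator algebra has order 2; for the cyclic group of order 3 there are three.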

A monomial matrix $A$ can be represented uniquely as the product $DP$ of a diagonal matrix and a permutation matrix. If $A' = D'P'$ is a second monomial matrix, then the product $A'' = AA' = D''P''$ has $D'' = DPD'P^{-1}$ and $P'' = PP'$. Returning now to the arbitrary monomial representation $\mathfrak{A}$ we write $A(s) = D(s)P(s)$; $D(s) = \|\delta_{ij} d_i(s)\|$. We have just proved that $P(st) = P(s)P(t)$ and so $s \to P(s)$ is a permutation representation of $\mathfrak{G}$ which we call the permutation representation belonging to $\mathfrak{A}$.

The equations (21) are now equivalent to 


(26) $d_i(s)\, b_{i(s)j(s)} / d_j(s) = b_{ij}$ for all $s$ in $\mathfrak{G}$, $i, j = 1, \dots, n$.

We divide the index pairs $(i, j)$ into systems of transitivity according to (24). Consider the subgroup $\mathfrak{H} = \mathfrak{H}(i, j)$ containing all $s$ for which $i = i(s)$, $j = j(s)$. We say that the system of transitivity containing $(i, j)$ is singular if for some $s$ in $\mathfrak{H}$, $d_i(s) \ne d_j(s)$. [A simple computation shows that being singular is a property of the system of transitivity, independent of the representative $(i, j)$ used to test for singularity.] If $(i, j)$ belongs to a singular system we have $b_{ij} = 0$ by (26) and then by transitivity $b_{i'j'} = 0$ for every $(i', j') \sim (i, j)$. However, if $(i, j)$ belongs to a non-singular system then there is a commutator $B$





with $b_{ij} = 1$, and $b_{i'j'} \ne 0$ if and only if $(i', j') \sim (i, j)$. The equations (26) will never lead to relations connecting $b_{ij}$ and $b_{i'j'}$ unless $(i, j) \sim (i', j')$, so if there is any solution with $b_{ij} = 1$, there is one with $b_{ij} = 1$ and $b_{i'j'} = 0$ if $(i', j')$ is not in the same system as $(i, j)$. Since the $A(s)$ form a group, any
equation in (26) connecting b t oo;(o and &,(*),( o is implied by those connecting 
bij to by? with (i', j ') = (i(s) y j(s)) y (i i(t ), j(0), iCV 1 )). Hence our 

problem is reduced to showing that the equations (26) with $b_{ij}$ on the right-hand side, are soluble with $b_{ij} = 1$. We know that for $s$ in $\mathfrak{H}$ they are consistent. If $(i(s), j(s)) = (i(t), j(t))$ then $u = ts^{-1} \in \mathfrak{H}$ and now calculating $d_i(t)$, $d_j(t)$ from the equation $A(t) = A(u)A(s)$, we get $d_i(t)/d_j(t) = d_i(s)/d_j(s)$ which establishes the consistency of the equations. We have now proved

Lemma II. The order of the commutator algebra of a group of monomial matrices is equal to the number of non-singular transitive systems of index pairs.

Let $\mathfrak{k}$ and $\mathfrak{K}$ be any two fields. A group of permutation matrices can be regarded as lying in either $\mathfrak{k}$ or $\mathfrak{K}$, since 1 is an element of any field. Lemma I states that the order of the $\mathfrak{k}$-commutator algebra is equal to the order of the $\mathfrak{K}$-commutator algebra. We shall apply this fact below to the case where one field is of characteristic 0 and the other of characteristic $p$. For future reference we restate this (weaker) form of Lemma I as

Remark I. The order of the commutator algebra of a group of permutation 
matrices is independent of the field of coefficients. 

The analogue to Remark I for monomial groups is not true. For consider the group of order $p$ generated by $A(s) = \begin{Vmatrix} 1 & 0 \\ 0 & \omega \end{Vmatrix}$ where $\omega$ is a $p$th root of unity (in a field of characteristic 0). Since $\omega \equiv 1 \bmod p$, the modular image of this group is generated by $A(s) = \begin{Vmatrix} 1 & 0 \\ 0 & 1 \end{Vmatrix}$. The non-modular commutator algebra has order 2 and the modular one order 4.
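The count in this example can be reproduced from Lemma II. The sketch below makes several assumptions that are not in the text: the group is generated by a single diagonal matrix whose entries are powers $\omega^{a_1}, \dots, \omega^{a_n}$ of a primitive $p$th root of unity, the entries are recorded only through the exponents $a_i$, and `commutator_order_diagonal` is a hypothetical name. For a diagonal group every index pair is its own system of transitivity, and the pair $(i, j)$ is non-singular exactly when the two diagonal entries agree.

```python
def commutator_order_diagonal(exponents, p, modular):
    """Number of non-singular index pairs for the group generated by diag(w**a_i).

    exponents: the integers a_i; p: the prime; modular: True to work mod p,
    where every p-th root of unity becomes 1.
    """
    n = len(exponents)
    count = 0
    for i in range(n):
        for j in range(n):
            if modular:
                equal = True  # w = 1 mod p, so all diagonal entries coincide
            else:
                equal = (exponents[i] - exponents[j]) % p == 0
            count += int(equal)
    return count
```

With exponents (0, 1), that is the matrix with diagonal entries 1 and $\omega$, the count is 2 in characteristic 0 and 4 in characteristic $p$.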


3. Vanishing forms 

Suppose that $\mathfrak{k}$ is the Galois field with $q$ elements. Let $Y_1, Y_2, \dots$ be indeterminates over $\mathfrak{k}$, and let $y_1, y_2, \dots$ be variables whose domain is $\mathfrak{k}$. Consider the ideal in $\mathfrak{k}[Y_1, Y_2, \dots]$ generated by $Y_i^q - Y_i$, $i = 1, 2, \dots$. Modulo this ideal every $\mathfrak{k}$-polynomial $F(Y_1, Y_2, \dots)$ is congruent to a unique $\mathfrak{k}$-polynomial $F^*(Y_1, Y_2, \dots)$ of degree less than $q$ in each $Y_i$, and $F(y_1, y_2, \dots) = 0$ is equivalent to $F^*(Y_1, Y_2, \dots) = 0$. In particular, a $\mathfrak{k}$-polynomial $F(Y_1)$ of degree less than $q$ cannot vanish over $\mathfrak{k}$ (i.e. $F(y_1) = 0$) unless every coefficient is zero.
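The reduction $F \to F^*$ is easy to carry out by machine when $q$ is a prime, so that the field is the integers mod $q$. The sketch below assumes exactly that (prime powers are not handled), represents a polynomial as a dictionary from exponent tuples to integer coefficients, and uses the hypothetical names `reduce_exponent`, `reduce_polynomial` and `vanishes_everywhere`.

```python
from itertools import product


def reduce_exponent(e, q):
    """Exponent of Y after applying Y**q = Y repeatedly."""
    while e >= q:
        e -= q - 1
    return e


def reduce_polynomial(poly, q):
    """Return F* for the polynomial F given as {exponent tuple: coefficient}."""
    reduced = {}
    for exps, coeff in poly.items():
        key = tuple(reduce_exponent(e, q) for e in exps)
        reduced[key] = (reduced.get(key, 0) + coeff) % q
    return {key: c for key, c in reduced.items() if c}


def vanishes_everywhere(poly, q, nvars):
    """Brute-force test whether F(y) = 0 for every point y of the field."""
    for point in product(range(q), repeat=nvars):
        total = 0
        for exps, coeff in poly.items():
            term = coeff
            for y, e in zip(point, exps):
                term *= pow(y, e, q)
            total += term
        if total % q:
            return False
    return True
```

For $q = 3$ the form $Y_1^3 Y_2 - Y_1 Y_2^3$ reduces to the zero polynomial and accordingly vanishes at every point, whereas $Y_1^2$ is already reduced and does not vanish identically.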

Lemma III. There is no non-zero $\mathfrak{k}$-form $P(Y_{ij})$ of degree $m \le q$ in the $n^2$ indeterminates $Y_{ij}$, such that $P(y_{ij}) \det \|y_{ij}\| = 0$ where the $y_{ij}$ are $n^2$ variables whose domain is $\mathfrak{k}$.

Proof. If $m < q - 1$, then $F(Y_{ij}) = P(Y_{ij}) \det \|Y_{ij}\|$ is of degree less than $q$ in each $Y_{ij}$, so $P(Y_{ij}) \ne 0$ implies $F(Y_{ij}) = F^*(Y_{ij}) \ne 0$ and therefore $F(y_{ij}) \ne 0$. This leaves the two cases $m = q - 1$, $m = q$. The proofs for





these are similar, so we treat here only the harder case, $m = q$. Write $P(Y_{ij})$ as a polynomial in $Y_{11}$ with coefficients $P_i = P_i(Y_{ij})$ that are polynomials in $\mathfrak{k}[Y_{12}, \dots, Y_{nn}]$, i.e.

$P(Y_{ij}) = P_0 Y_{11}^m + P_1 Y_{11}^{m-1} + \cdots + P_m$.

We also write

$\det \|Y_{ij}\| = Q_0 Y_{11} + Q_1$.

Then

$F(Y_{ij}) = Q_0 P_0 Y_{11}^{m+1} + (Q_0 P_1 + Q_1 P_0) Y_{11}^m + \cdots + Q_1 P_m$.

If we replace $Y_{11}^{m+1}$ by $Y_{11}^2$ and $Y_{11}^m$ by $Y_{11}$, $F(Y_{ij})$ is replaced by the congruent polynomial (no longer a form)

$F'(Y_{ij}) = (Q_0 P_2 + Q_1 P_1) Y_{11}^{m-1} + \cdots + (Q_0 P_{m-2} + Q_1 P_{m-3}) Y_{11}^3$

$\qquad + (Q_0 P_0 + Q_0 P_{m-1} + Q_1 P_{m-2}) Y_{11}^2 + (Q_0 P_1 + Q_1 P_0 + Q_0 P_m + Q_1 P_{m-1}) Y_{11} + Q_1 P_m$.

Since $F'(Y_{ij})$ is of degree $q - 1$ (or less) in $Y_{11}$ we cannot have $F'(y_{ij}) = 0$ unless the coefficient of each power of $Y_{11}$ becomes zero when $Y_{ij}$ is replaced by $y_{ij}$. For $i > 2$ the coefficient of $Y_{11}^i$ is of degree less than $q$ in each indeterminate and so vanishing for $y_{ij}$ is the same as vanishing for $Y_{ij}$; i.e.

(27) $Q_0 P_i = -Q_1 P_{i-1}$, $i = 2, \dots, m - 2$.

The lemma is trivial if $n = 1$. For $n > 1$, $Q_0$ and $Q_1$ are relatively prime forms of degree > 1. (27) for $i = 2$ requires $P_1 = P_2 = 0$, for otherwise $P_1$, of degree 1, would be divisible by $Q_0$, of degree > 1. Hence, $P_1 = P_2 = \cdots = P_{m-2} = 0$. We can apply the same process to each $Y_{ij}$ and conclude that $F(y_{ij}) = 0$ implies that $P(Y_{ij})$ has no terms of degree different from $m$, 1, 0 in each $Y_{ij}$. But then $F^*(Y_{ij}) = P^*(Y_{ij}) \det \|Y_{ij}\|$ and $F^*(Y_{ij})$ is clearly not zero unless $P(Y_{ij}) = 0$; this completes the proof for $m = q$. Observe that the form $P(Y) = Y_{11}^q Y_{12} - Y_{11} Y_{12}^q$ shows that the lemma is false for $m > q$.

We use Lemma III as a modular substitute for what H. Weyl [6, p. 4] calls the “principle of the irrelevance of algebraic inequalities.”

4. General program 

Henceforth $\mathfrak{k}$ shall be a field of characteristic $p$. We are interested in the decomposition of the Kronecker $m$th power representation $\Pi_m: A \to \Pi_m(A) = A \times \cdots \times A$ ($m$ factors) of the full linear group $\mathfrak{G} = \mathfrak{G}(n, \mathfrak{k})$ of $n$-rowed non-singular $\mathfrak{k}$-matrices. We propose to effect this decomposition by exhibiting the enveloping algebra $\mathfrak{A}_m$ of $\Pi_m$ as the commutator algebra of a certain permutation representation of the symmetric group of degree $m$.

The order of $\mathfrak{A}_m$ is the number of linearly independent monomials of degree $m$ in $N = n^2$ variables $a_{ij}$ which range freely over $\mathfrak{k}$ save for the restriction $\det \|a_{ij}\| \ne 0$. Hence, by Lemma III we have





Lemma IV. If $\mathfrak{k}$ has at least $m$ elements, then the order of $\mathfrak{A}_m$ is $\binom{N+m-1}{m}$, i.e. is equal to the number of linearly independent monomials of degree $m$ in $N$ indeterminates.

Let $x(i_1, \dots, i_m)$ $(i_1, \dots, i_m = 1, \dots, n)$ be the components of an arbitrary vector in the $\mathfrak{k}$-space $\mathfrak{V}_m$ of all tensors of rank $m$; i.e. $\mathfrak{V}_m$ is the representation space [6, pp. 96-98] for the Kronecker $m$th power representation of $\mathfrak{G}$. Let $s$ be the permutation $1 \to 1', \dots, m \to m'$. We define $s$ as a linear operator on $\mathfrak{V}_m$ by the equation $x' = sx$ where $x'(i_1, \dots, i_m) = x(i_{1'}, \dots, i_{m'})$. Let $T_m(s)$ be the matrix which describes this mapping. It is evident that $\mathfrak{T}_m: s \to T_m(s)$ is a permutation representation of degree $n^m$ of $\mathfrak{S}_m$.

We call a $\mathfrak{k}$-matrix of degree $n^m$ bisymmetric if it commutes with every $T_m(s)$. Since the set $\mathfrak{B}_m$ of all bisymmetric $\mathfrak{k}$-matrices is the commutator algebra of a permutation representation, its order will be the same as the order of the corresponding set of all bisymmetric matrices in a field of characteristic 0. This latter order [6, p. 130] is known to be $\binom{N+m-1}{m}$.

It is trivial to verify that $\Pi_m(A)$ is bisymmetric; i.e. $\mathfrak{A}_m \subseteq \mathfrak{B}_m$. Now apply Lemma IV and we see that

Theorem III. If $\mathfrak{k}$ has at least $m$ elements, then the enveloping algebra of the Kronecker $m$th power representation of the full linear group of degree $n$ is the set of all bisymmetric $\mathfrak{k}$-matrices of degree $n^m$.
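The order $\binom{N+m-1}{m}$ with $N = n^2$ can be confirmed for small $n$ and $m$ by counting, as in Lemma I, the systems of transitivity of pairs of index tuples under simultaneous permutation of positions. The sketch below is illustrative only; `bisymmetric_order` is a hypothetical name, and the count relies on the observation that the orbit of a pair $(I, J)$ is determined by the multiset of column pairs $(i_k, j_k)$.

```python
from itertools import product
from math import comb


def bisymmetric_order(n, m):
    """Number of orbits of position permutations on pairs of index tuples of length m."""
    indices = list(product(range(n), repeat=m))
    # Permuting positions of I and J together permutes the column pairs (i_k, j_k),
    # so the sorted list of column pairs is a canonical orbit representative.
    systems = {tuple(sorted(zip(I, J))) for I in indices for J in indices}
    return len(systems)
```

For $n = 2$, $m = 3$ both the orbit count and the binomial coefficient are 20.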

Still paralleling the non-modular theory, our next step is to determine the indecomposable constituents of $\mathfrak{T}_m$. Then we obtain the decomposed form of $\mathfrak{A}_m$, by starting with the commutator algebra of the decomposed form of $\mathfrak{T}_m$.

When this is all accomplished we shall know the structure of the decomposed 
form of n m ; i.e. we shall know the degrees of the irreducible constituents; the 
nature and multiplicities of the indecomposable constituents. But we shall 
still have no direct construction for the representations themselves, or much 
information about the characters of the representations. This is a general diffi¬ 
culty encountered when a representation of a group is studied by determining 
its enveloping algebra as a commutator algebra. The root of this difficulty is 
the lack of criteria for determining which elements of the enveloping algebra 
actually correspond to group elements. Attempts to remedy these deficiencies 
for the present theory are postponed to later papers. We also postpone any 
discussion of the case in which f has less than m elements. 

5. The representations $\mathfrak{T}_m$

We may regard $\mathfrak{T}_m$ as a permutation group whose elements $T_m(s)$ are written on the “letters” $x(i_1, \dots, i_m)$. Two letters $x(i_1, \dots, i_m)$ and $x(j_1, \dots, j_m)$ belong to the same system of transitivity of $\mathfrak{T}_m$ if and only if the integers $j_1, \dots, j_m$ are just the integers $i_1, \dots, i_m$ in some arrangement. We now arrange the basis vectors of $\mathfrak{V}_m$ so that the letters of the several systems of transitivity are brought together. In the language of representation theory, this exhibits the representation $\mathfrak{T}_m$ as the direct sum [6, pp. 19, 20] of its transitive constituents.
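The grouping of letters into systems of transitivity can be written out explicitly for small cases. In this sketch, which is only illustrative, a letter is a tuple of indices from 1 to $n$, and `transitive_systems` and `multinomial` are hypothetical names; the size of each system is compared with the number of distinct arrangements of its indices.

```python
from collections import defaultdict
from itertools import product
from math import factorial


def transitive_systems(n, m):
    """Group the letters x(i_1, ..., i_m) by the multiset of their indices."""
    systems = defaultdict(list)
    for letter in product(range(1, n + 1), repeat=m):
        systems[tuple(sorted(letter))].append(letter)
    return dict(systems)


def multinomial(content):
    """Number of distinct arrangements of the indices in content."""
    counts = defaultdict(int)
    for value in content:
        counts[value] += 1
    size = factorial(len(content))
    for c in counts.values():
        size //= factorial(c)
    return size
```

For $n = 2$, $m = 3$ there are four systems, of sizes 1, 3, 3, 1.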

Let $(\lambda) = (\lambda_1, \dots, \lambda_k)$ denote the partition $m = \lambda_1 + \cdots + \lambda_k$, $\lambda_1 \ge \cdots \ge \lambda_k > 0$, and let $x(i_1, \dots, i_m)$ have the first $\lambda_1$ indices all 1, $\dots$, the last $\lambda_k$ indices all $k$. The permutations $s$ such that $sx(i_1, \dots, i_m) = x(i_1, \dots, i_m)$ constitute a subgroup of $\mathfrak{S}_m$ isomorphic to $\mathfrak{S}(\lambda) = \mathfrak{S}_{\lambda_1} \times \cdots \times \mathfrak{S}_{\lambda_k}$ (here $\times$ denotes group-theoretic direct product). Let $S(\lambda)$ denote the sum of the elements of $\mathfrak{S}(\lambda)$, and suppose that $s_1 = 1, s_2, \dots$ are elements, one from each coset of $\mathfrak{S}(\lambda)$ in $\mathfrak{S}_m$. Then the elements $s_i S(\lambda)$ constitute a $\mathfrak{k}$-basis for the left ideal $\mathfrak{L}(\lambda)$ of $\mathfrak{R}_m$ (the $\mathfrak{k}$ group ring of $\mathfrak{S}_m$) generated by $S(\lambda)$. Left multiplication of $\mathfrak{L}(\lambda)$ by a permutation $s$ merely permutes the basis elements $s_i S(\lambda)$, and so $\mathfrak{L}(\lambda)$ is representation space for a (transitive) permutation representation [3, p. 110] $\mathfrak{T}_{(\lambda)}: s \to T_{(\lambda)}(s)$ of $\mathfrak{S}_m$.

It is obvious that, as left $\mathfrak{R}_m$-space $\mathfrak{L}(\lambda)$ is isomorphic to the $\mathfrak{k}$-space made up of tensors whose only non-zero coordinates are $x(i_1, \dots, i_m)$ and its conjugates $sx(i_1, \dots, i_m)$. Hence $\mathfrak{T}_{(\lambda)}$ is equivalent to a constituent of $\mathfrak{T}_m$; and, conversely, it is clear that every transitive constituent of $\mathfrak{T}_m$ is equivalent to one of the $\mathfrak{T}_{(\lambda)}$. So to know the decomposition of $\mathfrak{T}_m$ we need only know the decomposition of each $\mathfrak{T}_{(\lambda)}$; or equivalently, to write each $\mathfrak{L}(\lambda)$ as a direct sum of indecomposable left ideals of $\mathfrak{R}_m$.

We have $S(\lambda)^2 = n(\lambda)S(\lambda)$ where $n(\lambda) = \lambda_1! \cdots \lambda_k!$. Hence if $\lambda_1$ (and therefore every $\lambda_i$) is less than $p$, the ideal $\mathfrak{L}(\lambda)$ has the idempotent generator $e(\lambda) = S(\lambda)/n(\lambda)$; and so$^2$ $\mathfrak{L}(\lambda) = \mathfrak{R}_m S(\lambda)$ can be written as the direct sum of indecomposable left ideals which are direct summands of $\mathfrak{R}_m$ (i.e. which themselves have idempotent generators). Stating this in the language of representation theory we have

Theorem IV. If $\lambda_1 < p$, $\mathfrak{T}_{(\lambda)}$ is a direct sum of indecomposable constituents of the regular representation of $\mathfrak{S}_m$.
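The relation $S(\lambda)^2 = n(\lambda)S(\lambda)$ and the resulting idempotent can be verified in a small group ring with coefficients mod $p$. The sketch below is a minimal illustration under assumptions not in the text: permutations act on the indices 0, ..., $m - 1$ and are stored as tuples of images, group ring elements are dictionaries from permutations to coefficients, and `compose`, `multiply`, `scale` and `young_sum` are hypothetical names.

```python
from itertools import permutations


def compose(s, t):
    """The permutation 'apply t, then s'."""
    return tuple(s[t[i]] for i in range(len(t)))


def multiply(x, y, p):
    """Product of two group ring elements with coefficients mod p."""
    out = {}
    for s, a in x.items():
        for t, b in y.items():
            st = compose(s, t)
            out[st] = (out.get(st, 0) + a * b) % p
    return {g: c for g, c in out.items() if c}


def scale(x, c, p):
    """Multiply a group ring element by the scalar c mod p."""
    return {g: (c * a) % p for g, a in x.items() if (c * a) % p}


def young_sum(lam):
    """S(lam): the sum of all elements of S_{lam_1} x ... x S_{lam_k}."""
    elements, start = [()], 0
    for part in lam:
        block = range(start, start + part)
        elements = [e + perm for e in elements for perm in permutations(block)]
        start += part
    return {e: 1 for e in elements}
```

With $p = 3$ and $(\lambda) = (2, 1)$ one finds $S^2 = 2S$, and $e = S/2$ satisfies $e^2 = e$; with $(\lambda) = (3)$ the factor $n(\lambda) = 6$ vanishes mod 3 and $S^2 = 0$, so no such idempotent generator arises.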

If $m = p$, Theorem IV covers all but one partition, the exception being $\lambda_1 = p$. But $\mathfrak{T}_{(p)}$ is the identity representation, and so we have established the following theorem for $m = p$:

Theorem V. For $m < 2p$ an indecomposable constituent of $\mathfrak{T}_m$ is either an indecomposable constituent of the regular representation of $\mathfrak{S}_m$, or one of the irreducible representations$^3$ $\mathfrak{F}_1(\mu)$ of $\mathfrak{S}_m$.

Proof. There is nothing to prove for $m < p$. We proceed by an induction on $m$ based upon the already verified case $m = p$. We suppose $p < m < 2p$ and that the theorem is already verified for $m - 1$. Considered only for elements of $\mathfrak{S}_{m-1}$, $\mathfrak{T}_m$ is just $\mathfrak{T}_{m-1}$ repeated $n$ times. Hence, any indecomposable constituent of $\mathfrak{T}_m$ must, when considered only for elements of $\mathfrak{S}_{m-1}$, split into indecomposable constituents of $\mathfrak{T}_{m-1}$.

Reference to Theorem I shows that Theorem V is false only if (I) $\mathfrak{T}_m$ has some $\mathfrak{F}_j(\mu)$, $j > 1$, as an indecomposable direct constituent, or (II) $\mathfrak{T}_m$ has an indecomposable direct constituent of Loewy length 2. We now apply Corollary I to show that either I or II contradicts our induction hypothesis.

The application to I is immediate. For let $(\mu)$ be a partition of $l = m - p$. Then since $m > p$, there will be some $i$ for which $\mu_i > \mu_{i+1}$; and so $\mathfrak{F}_j(\mu)$ in $\mathfrak{T}_m$ would require $\mathfrak{F}_j(\mu \mid i)$ in $\mathfrak{T}_{m-1}$, contrary to our induction hypothesis.

2 See Theorem I [1], p. 9.

3 The notation $\mathfrak{F}_i(\mu)$ is explained in [5], §6.

Let $\mathfrak{V}$ be an indecomposable representation of $\mathfrak{S}_m$ of Loewy length 2, whose irreducible constituents are $\mathfrak{F}_j(\mu)$, $j = j_0, \dots, j_0 + r$, $r \ge 1$. Let $\mathfrak{V}^*$ denote $\mathfrak{V}$ considered only for elements of $\mathfrak{S}_{m-1}$. If $\mathfrak{V}$ is an indecomposable direct constituent of $\mathfrak{T}_m$, then $\mathfrak{V}^*$ is a direct constituent of $\mathfrak{T}_{m-1}$.

By Corollary I, the irreducible constituents of $\mathfrak{V}^*$ will be either of highest kind or of the form $\mathfrak{F}_j(\mu \mid i)$ for $j$ in the range $j_0, \dots, j_0 + r$. Since no $\mathfrak{F}_j(\mu)$ is repeated in $\mathfrak{V}$, it follows from Corollary I that no $\mathfrak{F}_j(\mu \mid i)$ can be repeated in any indecomposable constituent of $\mathfrak{V}^*$; and so $\mathfrak{V}^*$ can contain no indecomposable constituent of Loewy length 3. Since $r \ge 1$ and $m > p$, $\mathfrak{V}^*$ must contain some $\mathfrak{F}_j(\mu \mid i)$ for $j > 1$. This $\mathfrak{F}_j(\mu \mid i)$ must lie in an indecomposable direct constituent of $\mathfrak{V}^*$ of Loewy length 1 or 2. But then $\mathfrak{V}^*$ cannot be a direct constituent of $\mathfrak{T}_{m-1}$, because of our induction hypothesis; hence $\mathfrak{V}$ cannot be a direct constituent of $\mathfrak{T}_m$; i.e. II is impossible. This completes the proof of Theorem V.


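6. The commutator algebra of $\mathfrak{T}_m$

Instead of studying $\mathfrak{T}_m$ itself, we investigate the more general case of any representation which is the sum of any number of the representations $\mathfrak{T}_{(\lambda)}$, repetitions permitted. Let $\mathfrak{V}$ denote the decomposed form of any such representation. We group the constituents of $\mathfrak{V}$ in such a way that

(28) $\mathfrak{V} = \begin{Vmatrix} \mathfrak{V}_1 & & \\ & \mathfrak{V}_2 & \\ & & \ddots \end{Vmatrix}$

where $\mathfrak{V}_\tau$ consists of all the constituents of $\mathfrak{V}$ that belong to the block $\mathfrak{B}_\tau$ of $\mathfrak{R}_m$. The blocks $\mathfrak{B}_\tau$ correspond to two-sided ideals that are minimal direct summands of $\mathfrak{R}_m$. Hence the commutator algebra $\mathfrak{W}$ of $\mathfrak{V}$ is the direct sum of the commutator algebras $\mathfrak{W}_\tau$ of the $\mathfrak{V}_\tau$, i.e.

(29) $\mathfrak{W} = \begin{Vmatrix} \mathfrak{W}_1 & & \\ & \mathfrak{W}_2 & \\ & & \ddots \end{Vmatrix}$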

If $\mathfrak{V}_\tau$ belongs to a block of highest kind, it will just be an $\mathfrak{F}(\lambda)$ repeated, say, $\delta$ times. Then,$^4$ since $\mathfrak{F}(\lambda)$ is a total matrix algebra, $\mathfrak{W}_\tau$ is equivalent to the total $\mathfrak{k}$-matrix algebra of degree $\delta$ repeated $f(\lambda)$ times, where $f(\lambda)$ is the degree of $\mathfrak{F}(\lambda)$.

4 Cf. [6], p. 92.

If $\mathfrak{V}_\tau$ belongs to a block $\mathfrak{B}_\tau = \mathfrak{B}(\mu)$ of lowest kind, the situation is somewhat more complicated. For simplicity in notation we drop all arguments $(\mu)$ from the letters denoting representations and matrices. According to Theorem V, $\mathfrak{V}_\tau$ has the form:


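(30) $\mathfrak{V}_\tau = \begin{Vmatrix} E_{\gamma_0} \times \mathfrak{F}_1 & & & \\ & E_{\gamma_1} \times \mathfrak{U}_1 & & \\ & & \ddots & \\ & & & E_{\gamma_{p-1}} \times \mathfrak{U}_{p-1} \end{Vmatrix}$

where $E_\nu$ is the unit matrix of degree $\nu$ and $\times$ denotes Kronecker product. To obtain uniformity in notation we set $\mathfrak{F}_1 = \mathfrak{U}_0$. Suppose that $W_{ij} U_j(s) = U_i(s) W_{ij}$, for all $s$ in $\mathfrak{S}_m$, and denote by $A_{ij}$ any $\mathfrak{k}$-matrix of $\gamma_i$ rows and $\gamma_j$ columns, $i, j = 0, \dots, p - 1$. Then

(31) $W_\tau = \begin{Vmatrix} A_{00} \times W_{00} & \cdots & A_{0\,p-1} \times W_{0\,p-1} \\ \vdots & & \vdots \\ A_{p-1\,0} \times W_{p-1\,0} & \cdots & A_{p-1\,p-1} \times W_{p-1\,p-1} \end{Vmatrix}$

is an element of $\mathfrak{W}_\tau$; and, conversely, any element of $\mathfrak{W}_\tau$ can be written as a linear $\mathfrak{k}$-combination of elements of the form (31).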

To determine the number hi,- of linearly independent U\; we use the Cartan 
matrix for the block 33 and apply the general theory of intertwining matrices. 6 
The result is that hn = 0 (and therefore Wn = 0) unless j is i — 1, i, or i + 1. 
hoo 1 , 2 , if 1 , ' ' ' , Jp 1 , 1 , i 0 , * * * , 'p 1 . 

Since the A</s are arbitrary this gives 

( 32 ) 70 + 27 o 7 i + 27? + 27172 + • • • + 27 p_i 

= (7o + 7i) 2 + (7i + 72 ) 2 + • • • + 7p-i 


for the order of SB T . 
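
The count behind (32) can be checked numerically. The sketch below is only an illustration: it assumes the intertwining numbers are $h_{00} = 1$, $h_{ii} = 2$ for $i \geq 1$, and $h_{ij} = 1$ for $|i - j| = 1$, and the function names are hypothetical. It compares the sum of $\gamma_i \gamma_j h_{ij}$ with the sum of squares on the right of (32).

```python
import random


def order_from_intertwining_numbers(gamma):
    """Sum of gamma_i * gamma_j * h_ij, with the h_ij assumed as in the lead-in."""
    p = len(gamma)
    total = 0
    for i in range(p):
        for j in range(p):
            if i == j:
                h = 1 if i == 0 else 2
            elif abs(i - j) == 1:
                h = 1
            else:
                h = 0
            total += gamma[i] * gamma[j] * h
    return total


def order_as_sum_of_squares(gamma):
    """Right side of (32), with the convention gamma_p = 0."""
    padded = list(gamma) + [0]
    return sum((padded[i] + padded[i + 1]) ** 2 for i in range(len(gamma)))


random.seed(0)
for _ in range(200):
    p = random.randint(1, 7)
    gamma = [random.randint(0, 5) for _ in range(p)]
    assert order_from_intertwining_numbers(gamma) == order_as_sum_of_squares(gamma)
```

For instance, the multiplicities (1, 2, 3) give 43 on both sides.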

To determine the structure of $\mathfrak{W}_\tau$ we must know the form of all the $W_{ij}$. We subdivide $W_{ij}$ so that the rows of its submatrices are the same as the rows occupied by the irreducible constituents of $\mathfrak{U}_i$ and the columns of the submatrices are the same as the rows occupied by the irreducible constituents of $\mathfrak{U}_j$. See formula (11) for the form of the $\mathfrak{U}_i$. We omit the details of the


5 See, for instance, Theorem 4 [4], p. 648.



DECOMPOSITION OF MODULAR TENSORS




computation. $f_i$ denotes the degree of $\mathfrak{F}_i$; 0 stands for the zero matrix of proper size.


(33) 


Woo — a E/ t > Woi = || oE/ l 0 0 , 


Wio 


0 

0 

aE fl 



aEf i 

0 

0 

0 

, Wu = 

0 


0 

0 


0 

0 

a 

0 


bE u 

0 

0 

aE f . ij 


= 


0 

0 

0 

0 


0 

0 

0 

0 

0 

0 

0 

0 


aE fi 

0 

0 

0 

a Efi+ 1 




, Ww = 





0 

0 

0 

0 

0 

0 

0 

0 

aE/ { 

0 

0 


0 

0 aE/ i+l 

0 


The $a$'s appearing in different matrices $W_{ij}$ are totally unrelated; for $i = 1$ and $i = p - 1$ the rows and columns of the $W_{ij}$ that correspond formally to $\mathfrak{F}_0$ and $\mathfrak{F}_p$ are to be deleted.

After a suitable shifting of rows and columns, 9GB r takes the form 


(34) 


E /t X SB) 1 


E /p . t X SB5'’- 1 | 
where 95)’ is the indecomposable matrix algebra given by 


(35) • 


SB) 1 = 




sr 1 

sr 1 

s: ;+l 

0 g <+l 

i3< 

sLi g; + i g 1 


i = 1, • • •, p - 1. 


(For i = p — 1 the rows and columns of g p are deleted.) Each g’ is a total 
f-matrix algebra of degree y< and the g,‘ are total f-matrix sets of degrees indi¬ 
cated by their positions in SB)*. 

It is of some interest to observe that if (I): ^ 0 implies y, y* 0 for all 

v < i, holds for S3, then the commutator algebra of SB) r is just the enveloping 
algebra of SB r . 


7. Multiplicities of the constituents of $\mathfrak{T}_m$

We continue to regard $\mathfrak{T}(\lambda)$, $\mathfrak{T}_m$ as permutation representations of $\mathfrak{S}_m$ with the given $\mathfrak{k}$ of characteristic $p$ as underlying field. We shall denote by $\mathfrak{T}^0(\lambda)$, $\mathfrak{T}^0_m$ the same permutation representations of $\mathfrak{S}_m$, but now regarded as made up of matrices in a field of characteristic 0. The structure of the representations $\mathfrak{T}^0(\lambda)$, $\mathfrak{T}^0_m$ is well known.$^{6}$ Denote by $\delta(\lambda)$ the multiplicity of $\mathfrak{A}(\lambda)$ in $\mathfrak{T}^0_m$; by $\delta(\lambda \mid \lambda')$ the multiplicity of $\mathfrak{A}(\lambda)$ in $\mathfrak{T}^0(\lambda')$.

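If the diagram of $(\lambda)$ has no hook of length $p$, i.e. if $\mathfrak{F}(\lambda)$ is of highest kind, then $\mathfrak{F}(\lambda)$ occurs $\delta(\lambda)$ times in $\mathfrak{T}_m$ and $\delta(\lambda \mid \lambda')$ times in $\mathfrak{T}(\lambda')$. If the diagram of $(\lambda)$ has a hook of length $p$, say $(\lambda) = \lambda^i(\mu)$, then we set $\delta(\lambda) = \delta_i(\mu)$ and $\delta(\lambda \mid \lambda') = \delta_i(\mu \mid \lambda')$. We let $\gamma_0(\mu), \gamma_1(\mu), \dots, \gamma_{p-1}(\mu)$ denote the multiplicities in $\mathfrak{T}_m$ of $\mathfrak{F}_1(\mu), \mathfrak{U}_1(\mu), \dots, \mathfrak{U}_{p-1}(\mu)$ respectively; and let $\gamma_0(\mu \mid \lambda'), \dots, \gamma_{p-1}(\mu \mid \lambda')$ denote the corresponding multiplicities in $\mathfrak{T}(\lambda')$. In formulas where they appear formally we set $\gamma_p(\mu) = \gamma_p(\mu \mid \lambda) = 0$.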

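We get relations between the $\delta$'s and the $\gamma$'s by counting the multiplicity of the modular irreducible constituents in two ways. Considering $\mathfrak{T}_m$ as the modular representation induced by $\mathfrak{T}^0_m$, it follows from the form of the decomposition matrix$^{7}$ $D(\mu)$ for the block $\mathfrak{B}(\mu)$, that $\mathfrak{F}_i(\mu)$ occurs $\delta_i(\mu) + \delta_{i+1}(\mu)$ times in $\mathfrak{T}_m$. On the other hand, using the Cartan matrix $C(\mu)$ for the block $\mathfrak{B}(\mu)$ we see that $\mathfrak{F}_i(\mu)$ occurs $\gamma_{i-1}(\mu) + 2\gamma_i(\mu) + \gamma_{i+1}(\mu)$ times in $\mathfrak{T}_m$. Hence

(36)  $\delta_i(\mu) + \delta_{i+1}(\mu) = \gamma_{i-1}(\mu) + 2\gamma_i(\mu) + \gamma_{i+1}(\mu)$,  $i = 1, \dots, p - 1$.

The same reasoning applies to $\mathfrak{T}(\lambda)$ giving

(37)  $\delta_i(\mu \mid \lambda) + \delta_{i+1}(\mu \mid \lambda) = \gamma_{i-1}(\mu \mid \lambda) + 2\gamma_i(\mu \mid \lambda) + \gamma_{i+1}(\mu \mid \lambda)$,  $i = 1, \dots, p - 1$.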
We can now state the main theorem on multiplicities:

Theorem VI. If the diagram of $(\lambda)$ has no hook of length $p$ then $\mathfrak{F}(\lambda)$ occurs exactly as often in $\mathfrak{T}_m$ as $\mathfrak{A}(\lambda)$ occurs in $\mathfrak{T}^0_m$. For a block $\mathfrak{B}(\mu)$ of lowest kind we have:

(38)  $\gamma_{p-1}(\mu) = \delta_p(\mu)$
      $\gamma_{p-2}(\mu) = \delta_{p-1}(\mu) - \delta_p(\mu)$
      $\cdots$
      $\gamma_{p-i}(\mu) = \delta_{p-i+1}(\mu) - \delta_{p-i+2}(\mu) + \cdots + (-1)^{i+1}\delta_p(\mu)$.

Only the second part requires additional proof. If we add to equations (36) the single equation

(39)  $\delta_1(\mu) = \gamma_0(\mu) + \gamma_1(\mu)$

then a simple computation shows that (38) is the only solution of the augmented system. Hence to prove our theorem it is sufficient to establish (39). We do this by showing that

(40)  $\delta_1(\mu \mid \lambda) = \gamma_0(\mu \mid \lambda) + \gamma_1(\mu \mid \lambda)$


6 [3], Chapter IV, especially pp. 110, 128, 129; [6], Chapter VII, §§5, 6, 7.
7 [5], formulas (3) and (4).






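for every partition $(\lambda)$ of $m$, and then using the fact that $\mathfrak{T}_m$ is a direct sum of constituents $\mathfrak{T}(\lambda)$. Of course (40) in the presence of (37) leads to the following analogue of (38):

(41)  $\gamma_{p-i}(\mu \mid \lambda) = \delta_{p-i+1}(\mu \mid \lambda) - \delta_{p-i+2}(\mu \mid \lambda) + \cdots + (-1)^{i+1}\delta_p(\mu \mid \lambda)$,  $i = 1, \dots, p$.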
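
The "simple computation" that passes from (36) and (39) to (38) can be illustrated with a short script. This is a sketch under stated assumptions: the list `delta` holds $\delta_1, \dots, \delta_p$, the convention $\gamma_p = 0$ is used, and the function names are hypothetical. It builds the $\gamma$'s by the alternating sums of (38) and confirms that they satisfy the augmented system.

```python
def gammas_from_deltas(delta):
    """Return [gamma_0, ..., gamma_{p-1}] from [delta_1, ..., delta_p] via (38)."""
    p = len(delta)
    gamma = []
    for j in range(p):
        # gamma_j = delta_{j+1} - delta_{j+2} + ... +/- delta_p
        gamma.append(sum((-1) ** k * delta[j + k] for k in range(p - j)))
    return gamma


def satisfies_augmented_system(delta, gamma):
    """Check equation (39) and equations (36) for i = 1, ..., p - 1."""
    p = len(delta)
    g = list(gamma) + [0]  # convention gamma_p = 0
    if delta[0] != g[0] + g[1]:  # (39)
        return False
    for i in range(1, p):  # (36); delta_i is stored at delta[i - 1]
        if delta[i - 1] + delta[i] != g[i - 1] + 2 * g[i] + g[i + 1]:
            return False
    return True


example_delta = [3, 5, 4]
example_gamma = gammas_from_deltas(example_delta)
assert satisfies_augmented_system(example_delta, example_gamma)
```

The check works because the alternating sums telescope: $\gamma_j + \gamma_{j+1} = \delta_{j+1}$ for every $j$, which gives (39) directly and (36) by adding two consecutive relations.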


We arrange the partitions $(\lambda)$ of $m$ and the partitions $(\mu)$ of $l = m - p$ in dictionary order (i.e. $(\lambda)$ precedes $(\lambda')$ if the first non-vanishing difference $\lambda_i - \lambda'_i$ is positive). The verification of (40) is an induction argument, along the following lines: We suppose that (40) has been established for all $\mathfrak{T}(\lambda)$ provided $(\mu)$ precedes a given $(\mu^0)$. Then we exhibit one $(\lambda^0)$ (actually $\lambda^1(\mu^0)$) such that $\mathfrak{F}_1(\mu^0)$ is the only constituent of $\mathfrak{B}(\mu^0)$ which appears in $\mathfrak{T}(\lambda^0)$. Let $N(*)$ denote the order of the commutator algebra of any representation, $*$, of $\mathfrak{S}_m$. Then by Remark I we have

(42)  $N(\mathfrak{V}) = N(\mathfrak{V}^0)$

where $\mathfrak{V}$ stands for any sum of $\mathfrak{T}(\lambda)$ and $\mathfrak{V}^0$ is the sum of the corresponding $\mathfrak{T}^0(\lambda)$. We establish (40) by solving for $\delta_1(\mu^0 \mid \lambda)$ in the equations obtained from (42) by setting $\mathfrak{V} = \mathfrak{T}(\lambda)$, $\mathfrak{T}(\lambda^0)$, $\mathfrak{T}(\lambda) + \mathfrak{T}(\lambda^0)$ in turn.
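
The dictionary order on partitions used in this induction can be made concrete. The following is a minimal sketch with hypothetical names: `precedes` implements the stated rule (the first non-vanishing difference of parts is positive), and `partitions` lists the partitions of a number in that order.

```python
def precedes(lam, lam_prime):
    """True if lam precedes lam_prime in dictionary order."""
    n = max(len(lam), len(lam_prime))
    a = list(lam) + [0] * (n - len(lam))
    b = list(lam_prime) + [0] * (n - len(lam_prime))
    for x, y in zip(a, b):
        if x != y:
            return x > y  # first non-vanishing difference decides
    return False  # equal partitions do not precede one another


def partitions(m, largest=None):
    """Yield the partitions of m as tuples, first partition (m) first."""
    if largest is None:
        largest = m
    if m == 0:
        yield ()
        return
    for first in range(min(m, largest), 0, -1):
        for rest in partitions(m - first, first):
            yield (first,) + rest


ordered = list(partitions(4))
assert all(precedes(ordered[k], ordered[k + 1]) for k in range(len(ordered) - 1))
```

For $m = 4$ the order is (4), (3, 1), (2, 2), (2, 1, 1), (1, 1, 1, 1), so the one-part partition always comes first.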

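An important step in this process is the proof that a suitable $(\lambda^0)$ exists. To accomplish this we analyze the character$^{8}$ of $\mathfrak{T}^0(\lambda')$, where $(\lambda')$ is any partition of $m$, and obtain the following

Lemma V. $\mathfrak{A}(\lambda')$ occurs exactly once as a constituent of $\mathfrak{T}^0(\lambda')$, and $\mathfrak{A}(\lambda)$ cannot be a constituent of $\mathfrak{T}^0(\lambda')$ if $(\lambda')$ precedes $(\lambda)$; i.e.

(43)  $\delta(\lambda' \mid \lambda') = 1$,  $\delta(\lambda \mid \lambda') = 0$ unless $(\lambda)$ precedes $(\lambda')$.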


We observe that if $(\mu^0)$ precedes $(\mu)$ then $\lambda^1(\mu^0)$ precedes $\lambda^1(\mu)$, and for any $(\mu)$, $\lambda^1(\mu)$ precedes $\lambda^i(\mu)$ if $i > 1$. Then from (43) and (37) we get (since the $\gamma$'s and $\delta$'s are non-negative integers),

Lemma VI. Let $(\lambda^0) = \lambda^1(\mu^0) = (\mu^0_1 + p, \mu^0_2, \dots)$. Then (i) if $(\mu^0)$ precedes $(\mu)$, $\mathfrak{T}(\lambda^0)$ contains no constituents from the block $\mathfrak{B}(\mu)$, and (ii) $\mathfrak{F}_1(\mu^0)$ is the only constituent of $\mathfrak{T}(\lambda^0)$ belonging to the block $\mathfrak{B}(\mu^0)$ and it occurs with multiplicity one (i.e. $\gamma_0(\mu^0 \mid \lambda^0) = 1$).

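In §6 we saw that the commutator algebra of a representation $\mathfrak{V} = \sum_\rho \mathfrak{T}(\lambda^\rho)$ can be computed block at a time. Let $N(\mathfrak{V}, (\mu))$ denote the contribution to $N(\mathfrak{V})$ of a block of lowest kind and $N(\mathfrak{V}, (\lambda))$ the same for a block of highest kind. Then (see formula (32)) we have

(44)  $N(\mathfrak{V}, (\mu)) = \sum_{i=1}^{p} \bigl( \sum_\rho [\gamma_{i-1}(\mu \mid \lambda^\rho) + \gamma_i(\mu \mid \lambda^\rho)] \bigr)^2$;
      $N(\mathfrak{V}, (\lambda)) = \bigl( \sum_\rho \delta(\lambda \mid \lambda^\rho) \bigr)^2$

and for the total order

(45)  $N(\mathfrak{V}) = \sum N(\mathfrak{V}, (\mu)) + \sum N(\mathfrak{V}, (\lambda))$,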


8 [2], pp. 71, 94; [3], p. 110; [6], p. 205.





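where the first sum is over all partitions $(\mu)$ of $l$ and the second sum is over all partitions $(\lambda)$ of $m$ whose diagrams have no hook of length $p$.

For the corresponding non-modular representation $\mathfrak{V}^0$ we have analogously

$N(\mathfrak{V}^0, (\lambda)) = \bigl( \sum_\rho \delta(\lambda \mid \lambda^\rho) \bigr)^2$,

and $N(\mathfrak{V}^0) = \sum N(\mathfrak{V}^0, (\lambda))$, the sum extending over all partitions $(\lambda)$ of $m$. For comparison with $N(\mathfrak{V})$ it is convenient to group together those $N(\mathfrak{V}^0, (\lambda))$ for which the representations $\mathfrak{F}(\lambda)$ belong to the same blocks. Thus we obtain

(46)  $N(\mathfrak{V}^0) = \sum N(\mathfrak{V}^0, (\mu)) + \sum N(\mathfrak{V}^0, (\lambda))$

where the summation ranges are the same as in (45) and

(47)  $N(\mathfrak{V}^0, (\mu)) = \sum_{i=1}^{p} N(\mathfrak{V}^0, \lambda^i(\mu)) = \sum_{i=1}^{p} \bigl( \sum_\rho \delta_i(\mu \mid \lambda^\rho) \bigr)^2$.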

By Remark I the difference $N(\mathfrak{V}^0) - N(\mathfrak{V})$ is zero. Observe that the $N(\mathfrak{V}^0, (\lambda))$ and $N(\mathfrak{V}, (\lambda))$ from (45) and (46) cancel. Furthermore, by our induction hypothesis, (40) holds for any $(\mu)$ which precedes $(\mu^0)$; this in turn implies that $N(\mathfrak{V}^0, (\mu)) = N(\mathfrak{V}, (\mu))$ for any $(\mu)$ which precedes $(\mu^0)$. Hence we have

(48)  $0 = \sum \bigl[ N(\mathfrak{V}^0, (\mu)) - N(\mathfrak{V}, (\mu)) \bigr]$

where the summation extends over all $(\mu)$ (including $(\mu^0)$) which do not precede $(\mu^0)$.

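Now let $(\lambda)$ be any partition of $m$. By (43), (47), $N(\mathfrak{T}^0(\lambda) + \mathfrak{T}^0(\lambda^0), (\mu)) = N(\mathfrak{T}^0(\lambda), (\mu))$ if $(\mu)$ follows $(\mu^0)$. By Lemma VI and (44), $N(\mathfrak{T}(\lambda) + \mathfrak{T}(\lambda^0), (\mu)) = N(\mathfrak{T}(\lambda), (\mu))$ if $(\mu)$ follows $(\mu^0)$. Hence if we subtract (48) for $\mathfrak{V} = \mathfrak{T}(\lambda)$ from (48) for $\mathfrak{V} = \mathfrak{T}(\lambda) + \mathfrak{T}(\lambda^0)$ the only terms which do not cancel are those involving $(\mu^0)$. This leads us to

(49)  $N(\mathfrak{T}^0(\lambda) + \mathfrak{T}^0(\lambda^0), (\mu^0)) - N(\mathfrak{T}^0(\lambda), (\mu^0)) = N(\mathfrak{T}(\lambda) + \mathfrak{T}(\lambda^0), (\mu^0)) - N(\mathfrak{T}(\lambda), (\mu^0))$.

Now substituting in this from (44) and (47) and referring to (43) and Lemma VI for the values of $\delta_i(\mu^0 \mid \lambda^0)$, $\gamma_i(\mu^0 \mid \lambda^0)$ we have

$[(\delta_1(\mu^0 \mid \lambda) + 1)^2 + \delta_2(\mu^0 \mid \lambda)^2 + \cdots + \delta_p(\mu^0 \mid \lambda)^2] - [\delta_1(\mu^0 \mid \lambda)^2 + \cdots + \delta_p(\mu^0 \mid \lambda)^2]$
$= [(\gamma_0(\mu^0 \mid \lambda) + \gamma_1(\mu^0 \mid \lambda) + 1)^2 + (\gamma_1(\mu^0 \mid \lambda) + \gamma_2(\mu^0 \mid \lambda))^2 + \cdots + \gamma_{p-1}(\mu^0 \mid \lambda)^2]$
$\quad - [(\gamma_0(\mu^0 \mid \lambda) + \gamma_1(\mu^0 \mid \lambda))^2 + \cdots + \gamma_{p-1}(\mu^0 \mid \lambda)^2]$

or

$2\delta_1(\mu^0 \mid \lambda) + 1 = 2(\gamma_0(\mu^0 \mid \lambda) + \gamma_1(\mu^0 \mid \lambda)) + 1$

which is the same as (40). Observe that the argument above applies to the first partition, $(\mu) = (l)$ of $l$; hence our induction is complete and Theorem VI is fully established.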





In words, (49) says that the change in order of the commutator algebra, due to addition of one particular permutation representation to another particular permutation representation, is the same, block by block, in characteristic 0 as in characteristic $p$. Remark I states merely that the total change is the same.
If we could strengthen Remark I to a block by block form, the proof of the above 
theorem could be much shortened, as then one could omit everything between 
Lemma VI and formula (49); and of course any such improvement of Remark I 
would be of interest in the general modular theory entirely aside from its ap¬ 
plication here. 


8. The Kronecker $m$th power of the full linear group


In order to describe the structure of the enveloping algebra $\mathfrak{A}_m$ of $\Pi_m$ we have only to put together the results of the preceding sections and introduce a suitable notation for the constituents.

Let $(\lambda)$ be a partition of $m$ whose diagram has no hook of length $p$. Then we denote by $\mathfrak{G}(\lambda)$ the total $\mathfrak{k}$-matrix algebra of degree $\delta(\lambda)$. Let $(\mu)$ be a partition of $l = m - p$. Then we denote by $\mathfrak{G}_i(\mu)$ the total $\mathfrak{k}$-matrix algebra of degree$^{9}$ $\gamma_i(\mu)$, and by $\mathfrak{G}_i^j(\mu)$, for $j = i - 1, i, i + 1$, the set of all $\gamma_i(\mu)$ by $\gamma_j(\mu)$ $\mathfrak{k}$-matrices; all this for $i = 0, \dots, p - 1$, with the usual conventions for $i = 0$, $i = p - 1$ (i.e., $j < 0$, $j > p - 1$ are excluded). Finally, we set


(50) UKm) = 


®,(m) 



i(m) 

®; +1 u) 

0 

@’(m) 

®Um) 


ll 

©i>i(m) 

@;-+i(m) ©Km); 


i = 1, • • • , P “ 1 


We can now state the main theorem on the structure of $\mathfrak{A}_m$.

Theorem VII. For $m < 2p$ and $\mathfrak{k}$ any field (of characteristic $p$) containing at least $m$ elements, the enveloping algebra $\mathfrak{A}_m$, of the Kronecker $m$th power representation $\Pi_m$, of the full linear group $\mathfrak{G}$, of $n$-rowed non-singular $\mathfrak{k}$-matrices, has the following indecomposable (direct) constituents: (i) If $(\lambda)$ is any partition of $m$ whose diagram has no hook of length $p$, then $\mathfrak{G}(\lambda)$ appears $f(\lambda)$ times as an indecomposable constituent of $\mathfrak{A}_m$; where $f(\lambda)$ is the degree of the ordinary irreducible
representation of @ w defined by (X). (ii) If (m) is any partition of l = m — p, 
then IT(m) appears /»(m) times as an indecomposable constituent of 2l m , 
i = 1, -' •, p — 1, where fi{y) is the degree of the irreducible modular representation 

5Km) of @ m . 

The theorem follows from Theorem III, the formulas of §6, and Theorem VI. The degrees $f(\lambda)$ are well known [6, p. 213], and for the$^{10}$ $f_i(\mu)$ we have

(51)  $f_i(\mu) = f(\lambda^i(\mu)) - f(\lambda^{i-1}(\mu)) + \cdots + (-1)^{i+1} f(\lambda^1(\mu))$


9 See formula (38) for the value of $\gamma_i(\mu)$.
10 Cf. [5], formula (20).





where$^{11}$ $\lambda^j(\mu)$ is the partition of $m$ whose diagram $T$ has a hook $H$, of length $p$ and height (vertical length) $j$, such that $T - H$ is the diagram of $(\mu)$.
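
Formula (51) can be tried on the simplest case. The sketch below makes two assumptions that are not in the text: the degrees $f(\lambda)$ are computed by the hook length formula (a standard fact about the symmetric group), and $(\mu)$ is taken to be the empty partition, so that $m = p$ and $\lambda^j(\mu)$ is the hook-shaped partition $(p - j + 1, 1, \dots, 1)$ with $j$ rows. All function names are hypothetical.

```python
from math import factorial


def degree(lam):
    """f(lam): degree of the ordinary irreducible representation (hook length formula)."""
    columns = [sum(1 for part in lam if part > c) for c in range(lam[0])]
    product = 1
    for r, part in enumerate(lam):
        for c in range(part):
            arm = part - c - 1          # cells to the right in the same row
            leg = columns[c] - r - 1    # cells below in the same column
            product *= arm + leg + 1
    return factorial(sum(lam)) // product


def hook_partition(p, j):
    """lambda^j for the empty partition mu: a hook of length p and height j."""
    return (p - j + 1,) + (1,) * (j - 1)


def modular_degree(p, i):
    """f_i by formula (51), for mu empty: alternating sum of ordinary degrees."""
    return sum((-1) ** (i - j) * degree(hook_partition(p, j)) for j in range(1, i + 1))


p = 5
ordinary = [degree(hook_partition(p, j)) for j in range(1, p + 1)]
modular = [modular_degree(p, i) for i in range(1, p)]
```

For $p = 5$ the ordinary degrees of the five hooks are 1, 4, 6, 4, 1, and (51) gives 1, 3, 3, 1 for $i = 1, \dots, 4$. The full alternating sum over all $p$ hooks is zero.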

Any indecomposable constituent of ?l m affords an indecomposable representa¬ 
tion of ©. We shall consider the symbols ©(X), IT(m), ©<(/*) in two ways: 
first, as they are defined above; and second, as denoting representations of ®; 
for instance @(X) is the representation A —► G(\\A), where G(X&A) is the matrix 
in @(X) assigned to n m (A) considered as an element of 3l w . We define the 
matrices U\n\A) y G\(p\A) analogously. Then the ®(X), ©,•(/*) are all the ir¬ 
reducible representations of © which are induced in the space of tensors of rank m. 

Institute for Advanced Study 


Bibliography 

[1] M. Deuring, Algebren, Berlin, 1935.

[2] D. E. Littlewood, The Theory of Group Characters, Oxford, 1940.

[3] F. Murnaghan, The Theory of Group Representations, Baltimore, 1938.

[4] C. J. Nesbitt, On the Regular Representations of Algebras, these Annals, vol. 39 (1938), pp. 634-658.

[5] R. M. Thrall and C. J. Nesbitt, On the Modular Representations of the Symmetric Group, these Annals, vol. 43 (1942), pp. 656-670.

[6] H. Weyl, The Classical Groups, Princeton, 1939.


11 See [5], §6, for a discussion of hooks and for references to Nakayama’s treatment. 



Annals of Mathematics
Vol. 43, No. 4, October, 1942 


NON-ASSOCIATIVE ALGEBRAS 1 
I. Fundamental Concepts and Isotopy 

By A. A. Albert 
(Received January 5, 1942) 

1. Introduction 

The study of non-associative algebras has already yielded much of interest 
and importance. Indeed those special theories 2 in which the associative law is 
replaced by a substitute have each been of an extent and of an interest almost 
comparable to that of the theory of associative algebras. 

The results on non-associative algebras in which one does not assume a type 
of partial associativity 3 have almost 4 all been of a rather primitive kind and have 
been scattered through the literature. They have, in particular, not emphasized 
adequately the important fact that many of the properties of arbitrary linear 
algebras are equivalent to certain properties of related sets of linear transforma¬ 
tions in which multiplication then does satisfy the associative law. The fact 
that there is a rather surprisingly large number of non-associative algebras of 
orders two and three has been noted 3 but it has not been recognized before that 
this is at least partly due to the undesirable narrowness of the concept of equiva¬ 
lence for algebras other than associative algebras with a unity quantity. 

It is the purpose of that part of the study of non-associative algebras which we shall begin here to emphasize the facts noted above by providing an appropriate formulation of the fundamentals of the theory of arbitrary linear algebras. Thus we shall devote the first portion of our present discussion to the process of relating the elementary properties of an algebra $\mathfrak{A}$ to the corresponding properties of three attached linear spaces of linear transformations on $\mathfrak{A}$. We shall then introduce the concept of isotopy of algebras, an extension of the concept of equivalence which coincides with the latter concept in the case of associative algebras with a unity quantity. Our discussion will conclude with an extensive


1 Presented to the Society September 5, 1941. Most of the results of this paper were 
announced also in lectures at Princeton and Harvard in March 1941. 

2 We refer here first to the theory of alternative algebras for which see M. Zorn, Theorie der alternativen Ringe, Hamburg Abh., vol. 8 (1930), pp. 123-47, Alternativkörper und quadratische Systeme, loc. cit., vol. 9 (1933), pp. 395-402. A second such theory is that of Lie algebras for which see N. Jacobson, Simple Lie algebras over a field of characteristic zero, Duke J., vol. 4 (1938), pp. 534-51. Finally Jordan algebras are described in P. Jordan, J. v. Neumann, and E. Wigner, On an algebraic generalization of the quantum mechanical formalism, these Annals, vol. 35 (1934), pp. 29-64. The articles quoted contain bibliographies of their subjects and it should be remarked here that the theory of Jordan algebras has been generalized by G. Kalisch in his Chicago doctoral dissertation.

3 Cf. L. E. Dickson, Linear algebras with associativity not assumed, Duke J., vol. 1 (1935), pp. 113-25.

4 For a paper not of this type see footnote 6. 



consideration of the question as to what properties of an algebra are preserved 
when we pass to an isotope. 

It is the author’s hope that the study begun here may ultimately lead to a 
solution of the problem of determining all simple algebras with a unity quantity, 
at least in a sense like that in which we say that the corresponding problem for 
associative algebras has been solved. A fundamental part of the associative 
algebra theory consisted of the definition of the known types of algebras, that is, 
the cyclic algebras and the crossed products, and such definitions will also be 
required for the non-associative case. We shall provide at least a very extensive 
part of this requirement in Part II of the present study. There we shall define$^{5}$
a class of non-commutative simple algebras with a unity quantity containing all 
such algebras which have been considered thus far in the literature as well as a 
very rich variety of new types. 

2. The multiplication spaces of an algebra 

A linear algebra of order $n$ over a field $\mathfrak{F}$ is, in particular, a linear space of order $n$ over $\mathfrak{F}$. But all linear spaces of the same order are equivalent. Thus we may regard all algebras of the same order as having the same quantities but with different laws for forming products. In particular we may take our quantities to be vectors, that is, one by $n$ matrices

(1)  $a = (\alpha_1, \dots, \alpha_n)$  ($\alpha_i$ in $\mathfrak{F}$).

An algebra $\mathfrak{A}$ now consists of the linear space $\mathfrak{L}$ of all the vectors (1) together with a set of $n^3$ quantities $\gamma_{ijk}$ in $\mathfrak{F}$ such that the product

(2)  $u = a \cdot x = (\alpha_1, \dots, \alpha_n) \cdot (\xi_1, \dots, \xi_n) = (\mu_1, \dots, \mu_n)$

of any two quantities of $\mathfrak{L}$ is defined in $\mathfrak{A}$ by

(3)  $\mu_k = \sum_{i,j=1}^{n} \gamma_{ijk}\,\alpha_i\,\xi_j$  ($k = 1, \dots, n$).

Define $\Gamma^{(j)}$ to be the $n$-rowed square matrix with $\gamma_{ijk}$ in the $i$th row and $k$th column, and write

(4)  $\Gamma_x = \Gamma^{(1)}\xi_1 + \cdots + \Gamma^{(n)}\xi_n$,

so that $\Gamma^{(j)} = \Gamma_{e_j}$ where $e_j$ is given by (1) with $\alpha_j = 1$ and all the other $\alpha_i = 0$. The customary row by column definition of a matrix product does not include the definition of $ax$ and so $a \cdot x \neq ax$. However it is clear that

(5)  $a \cdot x = a\Gamma_x$,

where $a\Gamma_x$ is computed as usual. Matrix multiplication is associative and so

(6)  $(a \cdot x) \cdot y = (a\Gamma_x)\Gamma_y = a(\Gamma_x\Gamma_y)$.

5 This definition has already been presented by the author in a lecture at the University of Cincinnati, November 15, 1941. See also the author's paper on Quadratic forms permitting composition, these Annals, this volume.





But

(7)  $a \cdot (x \cdot y) = a\Gamma_u$,  $u = x \cdot y = x\Gamma_y$,

and $\mathfrak{A}$ is associative if and only if $\Gamma_x\Gamma_y = \Gamma_u$ for every $x$ and $y$. We shall obtain another criterion later.
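
The passage from structure constants to the matrix of a right multiplication, and the associativity criterion $\Gamma_x\Gamma_y = \Gamma_{x \cdot y}$, can be tried on two small algebras. The following is a sketch only, with hypothetical names throughout: structure constants are stored as a nested list `gamma[i][j][k]`, the complex numbers over the reals (basis 1, i) serve as an associative example, and the vector cross product on three-dimensional space serves as a non-associative one. Since both sides of the criterion are bilinear in $x$ and $y$, it is enough to test basis vectors.

```python
def right_matrix(gamma, x):
    """Matrix Gamma_x: entry (i, k) is the sum over j of gamma[i][j][k] * x[j]."""
    n = len(x)
    return [[sum(gamma[i][j][k] * x[j] for j in range(n)) for k in range(n)]
            for i in range(n)]


def vec_mat(a, m):
    """Row vector times matrix."""
    n = len(a)
    return [sum(a[i] * m[i][k] for i in range(n)) for k in range(n)]


def mat_mul(a, b):
    n = len(a)
    return [[sum(a[i][t] * b[t][k] for t in range(n)) for k in range(n)]
            for i in range(n)]


def product(gamma, a, x):
    """The algebra product a . x, computed as a * Gamma_x."""
    return vec_mat(a, right_matrix(gamma, x))


def is_associative(gamma):
    """Test Gamma_x Gamma_y == Gamma_{x . y} on all pairs of basis vectors."""
    n = len(gamma)
    basis = [[1 if t == s else 0 for t in range(n)] for s in range(n)]
    for x in basis:
        for y in basis:
            lhs = mat_mul(right_matrix(gamma, x), right_matrix(gamma, y))
            rhs = right_matrix(gamma, product(gamma, x, y))
            if lhs != rhs:
                return False
    return True


def levi_civita(i, j, k):
    """Sign of the permutation (i, j, k) of (0, 1, 2), or 0 if an index repeats."""
    return (i - j) * (j - k) * (k - i) // 2


# Complex numbers with basis e_1 = 1, e_2 = i: gamma[i][j] lists the coordinates of e_i . e_j.
COMPLEX = [[[1, 0], [0, 1]], [[0, 1], [-1, 0]]]
# Cross product on 3-space: e_i x e_j has k-th coordinate equal to the Levi-Civita symbol.
CROSS = [[[levi_civita(i, j, k) for k in range(3)] for j in range(3)] for i in range(3)]
```

With these constants, `product(COMPLEX, [1, 2], [3, 4])` returns `[-5, 10]`, which matches $(1 + 2i)(3 + 4i) = -5 + 10i$, and `is_associative` accepts `COMPLEX` while rejecting `CROSS`.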

The linear transformation $R_x$ on $\mathfrak{L}$ whose matrix is $\Gamma_x$ is the correspondence

(8)  $a \to aR_x = a \cdot x$,

and is called a right multiplication of $\mathfrak{A}$. Its matrix $\Gamma_x$ depends upon our choice of the linear space equivalence between $\mathfrak{A}$ and $\mathfrak{L}$. However the transformation $R_x$ does not depend upon this choice. We shall therefore study the properties of $R_x$ rather than of $\Gamma_x$. However our present notation $aR_x$ for the result of applying $R_x$ to $a$ has the advantage that $aR_x = a\Gamma_x$, so that in computations we may replace $R_x$ by its matrix. We shall not use the author's earlier notation $a^{R_x}$.

The set

(9)  $R(\mathfrak{A})$

of all right multiplications of $\mathfrak{A}$ is a linear subspace of order at most $n$ of the total matric algebra $(\mathfrak{F})_n$ (of order $n^2$ over $\mathfrak{F}$) of all linear transformations on $\mathfrak{L}$. The correspondence

(10)  $x \to R_x$

is a linear mapping of $\mathfrak{L}$ on $R(\mathfrak{A})$, and $R(\mathfrak{A})$ is spanned by $R_{e_1}, \dots, R_{e_n}$.

We may now state that any algebra $\mathfrak{A}$ of order $n$ over $\mathfrak{F}$ consists of a linear space $\mathfrak{L}$ of this same order, a linear space $R(\mathfrak{A})$ of order $m \leq n$ over $\mathfrak{F}$ consisting of linear transformations on $\mathfrak{L}$, and a linear mapping (10) of $\mathfrak{L}$ on $R(\mathfrak{A})$. Conversely let $\mathfrak{L}$ and a subspace $\mathfrak{R}$ of $(\mathfrak{F})_n$ be given such that the order of $\mathfrak{R}$ is at most $n$. Then we may select any $n$ transformations $R^{(i)}$ which span $\mathfrak{R}$ and define $R_x = R^{(1)}\xi_1 + \cdots + R^{(n)}\xi_n$. This then determines a linear mapping of $\mathfrak{L}$ on $\mathfrak{R}$ and hence an algebra $\mathfrak{A}$ with $\mathfrak{R} = R(\mathfrak{A})$. It is particularly important to note that $\mathfrak{R}$ is completely arbitrary save for the upper bound $n$ on its order.

The linear transformations $L_x$ given by

(11)  $a \to x \cdot a = aL_x$

are called left multiplications of $\mathfrak{A}$ and form the left multiplication space $L(\mathfrak{A})$ of $\mathfrak{A}$. This space and the linear mapping

(12)  $x \to L_x$

of $\mathfrak{L}$ on $L(\mathfrak{A})$ determine and are completely determined by $R(\mathfrak{A})$ and (10). For the matrix of $L_x$ is $\Lambda_x = \Lambda^{(1)}\xi_1 + \cdots + \Lambda^{(n)}\xi_n$, where $\Lambda^{(i)}$ is the matrix with $\gamma_{ijk}$ in the $j$th row and $k$th column. Correspondingly $L^{(1)}\xi_1 + \cdots + L^{(n)}\xi_n = L_x$ so that $L(\mathfrak{A})$ is spanned over $\mathfrak{F}$ by $L^{(1)}, \dots, L^{(n)}$.
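
The matrix of a left multiplication is built from the same structure constants with the roles of the first two indices exchanged. As a hedged illustration (hypothetical names, cross product on three-dimensional space as the sample algebra), the sketch below forms both matrices for one quantity $x$ and checks $x \cdot a = a\Lambda_x$ and $a \cdot x = a\Gamma_x$.

```python
def levi_civita(i, j, k):
    """Sign of the permutation (i, j, k) of (0, 1, 2), or 0 if an index repeats."""
    return (i - j) * (j - k) * (k - i) // 2


# Structure constants of the cross product: gamma[i][j][k] is the k-th coordinate of e_i x e_j.
CROSS = [[[levi_civita(i, j, k) for k in range(3)] for j in range(3)] for i in range(3)]


def right_matrix(gamma, x):
    """Gamma_x: entry (i, k) is the sum over j of gamma[i][j][k] * x[j]."""
    n = len(x)
    return [[sum(gamma[i][j][k] * x[j] for j in range(n)) for k in range(n)]
            for i in range(n)]


def left_matrix(gamma, x):
    """Lambda_x: entry (j, k) is the sum over i of x[i] * gamma[i][j][k]."""
    n = len(x)
    return [[sum(x[i] * gamma[i][j][k] for i in range(n)) for k in range(n)]
            for j in range(n)]


def vec_mat(a, m):
    """Row vector times matrix."""
    n = len(a)
    return [sum(a[i] * m[i][k] for i in range(n)) for k in range(n)]


a, x = [1, 2, 3], [4, 5, 6]
x_times_a = vec_mat(a, left_matrix(CROSS, x))   # x . a = a * Lambda_x
a_times_x = vec_mat(a, right_matrix(CROSS, x))  # a . x = a * Gamma_x
```

Here `x_times_a` is `[3, -6, 3]` and `a_times_x` is its negative, reflecting the anticommutativity of the cross product: for this algebra $\Lambda_x = -\Gamma_x$.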

The right and left multiplication spaces of $\mathfrak{A}$ generate another linear subspace of $(\mathfrak{F})_n$ which we shall call the transformation algebra of $\mathfrak{A}$. It is the algebra

(13)  $T(\mathfrak{A}) = \mathfrak{F}[I, R^{(1)}, \dots, R^{(n)}, L^{(1)}, \dots, L^{(n)}]$

of all polynomials with coefficients in $\mathfrak{F}$ in the $R^{(i)}$ which span $R(\mathfrak{A})$, the $L^{(i)}$ which span $L(\mathfrak{A})$ and the identity transformation $I$ of $(\mathfrak{F})_n$. All three spaces $R(\mathfrak{A})$, $L(\mathfrak{A})$, $T(\mathfrak{A})$ will be used to describe properties of $\mathfrak{A}$ and we shall call them the multiplication spaces of $\mathfrak{A}$.

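The scalar extension $\mathfrak{A}_{\mathfrak{K}}$ of $\mathfrak{A}$ by any scalar extension field $\mathfrak{K}$ of $\mathfrak{F}$ is the set of vectors (1) with the $\alpha_i$ in $\mathfrak{K}$ and with $a \cdot x$ defined in $\mathfrak{A}_{\mathfrak{K}}$ by the same $\gamma_{ijk}$ as define $\mathfrak{A}$. Then clearly we have the same $R^{(i)}$ and $L^{(i)}$, and

(14)  $R(\mathfrak{A}_{\mathfrak{K}}) = [R(\mathfrak{A})]_{\mathfrak{K}}$,  $L(\mathfrak{A}_{\mathfrak{K}}) = [L(\mathfrak{A})]_{\mathfrak{K}}$,  $T(\mathfrak{A}_{\mathfrak{K}}) = [T(\mathfrak{A})]_{\mathfrak{K}}$.

When we begin to discuss more than one algebra $\mathfrak{A}$ defined for the same $\mathfrak{L}$ it will be necessary to distinguish $\mathfrak{A}$ from the fixed linear space $\mathfrak{L}$ of the quantities of $\mathfrak{A}$. However no confusion will arise if we speak of a linear subspace of $\mathfrak{L}$ as a subspace of $\mathfrak{A}$ and this will be desirable, of course, in discussing subalgebras of $\mathfrak{A}$.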


3. Products of spaces 

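In our study of the multiplication spaces of an algebra we shall need to use the notations for products of spaces of both the algebra $\mathfrak{A}$ and the algebra $(\mathfrak{F})_n$. We define the product

$\mathfrak{B}\mathfrak{C}$

of any two linear subspaces of an algebra $\mathfrak{A}$ to be the linear subspace over $\mathfrak{F}$ of $\mathfrak{A}$ spanned by $b_i \cdot c_j$ ($i = 1, \dots, s$; $j = 1, \dots, t$), where $b_1, \dots, b_s$ span $\mathfrak{B}$ and $c_1, \dots, c_t$ span $\mathfrak{C}$. Then the square $\mathfrak{B}^2 = \mathfrak{B}\mathfrak{B}$ is defined and we define the right power $\mathfrak{B}^{k+1} = \mathfrak{B}^k\mathfrak{B}$.

If $a$ is in $\mathfrak{A}$ the subspace (of order zero or one over $\mathfrak{F}$) of $\mathfrak{A}$ spanned by $a$ will be designated by $a\mathfrak{F}$. Then we write $a\mathfrak{B}$ for $(a\mathfrak{F})\mathfrak{B}$ and similarly $\mathfrak{B}a$ for $\mathfrak{B}(a\mathfrak{F})$. If $c$ is also in $\mathfrak{A}$ we write $(a\mathfrak{B})c$ for $(a\mathfrak{B})(c\mathfrak{F})$ and $a(\mathfrak{B}c)$ for $(a\mathfrak{F})(\mathfrak{B}c)$. If $\mathfrak{A}$ is associative and $\mathfrak{B}$, $\mathfrak{C}$, $\mathfrak{D}$ are linear subspaces then $(\mathfrak{B}\mathfrak{C})\mathfrak{D} = \mathfrak{B}(\mathfrak{C}\mathfrak{D})$, and $b\mathfrak{C}d$ is defined to be $(b\mathfrak{C})d = b(\mathfrak{C}d)$ for every $b$ and $d$ in $\mathfrak{A}$.

The definitions above apply of course both to subspaces of any algebra $\mathfrak{A}$ of order $n$ over $\mathfrak{F}$ and to subspaces of $(\mathfrak{F})_n$. However let $\mathfrak{B}$ be a linear subspace of $\mathfrak{A}$ and $\mathfrak{S}$ be a linear subspace of $(\mathfrak{F})_n$. We define

(15)  $\mathfrak{B}\mathfrak{S}$

to be the linear subspace of $\mathfrak{A}$ spanned over $\mathfrak{F}$ by the products $bS$ for $b$ in $\mathfrak{B}$ and $S$ in $\mathfrak{S}$. Then we have defined the product operation (15) as an operation on $\mathfrak{A}$, $(\mathfrak{F})_n$ to $\mathfrak{A}$. We also define $b\mathfrak{S} = (b\mathfrak{F})\mathfrak{S}$, $\mathfrak{B}S = \mathfrak{B}(S\mathfrak{F})$ for all linear subspaces $\mathfrak{B}$ of $\mathfrak{A}$ and $\mathfrak{S}$ of $(\mathfrak{F})_n$, and all quantities $b$ in $\mathfrak{A}$ and $S$ in $(\mathfrak{F})_n$. Clearly $(\mathfrak{B}\mathfrak{S})\mathfrak{T} = \mathfrak{B}(\mathfrak{S}\mathfrak{T})$ for all linear subspaces $\mathfrak{B}$ of $\mathfrak{A}$ and $\mathfrak{S}$ and $\mathfrak{T}$ of $(\mathfrak{F})_n$. We now prove N. Jacobson's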





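Lemma 1. Let $\mathfrak{N}$ be a nilpotent subalgebra of $(\mathfrak{F})_n$. Then $\mathfrak{A}\mathfrak{N} \neq \mathfrak{A}$ or zero.

For $\mathfrak{N} \neq 0$ and contains an $S \neq 0$ of $(\mathfrak{F})_n$, $aS \neq 0$ for some $a$ of $\mathfrak{A}$ and is in $\mathfrak{A}\mathfrak{N} \neq 0$. If $\mathfrak{A}\mathfrak{N} = \mathfrak{A}$ then $\mathfrak{A}\mathfrak{N}^2 = (\mathfrak{A}\mathfrak{N})\mathfrak{N} = \mathfrak{A}\mathfrak{N} = \mathfrak{A}$ and $\mathfrak{A}\mathfrak{N}^k = \mathfrak{A}$ implies that $\mathfrak{A}\mathfrak{N}^{k+1} = (\mathfrak{A}\mathfrak{N}^k)\mathfrak{N} = \mathfrak{A}\mathfrak{N} = \mathfrak{A}$. Hence $\mathfrak{A}\mathfrak{N}^t = \mathfrak{A}$ for every $t$. But $\mathfrak{N}^t = 0$ for some $t$, $\mathfrak{A} = 0$ which is impossible.

If $E$ is in $(\mathfrak{F})_n$ and $\mathfrak{S}$ is a linear subspace of $(\mathfrak{F})_n$ the condition $E\mathfrak{S} = E\mathfrak{S}E$ means that $ES = EUE$ for every $S$ of $\mathfrak{S}$ where $U$ in $\mathfrak{S}$ is determined (but not necessarily uniquely) by $S$. However when $E$ is an idempotent, that is $E^2 = E$, the property $E\mathfrak{S} = E\mathfrak{S}E$ is equivalent to $ES = ESE$ for every $S$ of $\mathfrak{S}$. For $ES = EUE = EUE^2 = (EUE)E = ESE$.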

4. Subalgebras 

If $\mathfrak{B}$ is a linear subspace of an algebra $\mathfrak{A}$ the set of all $R_y$ for $y$ in $\mathfrak{B}$ is a linear subspace of $R(\mathfrak{A})$, the set of all $L_y$ is a linear subspace of $L(\mathfrak{A})$, and these subspaces, together with $I$, generate a subalgebra of $T(\mathfrak{A})$. We designate these three linear subspaces of $T(\mathfrak{A})$ by

(16)  $R(\mathfrak{B}, \mathfrak{A})$,  $L(\mathfrak{B}, \mathfrak{A})$,  $T(\mathfrak{B}, \mathfrak{A})$

respectively, where $T(\mathfrak{B}, \mathfrak{A})$ is the set of all polynomials with coefficients in $\mathfrak{F}$ in the $R_y$, the $L_y$ and $I$.

If $\mathfrak{B}$ has order $m < n$ over $\mathfrak{F}$ we may express $\mathfrak{A}$ as the supplementary sum $\mathfrak{B} + \mathfrak{C}$ where $\mathfrak{C}$ has order $n - m$. This means that every $a$ of $\mathfrak{A}$ is uniquely expressible in the form $a = b + c$ for $b$ in $\mathfrak{B}$ and $c$ in $\mathfrak{C}$. However $\mathfrak{C}$ is by no means unique. We now define a mapping

$E$:  $a = b + c \to b = aE$

of $\mathfrak{A}$ on $\mathfrak{B}$. It is an idempotent linear transformation of rank $m$, that is, $E^2 = E$ and $m$ is the rank of the matrix of $E$. We then have $\mathfrak{B} = \mathfrak{A}E$ where $E$ is characterized by the property that $a = aE$ if and only if $a$ is in $\mathfrak{B}$, $aE = 0$ if and only if $a$ is in $\mathfrak{C}$. A corresponding idempotent for $\mathfrak{C}$ is $I - E$. We now have

Lemma 2. Let $\mathfrak{B}$ be a linear subspace of order $m$ of $\mathfrak{A}$ so that $\mathfrak{B} = \mathfrak{A}E$ for an idempotent $E$ of rank $m$ in $(\mathfrak{F})_n$. Then $\mathfrak{B}$ is a subalgebra of $\mathfrak{A}$ if and only if $E[R(\mathfrak{B}, \mathfrak{A})] = E[R(\mathfrak{B}, \mathfrak{A})]E$.

For $\mathfrak{B}$ is a subalgebra of $\mathfrak{A}$ if and only if $aE \cdot y = (aE \cdot y)E$ for every $a$ of $\mathfrak{A}$ and $y$ of $\mathfrak{B}$. Then $aER_y = aER_yE$, $ER_y = ER_yE$ and we have our lemma since $E^2 = E$.

Note that also $y \cdot aE = aEL_y = aEL_yE$, $E[L(\mathfrak{B}, \mathfrak{A})] = E[L(\mathfrak{B}, \mathfrak{A})]E$. Since $EIE = E = EI$ we see that $EU = EUE$ for every $U$ of $I\mathfrak{F}$, $R(\mathfrak{B}, \mathfrak{A})$, $L(\mathfrak{B}, \mathfrak{A})$. But if $ES = ESE$ and $EU = EUE$ we have $E(S + U) = E(S + U)E$, $ESU = ESEU = ESEUE = ESUE$. Hence $E[T(\mathfrak{B}, \mathfrak{A})] = E[T(\mathfrak{B}, \mathfrak{A})]E$. The converse is trivial and we have

Lemma 2'. Let $\mathfrak{B} = \mathfrak{A}E$ as in Lemma 2. Then $\mathfrak{B}$ is a subalgebra of $\mathfrak{A}$ if and only if $E[T(\mathfrak{B}, \mathfrak{A})] = E[T(\mathfrak{B}, \mathfrak{A})]E$.
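
The criterion of Lemma 2 lends itself to a direct check. The sketch below is an illustration with hypothetical names, using the cross product on three-dimensional space as the algebra and coordinate projections as the idempotents $E$. Because $R_y$ is linear in $y$, testing $ER_y = ER_yE$ on a spanning set of the subspace is enough.

```python
def levi_civita(i, j, k):
    """Sign of the permutation (i, j, k) of (0, 1, 2), or 0 if an index repeats."""
    return (i - j) * (j - k) * (k - i) // 2


# Structure constants of the cross product: gamma[i][j][k] is the k-th coordinate of e_i x e_j.
CROSS = [[[levi_civita(i, j, k) for k in range(3)] for j in range(3)] for i in range(3)]


def right_matrix(gamma, y):
    """Matrix of the right multiplication R_y."""
    n = len(y)
    return [[sum(gamma[i][j][k] * y[j] for j in range(n)) for k in range(n)]
            for i in range(n)]


def mat_mul(a, b):
    n = len(a)
    return [[sum(a[i][t] * b[t][k] for t in range(n)) for k in range(n)]
            for i in range(n)]


def diagonal(entries):
    n = len(entries)
    return [[entries[i] if i == k else 0 for k in range(n)] for i in range(n)]


def passes_lemma_2(gamma, e, spanning_set):
    """True if E R_y == E R_y E for every y in a spanning set of the subspace A E."""
    for y in spanning_set:
        e_r = mat_mul(e, right_matrix(gamma, y))
        if e_r != mat_mul(e_r, e):
            return False
    return True


# The line spanned by e_1 is a (zero) subalgebra: e_1 x e_1 = 0.
line_ok = passes_lemma_2(CROSS, diagonal([1, 0, 0]), [[1, 0, 0]])
# The plane spanned by e_1, e_2 is not: e_1 x e_2 = e_3 leaves the plane.
plane_ok = passes_lemma_2(CROSS, diagonal([1, 1, 0]), [[1, 0, 0], [0, 1, 0]])
```

The first check succeeds and the second fails, in agreement with the direct computation of the products.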





5. Ideals 

A subspace $\mathfrak{B}$ of an algebra $\mathfrak{A}$ is a right ideal of $\mathfrak{A}$ if and only if $y \cdot x$ is in $\mathfrak{B}$ for every $y$ of $\mathfrak{B}$ and $x$ of $\mathfrak{A}$. Then $\mathfrak{B} = \mathfrak{A}E$ for an idempotent $E$ of rank equal to the order of $\mathfrak{B}$ over $\mathfrak{F}$ and $\mathfrak{B}$ is a right ideal of $\mathfrak{A}$ if and only if either of the following conditions

(17)  $L_y = L_yE$,  $ER_x = ER_xE$  ($x$ in $\mathfrak{A}$, $y = yE$ in $\mathfrak{B}$)

holds. For $y \cdot x = xL_y = xL_yE$, $y = aE$, $aE \cdot x = aER_x = aER_xE$. We may state this result as

Lemma 3. Let $\mathfrak{B} = \mathfrak{A}E$ for an idempotent $E$ of $\mathfrak{A}$. Then $\mathfrak{B}$ is a right ideal of $\mathfrak{A}$ if and only if $ER(\mathfrak{A}) = ER(\mathfrak{A})E$. This is equivalent to the condition that $L(\mathfrak{B}, \mathfrak{A})$ be contained in $[L(\mathfrak{A})]E$.

In the theory of group representations the property $ER(\mathfrak{A}) = ER(\mathfrak{A})E$ for $E \neq 0, I$ is called the property that $R(\mathfrak{A})$ is a reducible set of linear transformations. We shall not use this terminology again here.

Left ideals are defined similarly and $\mathfrak{B} = \mathfrak{A}E$ is a left ideal if and only if $EL(\mathfrak{A}) = EL(\mathfrak{A})E$, and thus if and only if $R(\mathfrak{B}, \mathfrak{A})$ is in $[R(\mathfrak{A})]E$. We call $\mathfrak{B}$ an ideal of $\mathfrak{A}$ if it is both a left and a right ideal. This occurs if and only if $\mathfrak{B} = \mathfrak{A}E$ where $EU = EUE$ for every $U$ in either $R(\mathfrak{A})$ or $L(\mathfrak{A})$. As in the proof of Lemma 2 we have $EU = EUE$ for every $U$ of $T(\mathfrak{A})$ and have

Lemma 4. A linear subspace $\mathfrak{B} = \mathfrak{A}E$ of $\mathfrak{A}$ is an ideal of $\mathfrak{A}$ if and only if $ET(\mathfrak{A}) = ET(\mathfrak{A})E$.

We shall call a quantity $a$ of $\mathfrak{A}$ right singular or right non-singular according as $R_a$ is or is not singular. We then have

Lemma 5. Let $\mathfrak{B} = \mathfrak{A}E$ be an ideal of $\mathfrak{A}$ and $a$ be a right non-singular quantity of $\mathfrak{A}$. Then $E(R_a)^{-1} = E(R_a)^{-1}E$.

For $R_a$ is in the associative algebra $T(\mathfrak{A})$ and so is $(R_a)^{-1}$, $ET(\mathfrak{A}) = ET(\mathfrak{A})E$.

We next prove

Lemma 6. Let $P$ be in $(\mathfrak{F})_n$ and $\mathfrak{F}[P]$ be a field of degree $n$ over $\mathfrak{F}$. Then $EP = EPE$ for an idempotent $E$ of $(\mathfrak{F})_n$ if and only if $E = I$ or $E = 0$.

For let $EP = EPE$ and $E \neq 0$. Then $EP$ is in the total matric algebra $E(\mathfrak{F})_nE$ with unity quantity $E$, a total matric algebra whose degree $m$ is the rank of $E$. If $EP^k = EP^kE$ then $EP^{k+1} = EP^kEP = EP^kEPE = EP^{k+1}E$, $EP^t = EP^tE$ for every $t$. But then $(EP)^t = EP^tE$ since from $(EP)^k = EP^kE$ we have $(EP)^{k+1} = EP^kEEP = EP^{k+1}E$. It follows that $\phi(EP) = E\phi(P)E$ for any polynomial $\phi(P)$. But if $\phi(\lambda)$ is the minimum function of $P$ it is an irreducible polynomial of degree $n$ and $\phi(EP) = 0$. This is impossible when $E$ is singular since the minimum function of $EP$ in a total matric algebra of degree $m < n$ has degree at most $m$. Thus $E = I$.

6. Divisors of zero 

If $b$ is right non-singular the equation $x \cdot b = a$ has the unique solution $x = a(R_b)^{-1}$. However there exists a $c \neq 0$ such that $c \cdot b = 0$ when $b$ is right singular. We shall call $b$ a right divisor of zero if it is a non-zero right singular quantity. Left singularity and left non-singularity as well as left divisors of zero are defined similarly, and it is clear that an algebra contains right divisors of zero $b$ if and only if it contains left divisors of zero $c$.

A quantity $b$ of an algebra $\mathfrak{A}$ is called an absolute right divisor of zero if $b \neq 0$ and $a \cdot b = 0$ for every $a$ of $\mathfrak{A}$. But then $R_b = 0$. This can occur only if the linear mapping $x \to R_x$ of $\mathfrak{A}$ on $R(\mathfrak{A})$ is singular, that is, if and only if $R(\mathfrak{A})$ has smaller order than $\mathfrak{A}$. Similarly $L(\mathfrak{A})$ has order less than the order of $\mathfrak{A}$ if and only if some quantity $b$ in $\mathfrak{A}$ is an absolute left divisor of zero, that is, $L_b = 0$ and $b \neq 0$. Each absolute right (left) divisor of zero spans a subalgebra $\mathfrak{B} = b\mathfrak{F}$ of $\mathfrak{A}$ which is a zero algebra of order one and is a left (right) ideal of $\mathfrak{A}$. In fact we have

Lemma 7. A linear subspace $\mathfrak{B} = \mathfrak{A}E = b\mathfrak{F}$ for an absolute right divisor of zero $b$ of $\mathfrak{A}$ if and only if $EL(\mathfrak{A}) = 0$, $E$ has rank one.

For $\mathfrak{B} = b\mathfrak{F} = \mathfrak{A}E$ where $E$ has rank one. Then $aE$ is zero or an absolute right divisor of zero if and only if $x \cdot aE = aEL_x = 0$, $EL_x = 0$, $EL(\mathfrak{A}) = 0$.

If $T(\mathfrak{A}) = I\mathfrak{F}$ then $R_a = \alpha I$, $L_a = \beta I$ for every $a$ of $\mathfrak{A}$ where $\alpha$ and $\beta$ are in $\mathfrak{F}$. If $R_a \neq 0$ for some $a$ then $x \cdot a = aL_x = xR_a = x\alpha \neq 0$ and $L_x \neq 0$. It follows that the mapping $x \to L_x$ is one-to-one, $n = 1$. Similarly if $L_a \neq 0$ we have $n = 1$. Thus $n > 1$ implies that $T(\mathfrak{A}) = I\mathfrak{F}$ only if $R(\mathfrak{A}) = L(\mathfrak{A}) = 0$, $\mathfrak{A}$ is a zero algebra. The converse is trivial and we have

Lemma 8. The algebra $T(\mathfrak{A}) = I\mathfrak{F}$ if and only if $\mathfrak{A}$ is a zero algebra or $n = 1$ and $\mathfrak{A} = \mathfrak{F}$.

We call $b$ an absolute divisor of zero if it is both an absolute right and an absolute left divisor of zero. For algebras containing such quantities we have

Lemma 9. An algebra $\mathfrak{A}$ contains absolute divisors of zero if and only if there exists a non-zero idempotent $E$ of $(\mathfrak{F})_n$ such that $ER(\mathfrak{A}) = EL(\mathfrak{A}) = 0$. Then

(18)  $T(\mathfrak{A}) = \mathfrak{S} + I\mathfrak{F}$

where $\mathfrak{S}$ is the set of all transformations $S$ of $T(\mathfrak{A})$ such that $ES = 0$ and is an ideal of $T(\mathfrak{A})$.

For if $b$ is an absolute divisor of zero the proof of Lemma 7 implies that there exists a non-zero idempotent $E$ such that $ER(\mathfrak{A}) = EL(\mathfrak{A}) = 0$. Conversely if $ER(\mathfrak{A}) = EL(\mathfrak{A}) = 0$ the quantities of $\mathfrak{A}E$ have the property $aE \cdot x = aER_x = 0 = aEL_x = x \cdot aE$. Then $\mathfrak{A}E \neq 0$ consists of zero and absolute divisors of zero. The quantities of $T(\mathfrak{A})$ are sums of scalars $\alpha I$ for $\alpha$ in $\mathfrak{F}$ and products $U = U_1 \cdots U_t$ for the $U_i$ in $R(\mathfrak{A})$ and $L(\mathfrak{A})$. Then $EU = 0$. Hence every transformation of $T(\mathfrak{A})$ is expressible as a sum $S + \alpha I$ with $ES = 0$ and $\alpha$ in $\mathfrak{F}$. Since $T(\mathfrak{A})$ contains $I\mathfrak{F}$ we have $S$ in $T(\mathfrak{A})$, and (18) holds. That $\mathfrak{S}$ is an ideal of $T(\mathfrak{A})$ follows from the property that if $U$ and $S$ are in $\mathfrak{S}$ then $E[U(S + \alpha I)] = EU(S + \alpha I) = 0$, $E[(S + \alpha I)U] = ESU + EU\alpha = 0$.

7. Simple algebras 

An algebra $\mathfrak{A}$ is said to be simple if $\mathfrak{A}$ is not a zero algebra of order one and $\mathfrak{A}$
is the only non-zero ideal of $\mathfrak{A}$. We define the $\mathfrak{A}$-centralizer of any set $\mathfrak{H}$ of
quantities of any algebra $\mathfrak{A}$ to be the set of all quantities $k$ in $\mathfrak{A}$ such that
$k \cdot h = h \cdot k$ for every $h$ of $\mathfrak{H}$ and see that this set is a subalgebra of $\mathfrak{A}$ if $\mathfrak{A}$ is
associative. Then we may prove

Lemma 10. The algebra $T(\mathfrak{A})$ is simple if and only if $\mathfrak{A}$ is either simple or
$T(\mathfrak{A}) = I\mathfrak{F}$ and $\mathfrak{A}$ is a zero algebra.

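For let $T(\mathfrak{A})$ be simple. By Lemma 9 if $\mathfrak{A}$ contains an absolute divisor of
zero we have $\mathfrak{S} = 0$ in (18) and $\mathfrak{A}$ is a zero algebra, by Lemma 8. Hence
let $\mathfrak{A}$ be not a zero algebra and suppose that $\mathfrak{B} = \mathfrak{A}E$ is a non-zero ideal of $\mathfrak{A}$
for an idempotent $E \neq 0$ of $(\mathfrak{F})_n$. We define $\mathfrak{S}$ to be the set of all $S$ in $T(\mathfrak{A})$
such that $S = SE$. By (17) $\mathfrak{S}$ contains $L(\mathfrak{B}, \mathfrak{A})$ and similarly contains
$R(\mathfrak{B}, \mathfrak{A})$. But if $y \neq 0$ and $L_y = 0$ we have $R_y \neq 0$ since $\mathfrak{A}$ contains no absolute
divisor of zero. Hence $L(\mathfrak{B}, \mathfrak{A})$ and $R(\mathfrak{B}, \mathfrak{A})$ are not both zero, $\mathfrak{S} \neq 0$. If
$S$ is in $\mathfrak{S}$ and $U$ is in $T(\mathfrak{A})$ we have $SU = SEU = SEUE = (SU)E$ by Lemma 4
while also $US = (US)E$. But then $SU$ and $US$ are in $\mathfrak{S}$, $\mathfrak{S}$ is an ideal of $T(\mathfrak{A})$,
$\mathfrak{S} = T(\mathfrak{A})$ contains $I = IE = E$, $\mathfrak{B} = \mathfrak{A}E = \mathfrak{A}$, $\mathfrak{A}$ is simple.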

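Conversely let $\mathfrak{A}$ be simple. If $T(\mathfrak{A})$ has a nilpotent ideal $\mathfrak{N}$ we have $\mathfrak{N}R(\mathfrak{A})$
in $\mathfrak{N}$, $\mathfrak{N}L(\mathfrak{A})$ in $\mathfrak{N}$ so that if $\mathfrak{B} = \mathfrak{A}\mathfrak{N}$ we have $\mathfrak{B}\mathfrak{A} = \mathfrak{B}R(\mathfrak{A}) = \mathfrak{A}\mathfrak{N}R(\mathfrak{A})$
contained in $\mathfrak{B}$, $\mathfrak{A}\mathfrak{B} = \mathfrak{B}L(\mathfrak{A}) = \mathfrak{A}\mathfrak{N}L(\mathfrak{A})$ in $\mathfrak{B}$. Hence $\mathfrak{B}$ is an ideal of $\mathfrak{A}$. But
by Lemma 1 $\mathfrak{B} \neq 0, \mathfrak{A}$ contrary to hypothesis. Hence $T(\mathfrak{A})$ is an associative
semi-simple algebra and is either simple as desired or is a direct sum. In the
latter case the unity quantity of a component of $T(\mathfrak{A})$ is an idempotent $E$, in
the $(\mathfrak{F})_n$-centralizer of $T(\mathfrak{A})$, which is singular and not zero. Then by Lemma 4
$\mathfrak{B} = \mathfrak{A}E$ is an ideal of $\mathfrak{A}$, $\mathfrak{B} \neq 0$ or $\mathfrak{A}$. This completes our proof.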

8. Central simple algebras 

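A field $\mathfrak{C}$ consisting of linear transformations over $\mathfrak{F}$ on a linear space $\mathfrak{A}$ of
order $n$ over $\mathfrak{F}$ is called a subfield of $(\mathfrak{F})_n$ if the identity transformation of $(\mathfrak{F})_n$
is in $\mathfrak{C}$. Then $\mathfrak{A}$ may be regarded as being a linear space of order $s$ over $\mathfrak{C}$ and
$n = sr$ where $r$ is the degree of $\mathfrak{C}$ over $\mathfrak{F}$. The set of all linear transformations
over $\mathfrak{C}$ on $\mathfrak{A}$ is the total matric algebra $(\mathfrak{C})_s$ and is clearly the $(\mathfrak{F})_n$-centralizer
of $\mathfrak{C}$. The equations $a \cdot x = aR_x$, $x \cdot a = aL_x$ then define the algebra $\mathfrak{A}$ over $\mathfrak{F}$
as an algebra over $\mathfrak{C}$ if and only if every $R_x$ and $L_x$ is in $(\mathfrak{C})_s$. But then $R(\mathfrak{A})$,
$L(\mathfrak{A})$, $T(\mathfrak{A})$ are in $(\mathfrak{C})_s$. It follows that $\mathfrak{A}$ is an algebra over a subfield $\mathfrak{C}$ of
$(\mathfrak{F})_n$ if and only if $\mathfrak{C}$ is in the $(\mathfrak{F})_n$-centralizer of $T(\mathfrak{A})$.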

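We define the transformation center of $\mathfrak{A}$ to be the $T(\mathfrak{A})$-centralizer of $T(\mathfrak{A})$
and designate this subalgebra of $T(\mathfrak{A})$ by $\mathfrak{Z}(\mathfrak{A})$. If $T(\mathfrak{A})$ is simple the
transformation center of $\mathfrak{A}$ is a field of degree $t$ over $\mathfrak{F}$ and $n = st$, $T(\mathfrak{A})$ is contained
in the total matric algebra $[\mathfrak{Z}(\mathfrak{A})]_s$.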

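An algebra $\mathfrak{A}$ over $\mathfrak{F}$ is said to be central simple over $\mathfrak{F}$ if $\mathfrak{A}_\mathfrak{K}$ is simple for every
scalar extension $\mathfrak{K}$ of $\mathfrak{F}$. If then $\mathfrak{A}$ is simple and $\mathfrak{C}$ is any subfield of its
transformation center $\mathfrak{Z} = \mathfrak{Z}(\mathfrak{A})$ the degree $r$ of $\mathfrak{C}$ divides $t = rp$ and $\mathfrak{A}$ is simple of
order $sp$ over $\mathfrak{C}$. $T(\mathfrak{A})$ is simple over $\mathfrak{C}$ and is contained in $(\mathfrak{C})_{sp}$. But $[T(\mathfrak{A})]_\mathfrak{K} =
T(\mathfrak{A}_\mathfrak{K})$ for every scalar extension $\mathfrak{K}$ of $\mathfrak{C}$ and thus $\mathfrak{A}$ is central simple over $\mathfrak{C}$
only if $T(\mathfrak{A})$ is central simple over $\mathfrak{C}$. This can occur only if $\mathfrak{C} = \mathfrak{Z}(\mathfrak{A})$. We
use this result and then prove⁶

Theorem 1. An algebra $\mathfrak{A}$ of order $n > 1$ over $\mathfrak{F}$ is simple if and only if $T(\mathfrak{A})$
is the total matric algebra $(\mathfrak{Z})_s$ where $\mathfrak{Z} = \mathfrak{Z}(\mathfrak{A})$ is a field of degree $t$ over $\mathfrak{F}$ and
$n = st$. Moreover $\mathfrak{A}$ is central simple over a subfield $\mathfrak{C}$ of $(\mathfrak{F})_n$ if and only if
$\mathfrak{C} = \mathfrak{Z}$.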

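For if $\mathfrak{A}$ is simple so is $\mathfrak{A}$ over its transformation center $\mathfrak{Z}$ and $\mathfrak{A}_\mathfrak{K}$ is not a zero
algebra for any scalar extension $\mathfrak{K}$ of $\mathfrak{Z}$. Now $T(\mathfrak{A})$ is in $(\mathfrak{Z})_s$ and is known⁷
to be a central simple algebra over $\mathfrak{Z}$. The $(\mathfrak{Z})_s$-centralizer of $T(\mathfrak{A})$ is also a
central simple algebra $\mathfrak{D}$ of degree $q$ over $\mathfrak{Z}$ and $T(\mathfrak{A}) = (\mathfrak{Z})_s$ if and only if
$q = 1$. If $q > 1$ we let $\mathfrak{K}$ be a splitting field over $\mathfrak{Z}$ of $\mathfrak{D}$ and see that the total
matric algebra $\mathfrak{D}_\mathfrak{K}$ contains a non-zero idempotent $E$ which is singular and in
the $(\mathfrak{Z})_s$-centralizer of $T(\mathfrak{A}_\mathfrak{K})$. Then by Lemma 4 $\mathfrak{A}_\mathfrak{K}E$ is a non-zero proper
ideal over $\mathfrak{K}$ of $\mathfrak{A}_\mathfrak{K}$, whereas the proof above shows that $\mathfrak{A}_\mathfrak{K}$ is simple, a
contradiction. It follows that $T(\mathfrak{A}) = (\mathfrak{Z})_s$. The only subfields $\mathfrak{C}$ of $(\mathfrak{F})_n$ in the
$(\mathfrak{F})_n$-centralizer of $T(\mathfrak{A})$ are in the $(\mathfrak{F})_n$-centralizer of $\mathfrak{Z}$ and hence in $(\mathfrak{Z})_s$.
They are then in the $(\mathfrak{Z})_s$-centralizer of $(\mathfrak{Z})_s$ and thus in $\mathfrak{Z}$, $\mathfrak{A}$ is central simple
over $\mathfrak{C}$ only if $\mathfrak{C} = \mathfrak{Z}$. Conversely let $T(\mathfrak{A}) = (\mathfrak{Z})_s$ where $n = st$ and $(\mathfrak{Z})_s$
is a total matric algebra of degree $s$ over $\mathfrak{Z}$, $\mathfrak{Z}$ is a field of degree $t$ over $\mathfrak{F}$. Then
the order of $T(\mathfrak{A})$ is $s^2t > 1$ since otherwise $s = t = n = 1$. Hence $\mathfrak{A}$ is not a
zero algebra and, by Lemma 10, $\mathfrak{A}$ is simple. Also $\mathfrak{A}$ is central simple over $\mathfrak{Z}$
since $T(\mathfrak{A})$ is a total matric algebra over $\mathfrak{Z}$ and is central simple, $\mathfrak{A}_\mathfrak{K}$ is not a
zero algebra over any scalar extension of $\mathfrak{Z}$, $\mathfrak{A}_\mathfrak{K}$ is simple. Thus $\mathfrak{A}$ is central
simple over its transformation center.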

9. Algebras with a left unity quantity

Let $e$ be a non-zero vector in a linear space $\mathfrak{L}$ of order $n$ over $\mathfrak{F}$ and $\mathfrak{S}$ be any
linear subspace of order $m \leq n$ of $(\mathfrak{F})_n$. Then $e\mathfrak{S}$ is a linear subspace of $\mathfrak{L}$ and
the correspondence

(19) $S \to eS$ ($S$ in $\mathfrak{S}$),

is a linear mapping of $\mathfrak{S}$ on $e\mathfrak{S}$. It follows that the order of $e\mathfrak{S}$ over $\mathfrak{F}$ is at
most $m$.


⁶ Results essentially equivalent to this one and to Lemma 10 were given by N. Jacobson,
A note on non-associative algebras, Duke J., vol. 3 (1937), pp. 544-8. The result was first
announced for Lie algebras by the author in the A. M. S. Bulletin, vol. 41 (1935), p. 344,
and the author feels that the present exposition is not only in a form better suited than that
of Jacobson for later application but presents also a much clearer picture of the relations
between an algebra $\mathfrak{A}$ and its transformation algebra $T(\mathfrak{A})$. The idea of studying these
relations was suggested to both Jacobson and the author by the lectures of H. Weyl on Lie
Algebras which were given in Fine Hall in 1933. Note that if $\mathfrak{X}$ is the algebra generated
by the right and left multiplications of $\mathfrak{A}$ then $\mathfrak{X} = T(\mathfrak{A})$ unless $\mathfrak{X}$ does not contain the
identity transformation. But then $\mathfrak{X}$ is an ideal of $T(\mathfrak{A})$ and this cannot occur when $\mathfrak{A}$
is simple.

⁷ For the properties used here see Chapters I, III, IV of the author's Structure of Algebras.



Suppose then that (19) is one-to-one, and that $m = n$. Then (19) maps
$\mathfrak{S}$ on $\mathfrak{L}$ such that $x = eS = eT$ if and only if $S = T$. We may then define $R_x$
as that transformation $S$ for which $eS = x$ and have defined an algebra $\mathfrak{A}$
without absolute right divisors of zero and such that $\mathfrak{S} = R(\mathfrak{A})$, $e \cdot x = eR_x = x$
for every $x$ of $\mathfrak{A}$.

The quantity $e$ is now a left unity quantity of $\mathfrak{A}$. Conversely every algebra
$\mathfrak{A}$ with a left unity quantity $e$ has no absolute right divisors of zero and is such
that the linear mapping $R_x \to x = eR_x$ is a one-to-one mapping of $R(\mathfrak{A})$ on
$\mathfrak{A} = eR(\mathfrak{A})$. Then multiplication in $\mathfrak{A}$ is given by

(20) $eS \cdot eR = eSR$

for every $S$ of $(\mathfrak{F})_n$ and every $R$ of $R(\mathfrak{A})$. It may be seen, however, that (20)
does not hold if $R$ is not in $R(\mathfrak{A})$.

If $e$ is given we say that $b$ in $\mathfrak{A}$ is right regular with respect to $e$ if $c \cdot b = e$ for
$c$ in $\mathfrak{A}$. Then $c$ is a left inverse of $b$ (relative to $e$). Such a quantity may exist
even when $b$ is right singular.

The left inverse of a right non-singular quantity $b$ may be expressed as a
certain polynomial in $b$. We define $b^2 = b \cdot b$ and then the right powers of $b$
by $b^{k+1} = (b^k) \cdot b$ for $k > 0$, $b^k = e$ for $k = 0$. If $\lambda$ is an indeterminate over $\mathfrak{F}$ and

(21) $\phi(\lambda) = \lambda^t + \beta_1\lambda^{t-1} + \cdots + \beta_t$ ($\beta_i$ in $\mathfrak{F}$)

we define the right polynomial

(22) $\phi_R(b) = b^t + \beta_1b^{t-1} + \cdots + \beta_{t-1}b + \beta_te$

for any $b$ in $\mathfrak{A}$ and right powers $b^k$. Then it is clear from (20) that

(23) $\phi_R(b) = e\phi(R_b)$.

Hence $\phi_R(b) = 0$ if $\phi(R_b) = 0$. However $\phi_R(b)$ may be zero for $\phi(R_b)$ a non-zero
linear transformation carrying the vector $e$ into zero. Note now that in
particular

(24) $f_R(b) = g_R(b) = 0$

where $f(\lambda)$ is the characteristic function and $g(\lambda)$ is the minimum function of $R_b$.

We define the right minimum function (with respect to $e$) of a quantity $b$ of $\mathfrak{A}$
to be the polynomial (21) of least degree $t$ such that $\phi_R(b) = 0$. Its uniqueness
is then implied by

Theorem 2. The right minimum function $\phi(\lambda)$ of a quantity $b$ of an algebra
$\mathfrak{A}$ divides every $\psi(\lambda)$ such that $\psi_R(b) = 0$.

For we write $\psi(\lambda) = \phi(\lambda)\rho(\lambda) + \sigma(\lambda)$ and have $\psi_R(b) = e\psi(R_b) =
e[\phi(R_b)\rho(R_b) + \sigma(R_b)] = e\sigma(R_b) = \sigma_R(b)$. But the degree of $\sigma(\lambda)$ may be taken
to be less than that of $\phi(\lambda)$, $\sigma(\lambda)$ is identically zero.

We see in particular that the right minimum function of $b$ divides the
minimum and characteristic functions of $R_b$. If $b$ is right non-singular the
constant term of these latter functions is not zero and $\phi(\lambda)$ has the form (21)
for $\beta_t \neq 0$. But then $\phi_R(b) = (b^{t-1} + \beta_1b^{t-2} + \cdots + \beta_{t-1}e) \cdot b + \beta_te = 0$,

(25) $b^{-1} \cdot b = e$, $b^{-1} = -\beta_t^{-1}(b^{t-1} + \beta_1b^{t-2} + \cdots + \beta_{t-1}e)$.

Moreover if $c \cdot b = e$ we have $(c - b^{-1}) \cdot b = (c - b^{-1})R_b = 0$ if and only if
$c = b^{-1}$.

Right singular quantities may be right regular and it may happen that,
while (25) holds for $\beta_t \neq 0$, $R_b$ may be singular. To illustrate this we consider
the linear space $\mathfrak{L}$ of order three over a field $\mathfrak{F}$ of characteristic not two where $\mathfrak{L}$
consists of all two-rowed symmetric matrices. Then a basis of $\mathfrak{L}$ is given by


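(26) $e = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}, \quad u_2 = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \quad u_3 = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}.$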




We define an algebra $\mathfrak{A}$ by

(27) $a \cdot x = \dfrac{ax + xa}{2}$

where products on the right are ordinary two-rowed square matrix products.
Then $\mathfrak{A}$ is a commutative algebra such that $e \cdot x = x \cdot e = \tfrac{1}{2}(ex + xe) = x$.
But also $x \cdot x = x^2$ and thus

(28) $u_2 \cdot u_2 = u_3 \cdot u_3 = e$,

while

(29) $2u_2 \cdot u_3 = 2u_3 \cdot u_2 = \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix} + \begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix} = 0$.

It follows that $u_2$ and $u_3$ are right and left divisors of zero and are right (and
left) regular. The (right) minimum function of both $u_2$ and $u_3$ is $\lambda^2 - 1$ and
the minimum function of $R_{u_2}$ and $R_{u_3}$ is $\lambda(\lambda^2 - 1)$. Here the matrices of these
linear transformations with respect to the basis (26) are

$R_{u_2} = \begin{pmatrix} 0 & 1 & 0 \\ 1 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix}, \quad R_{u_3} = \begin{pmatrix} 0 & 0 & 1 \\ 0 & 0 & 0 \\ 1 & 0 & 0 \end{pmatrix}.$
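
The example above can be checked numerically. The following is only a sketch: it assumes the basis $e = I$, $u_2 = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}$, $u_3 = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}$, a choice inferred from (28) and (29), and the helper names (`jordan`, `coords`, `right_mult`) are illustrative, not the author's.

```python
from fractions import Fraction as F

# Assumed basis of the symmetric 2x2 matrices (inferred from (28) and (29)).
E = [[F(1), F(0)], [F(0), F(1)]]
U2 = [[F(0), F(1)], [F(1), F(0)]]
U3 = [[F(1), F(0)], [F(0), F(-1)]]
BASIS = [E, U2, U3]

def matmul(a, b):
    """Ordinary matrix product of two rectangular lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def jordan(a, x):
    """The product (27): a.x = (ax + xa)/2."""
    ax, xa = matmul(a, x), matmul(x, a)
    return [[(ax[i][j] + xa[i][j]) / 2 for j in range(2)] for i in range(2)]

def coords(m):
    """Coordinates of the symmetric matrix [[p, q], [q, r]] in the basis E, U2, U3."""
    p, q, r = m[0][0], m[0][1], m[1][1]
    return [(p + r) / 2, q, (p - r) / 2]

def right_mult(x):
    """Matrix of R_x on row vectors: row i holds the coordinates of BASIS[i].x."""
    return [coords(jordan(b, x)) for b in BASIS]

R2, R3 = right_mult(U2), right_mult(U3)
```

With this basis `R2` and `R3` come out as the two matrices displayed above, `jordan(U2, U3)` is the zero matrix, and `R2` satisfies $R^3 = R$, in agreement with the minimum function $\lambda(\lambda^2 - 1)$.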


10. Algebras with a unity quantity 

A quantity $f$ of $\mathfrak{A}$ is a right unity quantity of $\mathfrak{A}$ if $x \cdot f = x$ for every $x$ of $\mathfrak{A}$,
that is, $R_f = I$. Thus the mapping

(30) $L_x \to fL_x = x$

of $L(\mathfrak{A})$ on $\mathfrak{A}$ is one-to-one, and multiplication in $\mathfrak{A}$ is defined by

(31) $fL \cdot fS = fSL$

for every $S$ of $(\mathfrak{F})_n$ and every $L$ of $L(\mathfrak{A})$. Left regularity and left polynomials
are defined in the obvious fashion and every left non-singular quantity $b$ has
a right inverse which is a left polynomial in $b$.

If $\mathfrak{A}$ has both a left unity quantity $e$ and a right unity quantity $f$ we have both
$e \cdot f = f$ and $e \cdot f = e$ so that $e = f$. Then $e$ is the unique quantity of $\mathfrak{A}$ such that
$e \cdot x = x \cdot e = x$ for every $x$ of $\mathfrak{A}$ and we shall call $e$ the unity quantity of $\mathfrak{A}$. It has
the determining property

(32) $R_e = L_e = I$,

and multiplication in $\mathfrak{A}$ is defined by

(33) $eS \cdot eR = eSR$, $eL \cdot eS = eSL$

for every $S$ of $(\mathfrak{F})_n$, $R$ of $R(\mathfrak{A})$, $L$ of $L(\mathfrak{A})$. Note that then

(34) $eL \cdot eR = eRL = eLR$,

so that $e(RL - LR) = 0$ for every $R$ of $R(\mathfrak{A})$ and $L$ of $L(\mathfrak{A})$. However we shall
see that $RL = LR$ for every such $R$ and $L$ if and only if $\mathfrak{A}$ is associative.

11. Isotopes of algebras 

All algebras $\mathfrak{A}$ of the same order $n$ may be regarded as having quantities
comprising the same linear space $\mathfrak{L}$ of order $n$ over $\mathfrak{F}$. If $\mathfrak{A}$ is given the space
$R(\mathfrak{A})$ and the mapping $x \to R_x$ of $\mathfrak{L}$ on $R(\mathfrak{A})$ are thereby determined and
conversely. Thus if $\mathfrak{A}_0$ is a second algebra we have a corresponding linear mapping
$x \to R_x^{(0)}$ of $\mathfrak{A}_0$ on a linear subspace $R(\mathfrak{A}_0)$ of $(\mathfrak{F})_n$ and we write

(35) $(a, x) = aR_x^{(0)}$

for products in $\mathfrak{A}_0$. We shall now say that $\mathfrak{A}$ is isotopic to $\mathfrak{A}_0$ if there exist
non-singular linear transformations $P$, $Q$, $C$ such that

(36) $R_x^{(0)} = PR_{xQ}C$,

and shall call (36) an isotopy of $\mathfrak{A}$ and $\mathfrak{A}_0$.

If $\mathfrak{A}$ is isotopic to $\mathfrak{A}_0$ then $R^{(0)}_{xQ^{-1}} = PR_xC$ so that $R_x = P^{-1}R^{(0)}_{xQ^{-1}}C^{-1}$ and $\mathfrak{A}_0$
is isotopic to $\mathfrak{A}$. Also $\mathfrak{A}$ is isotopic to itself under (36) with $P = Q = C = I$.
Finally if $R_x^{(1)} = P_1R^{(0)}_{xQ_1}C_1$ then $R_x^{(1)} = P_2R_{xQ_2}C_2$ where $P_2 = P_1P$, $Q_2 = Q_1Q$,
$C_2 = CC_1$. Hence the relation of isotopy is a formal equivalence relation and
we shall say that $\mathfrak{A}$ and $\mathfrak{A}_0$ are isotopic as well as that $\mathfrak{A}$ is isotopic to $\mathfrak{A}_0$.

All left multiplications $L_x^{(0)}$ of $\mathfrak{A}_0$ are determined when its right multiplications
are given and conversely. Thus we shall determine the conditions relating
$L_x^{(0)}$ and $L_x$ which are equivalent to (36). We observe that in $\mathfrak{A}$ we have

(37) $a \cdot x = aR_x = xL_a$, $x \cdot a = aL_x = xR_a$,

and in $\mathfrak{A}_0$ we have

(38) $(a, x) = aR_x^{(0)} = xL_a^{(0)}$, $(x, a) = aL_x^{(0)} = xR_a^{(0)}$.

Define

(39) $b = aQ$, $z = xP$

and obtain

(40) $aL_x^{(0)} = xR_a^{(0)} = xPR_{aQ}C = zR_bC = (z \cdot b)C = bL_zC$.

Then $aL_x^{(0)} = aQL_{xP}C$ for every $a$ and $x$ of $\mathfrak{L}$ and we have

Theorem 3. The conditions (36) which imply that $\mathfrak{A}$ and $\mathfrak{A}_0$ are isotopic are
equivalent to

(41) $L_x^{(0)} = QL_{xP}C$.
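
Theorem 3 can be illustrated on a small example. The sketch below is not taken from the paper: the structure constants `C_TABLE` and the matrices `P`, `Q`, `C` are arbitrary illustrative choices (each matrix has determinant 1), and the function names are hypothetical. It builds $R_x^{(0)}$ from (36), reads off $L_x^{(0)}$ from the products $(x, e_i) = xR^{(0)}_{e_i}$, and compares the result with formula (41).

```python
# Structure constants of an arbitrary 2-dimensional algebra (an assumption for
# illustration): C_TABLE[i][j] holds the coordinates of e_i . e_j.
C_TABLE = [[[1, 2], [0, 1]],
           [[3, -1], [1, 1]]]
P = [[1, 1], [0, 1]]   # non-singular, chosen arbitrarily
Q = [[2, 1], [1, 1]]   # non-singular, chosen arbitrarily
C = [[1, 0], [2, 1]]   # non-singular, chosen arbitrarily
N = 2

def product(a, x):
    """The product a.x computed from the structure constants."""
    return [sum(a[i] * x[j] * C_TABLE[i][j][k] for i in range(N) for j in range(N))
            for k in range(N)]

def unit(i):
    return [int(i == j) for j in range(N)]

def vecmat(v, m):
    """Row vector times matrix."""
    return [sum(v[i] * m[i][j] for i in range(N)) for j in range(N)]

def matmul(a, b):
    return [vecmat(row, b) for row in a]

def right_mult(x):
    """R_x with a R_x = a.x (row i is e_i . x)."""
    return [product(unit(i), x) for i in range(N)]

def left_mult(x):
    """L_x with a L_x = x.a (row i is x . e_i)."""
    return [product(x, unit(i)) for i in range(N)]

def right_mult_0(x):
    """R_x^(0) = P R_{xQ} C, as in (36)."""
    return matmul(matmul(P, right_mult(vecmat(x, Q))), C)

def left_mult_0_from_products(x):
    """L_x^(0) read off from the isotope's products: row i is (x, e_i) = x R^(0)_{e_i}."""
    return [vecmat(x, right_mult_0(unit(i))) for i in range(N)]

def left_mult_0_formula(x):
    """L_x^(0) = Q L_{xP} C, as in (41)."""
    return matmul(matmul(Q, left_mult(vecmat(x, P))), C)
```

For every `x` the two functions `left_mult_0_from_products(x)` and `left_mult_0_formula(x)` return the same matrix, which is the content of (40).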

The relation of equivalence is an instance of isotopy. For two algebras
$\mathfrak{A}_0$ and $\mathfrak{A}$ are said to be equivalent if there exists a non-singular linear
transformation $a \to aH$ on $\mathfrak{A}_0$ to $\mathfrak{A}$ which is preserved under multiplication. But
then

(42) $(a, x)H = aH \cdot xH$,

that is $aR_x^{(0)}H = aHR_{xH}$. Hence $\mathfrak{A}_0$ and $\mathfrak{A}$ are equivalent if and only if

(43) $R_x^{(0)} = HR_{xH}H^{-1}$.

By Theorem 3 the equivalent algebras $\mathfrak{A}_0$ and $\mathfrak{A}$ are also related by

(44) $L_x^{(0)} = HL_{xH}H^{-1}$.

It is usually more convenient to use simplifications of (36) and (41) obtainable
by replacing $\mathfrak{A}_0$ by an equivalent algebra. Thus we may apply (43) to (36)
with $H = Q^{-1}$ and have $R_x^{(1)} = HR^{(0)}_{xH}H^{-1} = (HP)R_xCH^{-1}$. This result together
with Theorem 3 may be stated as

Theorem 4. Every isotope of an algebra $\mathfrak{A}$ is equivalent to an isotope defined by

(45) $R_x^{(0)} = PR_xC$, $L_x^{(0)} = L_{xP}C$,

for non-singular linear transformations $P$ and $C$.

The form above⁸ has the advantage that in $\mathfrak{A}_0$ we map $x$ on $PR_xC$ but is so
unsymmetrical that we shall prefer the principal isotopy obtained from (36)
by the application of (43) with $H = C$. We state the result as


⁸ The concept of isotopy was suggested to the author by the work of N. Steenrod who,
in his study of homotopy groups in topology, was led to study isotopy of division algebras.
He concluded that algebras related as in (45) would yield the same homotopy properties
and should therefore be put in the same class. The author then formulated the concept
generally as in (36) and obtained Theorem 3 giving the corresponding property for left
multiplications and Theorem 4 showing that Steenrod's isotopes were actually equivalent
to the more general type. However the principal isotopes of (46) are much more
conveniently handled.



Theorem 5. Every isotope of an algebra $\mathfrak{A}$ is equivalent to a principal isotope
$\mathfrak{A}_0$, that is, an isotope with

(46) $R_x^{(0)} = PR_{xQ}$, $L_x^{(0)} = QL_{xP}$,

for non-singular linear transformations $P$ and $Q$.

We observe that (46) implies that $R_x = P^{-1}R^{(0)}_{xQ^{-1}}$, $L_x = Q^{-1}L^{(0)}_{xP^{-1}}$, and thus
that if $\mathfrak{A}_0$ is a principal isotope of $\mathfrak{A}$ then $\mathfrak{A}$ is a principal isotope of $\mathfrak{A}_0$. If
also $R_x^{(1)} = UR^{(0)}_{xV}$ we have $R_x^{(1)} = (UP)R_{xVQ}$, $L_x^{(1)} = VL^{(0)}_{xU} = VQL_{xUP}$.
Finally $\mathfrak{A}$ is a principal isotope of itself with $P$ and $Q$ the identity. Thus we shall
again say that $\mathfrak{A}$ and $\mathfrak{A}_0$ are principal isotopes as well as that $\mathfrak{A}_0$ is a principal
isotope of $\mathfrak{A}$.

Let us note in closing this section that every automorphism of an algebra $\mathfrak{A}$
is an equivalence $H$ of $\mathfrak{A}$ and itself. Then $(a, x) = a \cdot x$ and $R_x^{(0)} = R_x$, $L_x^{(0)} = L_x$.
We state this result as

Theorem 6. A linear transformation $H$ on an algebra $\mathfrak{A}$ defines an
automorphism of $\mathfrak{A}$ if and only if $H$ is non-singular and such that either (and
hence both) of the following conditions holds

(47) $R_{xH} = H^{-1}R_xH$, $L_{xH} = H^{-1}L_xH$ ($x$ in $\mathfrak{A}$).

12. Isotopes with a unity quantity 

If $\mathfrak{A}$ has a unity quantity $e$ and $f$ is any non-zero quantity of $\mathfrak{A}$ there exists
a non-singular linear transformation $H$ such that $e = fH$. Then (43) and (44)
imply that $R_f^{(0)} = HR_eH^{-1} = L_f^{(0)} = HL_eH^{-1} = I$ since $R_e = L_e = I$. It follows
that $\mathfrak{A}$ is equivalent to an algebra $\mathfrak{A}_0$ with $f$ as unity quantity. However
we seek to discover what principal isotopes of $\mathfrak{A}$ have $f$ as unity quantity. We
shall obtain the answer to this question in

Theorem 7. Let $g$ range over all left non-singular quantities of $\mathfrak{A}$, $h$ range over
all right non-singular quantities of $\mathfrak{A}$, so that the non-singular linear
transformations

(48) $P = (R_h)^{-1}$, $Q = (L_g)^{-1}$

exist for each $g$ and $h$. Then the principal isotope of $\mathfrak{A}$ defined by (46), (48) has
$f = g \cdot h$ as a unity quantity. Conversely every isotope of $\mathfrak{A}$ with a unity quantity
$f$ is equivalent to a principal isotope determined as in (48), (46) for $f = g \cdot h$.

For if $f = g \cdot h$ we have $f = gR_h = hL_g$ and $g = fP$, $h = fQ$ where $P$ and $Q$
are defined in (48). Let $\mathfrak{A}_0$ be the principal isotope of $\mathfrak{A}$ defined by (46) for
this $P$ and $Q$ and put $x = f$. Then

$R_f^{(0)} = PR_h = L_f^{(0)} = QL_g = I$,

$f$ is the unity quantity of $\mathfrak{A}_0$. Conversely let (46) define an isotope of $\mathfrak{A}$ with
$f$ as its unity quantity so that if we define $g = fP$, $h = fQ$ we have $R_f^{(0)} = I =
PR_h$, $L_f^{(0)} = I = QL_g$. But then $h$ is right non-singular, $g$ is left non-singular,
(48) holds and $f = gP^{-1} = gR_h = g \cdot h$ as desired.
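
The construction of Theorem 7 can be tried on a concrete algebra. The sketch below assumes the three-dimensional commutative algebra with basis $e, u_2, u_3$ and products $u_2 \cdot u_2 = u_3 \cdot u_3 = e$, $u_2 \cdot u_3 = 0$ from the section on left unity quantities; the quantities `g`, `h` and all function names are illustrative choices, not the author's. It forms $P = (R_h)^{-1}$, $Q = (L_g)^{-1}$ and the principal isotope product $(a, x) = (aP) \cdot (xQ)$.

```python
from fractions import Fraction as F

# Multiplication table in the basis e, u2, u3 (coordinates of e_i . e_j).
E, U2, U3, ZERO = [1, 0, 0], [0, 1, 0], [0, 0, 1], [0, 0, 0]
TABLE = [[E, U2, U3], [U2, E, ZERO], [U3, ZERO, E]]
N = 3

def product(a, x):
    return [sum(a[i] * x[j] * TABLE[i][j][k] for i in range(N) for j in range(N))
            for k in range(N)]

def vecmat(v, m):
    return [sum(v[i] * m[i][j] for i in range(N)) for j in range(N)]

def right_mult(x):
    """R_x with a R_x = a.x."""
    return [product(b, x) for b in (E, U2, U3)]

def left_mult(x):
    """L_x with a L_x = x.a."""
    return [product(x, b) for b in (E, U2, U3)]

def inverse(m):
    """Gauss-Jordan inverse over the rationals; assumes m is non-singular."""
    aug = [[F(v) for v in row] + [F(int(i == j)) for j in range(N)]
           for i, row in enumerate(m)]
    for col in range(N):
        pivot = next(r for r in range(col, N) if aug[r][col] != 0)
        aug[col], aug[pivot] = aug[pivot], aug[col]
        pv = aug[col][col]
        aug[col] = [v / pv for v in aug[col]]
        for r in range(N):
            if r != col and aug[r][col] != 0:
                factor = aug[r][col]
                aug[r] = [v - factor * w for v, w in zip(aug[r], aug[col])]
    return [row[N:] for row in aug]

g = [1, 2, 0]            # e + 2 u2, left non-singular (illustrative choice)
h = [2, 0, 1]            # 2 e + u3, right non-singular (illustrative choice)
f = product(g, h)        # the prescribed unity quantity f = g.h
P = inverse(right_mult(h))
Q = inverse(left_mult(g))

def iso(a, x):
    """Product (a, x) = a P R_{xQ} = (aP).(xQ) of the principal isotope (46)."""
    return product(vecmat(a, P), vecmat(x, Q))
```

Here `iso(f, x)` and `iso(x, f)` both return `x` for every `x`, so `f` acts as the unity quantity of the isotope even though it is not the unity quantity `E` of the original algebra.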



We now prove

Theorem 8. Let $\mathfrak{A}$ and $\mathfrak{A}_0$ be principal isotopes and let each of these algebras
have a unity quantity. Then the corresponding transformation algebras $T(\mathfrak{A})$
and $T(\mathfrak{A}_0)$ are the same.

For we have (48) and hence have $P$ and $Q$ in $T(\mathfrak{A})$, $R_x^{(0)}$ and $L_x^{(0)}$ in $T(\mathfrak{A})$,
$T(\mathfrak{A}_0)$ is contained in $T(\mathfrak{A})$. The converse follows by symmetry.

13. Ideals in isotopes 

The mapping $x \to R_x$ of $\mathfrak{L}$ on $R(\mathfrak{A})$ is one-to-one if and only if $x \to R_x^{(0)} =
PR_{xQ}C$ is one-to-one. Hence $\mathfrak{A}$ contains no absolute right divisors of zero if
and only if every isotope of $\mathfrak{A}$ has this property. In particular $\mathfrak{A}$ is a zero algebra
if and only if every isotope of $\mathfrak{A}$ is a zero algebra. We combine this result with
those of Theorems 1, 5, 8 to obtain

Theorem 9. Let $\mathfrak{A}$ and $\mathfrak{A}_0$ be isotopic algebras each possessing a unity quantity.
Then $\mathfrak{A}$ is simple if and only if $\mathfrak{A}_0$ is simple.

We also have the stronger result

Theorem 10. Let $\mathfrak{A}$ and $\mathfrak{A}_0$ be principal isotopes and let each have a unity
quantity. Then a linear subspace of $\mathfrak{L}$ is an ideal of $\mathfrak{A}$ if and only if it is an ideal
of $\mathfrak{A}_0$.

This follows from Lemma 4 and from Theorem 8. That it is desirable whenever
possible to restrict our attention to algebras with a unity quantity is strongly
indicated by the remarkable

Theorem 11. Let there exist a polynomial $f(x)$ of degree $n$ over $\mathfrak{F}$ which is
irreducible in $\mathfrak{F}$. Then every algebra $\mathfrak{A}$ of order $n$ over $\mathfrak{F}$ and with a unity quantity
$e$ has a principal isotope which is simple and indeed has neither left nor right ideals.

For by Lemma 6 if we take the linear transformation $P$ of $(\mathfrak{F})_n$ such that
$f(P) = 0$ the only idempotents $E$ such that $EP = EPE$ are $0$, $I$. Define $\mathfrak{A}_0$
by (46) for this $P$ and $Q = P$. Then $R^{(0)}_{eP^{-1}} = PR_e = P$, $L^{(0)}_{eP^{-1}} = QL_e = P$ and
$ER(\mathfrak{A}_0) = ER(\mathfrak{A}_0)E$ is not possible unless $EP = EPE$, $EL(\mathfrak{A}_0) = EL(\mathfrak{A}_0)E$
is not possible unless $EP = EPE$. Hence in either case $E = 0$, $I$, the only
right and left ideals of $\mathfrak{A}_0$ are zero and $\mathfrak{A}_0$.

14. Associative algebras 

It is well known⁹ that if $\mathfrak{A}$ is an associative algebra with a unity quantity the
space $R(\mathfrak{A})$ is an algebra and the mapping $x \to R_x$ defines an equivalence of $\mathfrak{A}$
and $R(\mathfrak{A})$. Moreover $L(\mathfrak{A})$ is also an algebra and $x \to L_x$ defines a reciprocal
simple isomorphism of $\mathfrak{A}$ and $L(\mathfrak{A})$. However it is possible for $R(\mathfrak{A})$ to be an
algebra without $\mathfrak{A}$ being associative. We shall give an illustration of such an
occurrence shortly.

Let us now observe the known criterion for associativity which we state as

Lemma 11. Let $R(\mathfrak{A})$ and $L(\mathfrak{A})$ be the right and left multiplication spaces
respectively of an algebra $\mathfrak{A}$. Then $\mathfrak{A}$ is associative if and only if $RL = LR$
for every $R$ of $R(\mathfrak{A})$ and $L$ of $L(\mathfrak{A})$.

⁹ Cf. the reference in footnote 7.

The proof of this criterion is rather immediate. We write $(x \cdot a) \cdot y = x \cdot (a \cdot y)$
for every $a$, $x$, $y$ of $\mathfrak{A}$ and see that this equation is equivalent to

(49) $(aL_x)R_y = x \cdot (aR_y) = aR_yL_x$.

Thus $L_xR_y = R_yL_x$ as desired.
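
The criterion of Lemma 11 is easy to test on algebras given by structure constants. In the sketch below the two tables (the complex numbers over the rationals, and the commutative algebra with $u_2 \cdot u_2 = u_3 \cdot u_3 = e$, $u_2 \cdot u_3 = 0$) and all function names are illustrative assumptions. Since $R_x$ and $L_x$ are linear in $x$, it is enough to compare $L_xR_y$ with $R_yL_x$ for basis quantities.

```python
def product(table, a, x):
    """a.x from structure constants: table[i][j] holds the coordinates of e_i . e_j."""
    n = len(table)
    return [sum(a[i] * x[j] * table[i][j][k] for i in range(n) for j in range(n))
            for k in range(n)]

def unit(n, i):
    return [int(i == j) for j in range(n)]

def right_mult(table, x):
    """R_x on row vectors: a R_x = a.x."""
    n = len(table)
    return [product(table, unit(n, i), x) for i in range(n)]

def left_mult(table, x):
    """L_x on row vectors: a L_x = x.a."""
    n = len(table)
    return [product(table, x, unit(n, i)) for i in range(n)]

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def is_associative(table):
    """Lemma 11: associative exactly when every L_x commutes with every R_y."""
    n = len(table)
    for i in range(n):
        for j in range(n):
            lx = left_mult(table, unit(n, i))
            ry = right_mult(table, unit(n, j))
            if matmul(lx, ry) != matmul(ry, lx):
                return False
    return True

# Complex numbers with basis 1, i.
COMPLEX = [[[1, 0], [0, 1]],
           [[0, 1], [-1, 0]]]

# Basis e, u2, u3 with u2.u2 = u3.u3 = e and u2.u3 = u3.u2 = 0.
E, U2, U3, ZERO = [1, 0, 0], [0, 1, 0], [0, 0, 1], [0, 0, 0]
SYMMETRIC = [[E, U2, U3], [U2, E, ZERO], [U3, ZERO, E]]
```

`is_associative(COMPLEX)` holds while `is_associative(SYMMETRIC)` fails; in the second algebra $(u_2 \cdot u_2) \cdot u_3 = u_3$ whereas $u_2 \cdot (u_2 \cdot u_3) = 0$.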

We now derive the important 

Theorem 12. An algebra $\mathfrak{A}$ with a unity quantity is associative if and only if
every isotope with a unity quantity of $\mathfrak{A}$ is associative and equivalent to $\mathfrak{A}$.

For if $\mathfrak{A}$ is associative $R_xR_y = R_{x \cdot y}$, $L_xL_y = L_{y \cdot x}$ for every $x$ and $y$ of $\mathfrak{A}$. A
quantity $x$ of $\mathfrak{A}$ is right non-singular if and only if it has an inverse in $\mathfrak{A}$ and then
$x$ is also left non-singular. But then

(50) $R_{x^{-1}} = (R_x)^{-1}$, $L_{x^{-1}} = (L_x)^{-1}$.

Let now $\mathfrak{A}_0$ be a principal isotope of $\mathfrak{A}$ and assume that $\mathfrak{A}_0$ has a unity
quantity so that (48) holds. Then $P = R_{h^{-1}}$, $Q = L_{g^{-1}}$ and we have $xQ = xL_{g^{-1}} =
g^{-1} \cdot x$, $PR_{xQ} = R_{h^{-1}}R_{xQ} = R_{h^{-1} \cdot g^{-1} \cdot x}$. However $f^{-1} = (g \cdot h)^{-1} = h^{-1} \cdot g^{-1}$ and
we have proved that

(51) $R_x^{(0)} = R_{f^{-1} \cdot x}$.

Similarly $L_x^{(0)} = L_{g^{-1}}L_{xP} = L_{g^{-1}}L_{x \cdot h^{-1}} = L_{x \cdot h^{-1} \cdot g^{-1}}$,

(52) $L_x^{(0)} = L_{x \cdot f^{-1}}$.

It follows that $R(\mathfrak{A}_0) = R(\mathfrak{A})$, $L(\mathfrak{A}_0) = L(\mathfrak{A})$, $R_x^{(0)}L_y^{(0)} = L_y^{(0)}R_x^{(0)}$ for every $x$
and $y$ of $\mathfrak{L}$. By Lemma 11 the algebra $\mathfrak{A}_0$ is associative. Since it has a unity
quantity it is equivalent to the algebra $R(\mathfrak{A}_0) = R(\mathfrak{A})$ which is equivalent to
$\mathfrak{A}$, $\mathfrak{A}$ and $\mathfrak{A}_0$ are equivalent.

Observe that if $H = R_{f^{-1}}$ then $xH = x \cdot f^{-1}$, $HR_{xH}H^{-1} = R_{f^{-1}}R_{x \cdot f^{-1}}R_f =
R_{f^{-1} \cdot x}$. Hence

(53) $R_x^{(0)} = HR_{xH}H^{-1}$

and the principal isotopy of $\mathfrak{A}$ and $\mathfrak{A}_0$ which we are studying is induced by the
linear mapping

$x \to xH = x \cdot f^{-1}$

of $\mathfrak{A}_0$ on $\mathfrak{A}$. This map is an equivalence of $\mathfrak{A}_0$ and $\mathfrak{A}$ obtainable as the product
of the equivalence $x \to R_x^{(0)}$ of $\mathfrak{A}_0$ on $R(\mathfrak{A}_0) = R(\mathfrak{A})$, the automorphism $R_x^{(0)} =
HR_{xH}H^{-1} \to R_{xH}$ of $R(\mathfrak{A})$ and the equivalence $R_{xH} \to xH$ of $R(\mathfrak{A})$ on $\mathfrak{A}$.
Observe also that the only principal isotopy of an associative algebra with a unity
quantity $e$ which carries $e$ into the unity quantity of the isotope is that given
by $R_x^{(0)} = R_x$. For (51) holds with $f = e$, $f^{-1} \cdot x = e \cdot x = x$.



It is natural to consider at this point whether the property that $R(\mathfrak{A})$ is an
algebra implies that $\mathfrak{A}$ is an associative algebra. This is partially true in view of

Theorem 13. An algebra $\mathfrak{A}$ with a left unity quantity $e$ is associative if and
only if $R(\mathfrak{A})$ is an algebra. Moreover the mapping $x \to R_x$ then is an equivalence
of $\mathfrak{A}$ and $R(\mathfrak{A})$.

For we have $e \cdot x = eR_x = x$, the mapping $R_x \to eR_x = x$ is a one-to-one linear
mapping of $R(\mathfrak{A})$ on $\mathfrak{A}$. Now $(e \cdot x) \cdot y = x \cdot y = eR_xR_y = e \cdot (x \cdot y) = eR_{x \cdot y}$
and $R_xR_y$, $R_{x \cdot y}$ are both in $R(\mathfrak{A})$. It follows that $R_{x \cdot y} = R_xR_y$, $\mathfrak{A}$ is equivalent to $R(\mathfrak{A})$ and
is associative. The converse has already been mentioned.

We may also prove the simple generalization 

Theorem 14. Let $\mathfrak{A}$ and $R(\mathfrak{A})$ be algebras and let there be a left non-singular
quantity $f$ in $\mathfrak{A}$. Then $\mathfrak{A}$ has an associative principal isotope $\mathfrak{A}_0$ which is equivalent
to $R(\mathfrak{A})$ and has $f$ as left unity quantity.

For we define $\mathfrak{A}_0$ by $R_x^{(0)} = R_{xQ}$, $Q = (L_f)^{-1}$. Then $L_x^{(0)} = QL_x = (L_f)^{-1}L_x$
and hence $L_f^{(0)} = I$, $(f, x) = xL_f^{(0)} = x$, $\mathfrak{A}_0$ has $f$ as its left unity quantity. But
$R(\mathfrak{A}) = R(\mathfrak{A}_0)$ and our result follows from Theorem 13.

It remains to consider the general question as to the existence of non-associative
algebras $\mathfrak{A}$ such that $R(\mathfrak{A})$ is an algebra. Such algebras do exist and we
prove this as an immediate consequence of

Theorem 15. Let $\mathfrak{A}$ be an associative algebra of order $n > 1$ over an infinite
field $\mathfrak{F}$ and let $\mathfrak{A}$ have a unity quantity $e$ so that $R(\mathfrak{A})$ is an algebra. Then there
exists a non-associative isotope $\mathfrak{A}_0$ of $\mathfrak{A}$ with $R(\mathfrak{A}_0) = R(\mathfrak{A})$.

For it is known⁹ that $L(\mathfrak{A})$ is the $(\mathfrak{F})_n$-centralizer of $R(\mathfrak{A})$, $L(\mathfrak{A})$ is a proper
subalgebra of $(\mathfrak{F})_n$. Then there exists a linear transformation $U$ not in $L(\mathfrak{A})$,
$UR_a - R_aU \neq 0$ for some $a$ in $\mathfrak{A}$. Every linear transformation has the form

$U = \sum_{i=1}^{n^2} \xi_iS_i$ ($\xi_i$ in $\mathfrak{F}$)

for a basis $S_i$ of $(\mathfrak{F})_n$, and $UR_a - R_aU \neq 0$ implies that there exist $\eta_i$ in $\mathfrak{F}$ such
that $Q = \sum \eta_iS_i$ is non-singular, $QR_a \neq R_aQ$. We define $\mathfrak{A}_0$ by $R_x^{(0)} = R_{xQ}$ and
have $R(\mathfrak{A}_0) = R(\mathfrak{A})$, $L_x^{(0)} = QL_x$, $L_e^{(0)} = Q$ is not commutative with $R^{(0)}_{aQ^{-1}} = R_a$,
$\mathfrak{A}_0$ is not associative.

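It is important to observe that there exist associative algebras (without unity
quantities) which are isotopic but not equivalent. For example consider the
nilpotent algebra $\mathfrak{A}$ with basis $e_1, e_2, e_3$ such that $e_1 \cdot e_2 = -e_2 \cdot e_1 = e_3$ and all
other products are zero. Then we write $a = (\alpha_1, \alpha_2, \alpha_3)$ for $a =
\alpha_1e_1 + \alpha_2e_2 + \alpha_3e_3$ and have $a \cdot x = a \cdot (\xi_1, \xi_2, \xi_3) = (0, 0, \alpha_1\xi_2 - \alpha_2\xi_1) = a\Gamma_x$,
$x \cdot a = (0, 0, \xi_1\alpha_2 - \xi_2\alpha_1) = a\Lambda_x$, where

(54) $\Gamma_x = \begin{pmatrix} 0 & 0 & \xi_2 \\ 0 & 0 & -\xi_1 \\ 0 & 0 & 0 \end{pmatrix}, \quad \Lambda_x = -\Gamma_x.$

This algebra is associative since

(55) $\Gamma_x\Lambda_y = 0 = \Lambda_y\Gamma_x$.

We let

(56) $A =$ [matrix not recoverable from the source]

and define

(57) $\Gamma_x^{(0)} = A\Gamma_x$, $\Lambda_x^{(0)} = \Lambda_{xA}$.

Then $\Gamma_x^{(0)}\Lambda_y^{(0)} = \Lambda_y^{(0)}\Gamma_x^{(0)} = 0$ as before. But we have then defined an isotope
$\mathfrak{A}_0$ of $\mathfrak{A}$ with $(e_1, e_1) = e_1\Gamma^{(0)}_{e_1} = e_3$. It is not equivalent to $\mathfrak{A}$ since in $\mathfrak{A}$ the
square of every $a$ is $a\Gamma_a = (0, 0, \alpha_1\alpha_2 - \alpha_2\alpha_1) = 0$.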
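
A small computation makes the example tangible. The matrix `A` below is a hypothetical choice, picked only so that $e_1A\Gamma_{e_1} = e_3$ and so that the isotope stays associative; it is not claimed to be the matrix used by the author, and the function names are likewise illustrative.

```python
# Hypothetical non-singular matrix A (an assumption; any A with first row
# (*, -1, *) and third row (0, 0, c), c != 0, would serve the same purpose).
A = [[0, -1, 0],
     [1, 0, 0],
     [0, 0, 1]]

BASIS = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]
E1, E2, E3 = BASIS

def nil(a, x):
    """Product of the nilpotent algebra: e1.e2 = -e2.e1 = e3, all others zero."""
    return [0, 0, a[0] * x[1] - a[1] * x[0]]

def vecmat(v, m):
    """Row vector times matrix."""
    return [sum(v[i] * m[i][j] for i in range(3)) for j in range(3)]

def iso(a, x):
    """Isotope product (a, x) = a A Gamma_x = (aA).x."""
    return nil(vecmat(a, A), x)

def is_associative(mult):
    """Check (a.b).c = a.(b.c) on all triples of basis quantities."""
    return all(mult(mult(a, b), c) == mult(a, mult(b, c))
               for a in BASIS for b in BASIS for c in BASIS)
```

With this choice `iso(E1, E1)` equals `E3`, while `nil(a, a)` vanishes for every `a`; both products are associative, so the two algebras are isotopic and associative but cannot be equivalent.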


15. Isotopes with a prescribed unity quantity 

Let $\mathfrak{N}$ be a linear subspace of order $n$ over $\mathfrak{F}$ of $(\mathfrak{F})_n$ and $I$ be in $\mathfrak{N}$. Then we
have seen that if $f$ is any non-zero vector of $\mathfrak{L}$ and the linear mapping

(58) $R \to fR$

of $\mathfrak{N}$ on $\mathfrak{L}$ is one-to-one the algebra $\mathfrak{A}$ with $R(\mathfrak{A}) = \mathfrak{N}$ and which is defined by

(59) $fS \cdot fR = fSR$ ($S$ in $(\mathfrak{F})_n$, $R$ in $\mathfrak{N}$)

has $f$ as left unity quantity, $L_f = I$. But also if $fR = x$ we have $R = R_x$.
Since $\mathfrak{N}$ contains $I$ we have $fI = f$, $I = R_f$, $f$ is the unity quantity of $\mathfrak{A}$.

Conversely if $\mathfrak{A}$ has $f$ as its unity quantity we have $f \cdot x = fR_x = x$ and the
linear mapping $R_x \to fR_x$ is one-to-one and is such that $I = R_f$ is in $\mathfrak{N} = R(\mathfrak{A})$.

Let us now assume that $\mathfrak{A}$ is a prescribed algebra with a unity quantity $e$ so
that $R(\mathfrak{A})$ contains $I$. We now let $P$ and $C$ be non-singular linear
transformations and $G$ in $R(\mathfrak{A})$ have the property that

(60) $PGC = I$.

Then $\mathfrak{N} = PR(\mathfrak{A})C$ contains $I$ and if we let $f$ be any vector such that the linear
mapping

$N \to fN$

of $\mathfrak{N}$ on $\mathfrak{L}$ is one-to-one we define an algebra $\mathfrak{A}_0$ with $R(\mathfrak{A}_0) = \mathfrak{N}$ by $fS \cdot fN = fSN$
for every $S$ in $(\mathfrak{F})_n$ and $N$ in $\mathfrak{N}$ and have seen that $f$ is its unity quantity. It
is then desirable to have

Theorem 16. The algebra $\mathfrak{A}_0$ with unity quantity $f$ defined above is an isotope
of $\mathfrak{A}$ with $f$ as unity quantity.

For $fN = x$ implies that $N = R_x^{(0)} = PR_zC$. Write $g = fP$ and have $x =
gR_zC = (g \cdot z)C = zL_gC$. If $L_g$ is singular so is $L_gC$, that is the mapping of $N$
on $fN = x$ is singular. It follows that $N \to fN$ is non-singular if and only if
$g = fP$ is a left non-singular quantity of $\mathfrak{A}$. We put $Q = C^{-1}(L_g)^{-1}$ and have
$z = xQ$, $R_x^{(0)} = PR_{xQ}C$ as desired.

16. Commutative isotopes 

An algebra $\mathfrak{A}$ is commutative if and only if $a \cdot x = aR_x = x \cdot a = aL_x$, that is,

(61) $R_x = L_x$

for every $x$ of $\mathfrak{A}$. Let $\mathfrak{A}_0$ be a principal isotope of an algebra $\mathfrak{A}$ (which may or
may not be commutative) so that $\mathfrak{A}_0$ is commutative if and only if $PR_{xQ} =
QL_{xP}$ for every $x$. Put $y = xP$ and have $xQ = yP^{-1}Q = yS$ where $S = P^{-1}Q$.
Thus $\mathfrak{A}_0$ is commutative if and only if

(62) $R_{yS} = SL_y$

for every $y$ of $\mathfrak{A}$.

Suppose now that $\mathfrak{A}$ has a left unity quantity $e$, that is, $e \cdot x = x$ for every
$x$ of $\mathfrak{A}$, $L_e = I$. Then (62) with $y = e$ implies that $S = R_f$ for $f = eS$. Thus $f$ is a right non-singular
quantity of $\mathfrak{A}$. Moreover $yS = yR_f = y \cdot f$ and (62) is equivalent to $x \cdot (yS) =
xR_fL_y = y \cdot (x \cdot f)$, that is, to

(63) $x \cdot (y \cdot f) = y \cdot (x \cdot f)$

for every $x$ and $y$ of $\mathfrak{A}$.

Conversely let $P$ be any non-singular linear transformation, $f$ be any right
non-singular quantity of $\mathfrak{A}$ and put $Q = PR_f$ so that $S = R_f = P^{-1}Q$ and (63)
implies (62). We have proved

Theorem 17. Let $\mathfrak{A}$ be an algebra with a left unity quantity, $f$ range over all
right non-singular quantities of $\mathfrak{A}$ such that (63) holds for every $x$ and $y$ of $\mathfrak{A}$.
Then the principal isotopes of $\mathfrak{A}$ defined by $R_x^{(0)} = PR_{xPR_f}$, for $P$ any non-singular
quantity of $(\mathfrak{F})_n$, are commutative algebras to one of which every commutative
isotope of $\mathfrak{A}$ is equivalent.


17. Division algebras 

An algebra $\mathfrak{A}$ is called a division algebra if it has no (right and hence no left)
divisors of zero. Then every non-zero quantity of $\mathfrak{A}$ is both left and right
non-singular and we apply Theorem 7 with $g = h \neq 0$ to obtain as an
immediate consequence¹⁰

Theorem 18. Every division algebra is isotopic to a division algebra with a
unity quantity.

¹⁰ This result was also obtained by Steenrod. His proof was necessarily more
complicated as he did not have Theorems 5 and 7.

Non-associative division algebras do not have many of the properties of
associative division algebras. In particular the right minimum function of a
quantity of a division algebra may be reducible. Let us note now that a division
algebra $\mathfrak{A}$ has no right ideals other than $\mathfrak{A}$ or zero. For otherwise we would
have $b \cdot x$ in a right ideal $\mathfrak{B}$ for every $b$ of $\mathfrak{B}$ and $x$ of $\mathfrak{A}$ whereas $b \cdot x = a$ has
the solution $x = a(L_b)^{-1}$ for every $a$ of $\mathfrak{A}$. Similarly $\mathfrak{A}$ has no left ideals other
than $\mathfrak{A}$ or zero.

If $\mathfrak{B}$ is a division subalgebra of an algebra $\mathfrak{A}$ and the unity quantity of $\mathfrak{A}$
is in $\mathfrak{B}$ we may prove that if $u$ is any quantity in $\mathfrak{A}$ and not in $\mathfrak{B}$ the linear
spaces $\mathfrak{B}$, $u\mathfrak{B}$ are supplementary in their sum. For otherwise $u \cdot b = b_0$ for
non-zero $b$ and $b_0$ in $\mathfrak{B}$, $u = b_0(R_b)^{-1}$. But the equation $x \cdot b = b_0$ has the solution
$x = b_0(R_b)^{-1}$ in the division algebra $\mathfrak{B}$ and $u$ is in $\mathfrak{B}$, a contradiction.

The process above is used for associative algebras to prove the theorem that
the order of $\mathfrak{B}$ divides the order of $\mathfrak{A}$ under the hypothesis just stated. The
usual proof of this result requires that if $u_1\mathfrak{B} + \cdots + u_r\mathfrak{B} = \mathfrak{B}_r$ is a
supplementary sum of linear spaces not containing $u$ then so is the sum of $\mathfrak{B}_r$ and $u\mathfrak{B}$.
Thus in particular we need to show that if $u$ is not in $v\mathfrak{B}$ no non-zero quantity
of $u\mathfrak{B}$ is in $v\mathfrak{B}$. But if $u \cdot b = v \cdot b_0$ then $u = (v \cdot b_0)(R_b)^{-1} = vR_{b_0}(R_b)^{-1}$. However
we cannot conclude that this latter quantity is in $v\mathfrak{B}$ and thus our proof breaks
down. We leave the question as to the validity of this theorem as an unsolved
problem.

If $\mathfrak{A}$ is a division algebra every $R_x$ defined for $x \neq 0$ is non-singular and
$fR_x = 0$ if and only if $f = 0$. Thus the mapping $R \to fR$ of $R(\mathfrak{A})$ on $\mathfrak{A}$ is
non-singular for every $f \neq 0$. Moreover so is the mapping $PRC \to fPRC$ for every
non-singular $P$ and $C$. By Theorem 16 we have

Theorem 19. Let $f$ be any non-zero quantity of a division algebra $\mathfrak{A}$ and $P$
and $Q$ be any non-singular linear transformations such that $PRQ = I$ for some
$R$ of $R(\mathfrak{A})$. Then the algebra $\mathfrak{A}_0$ defined by $(a, fS) = aS$ for every $S$ of $PR(\mathfrak{A})Q$
is an isotope of $\mathfrak{A}$ with $f$ as unity quantity.

If (j>(X) is the right minimum function of a quantity b in a division algebra 
21 with a unity quantity e and <£(X) is reducible and of degree t > 1 it cannot 
have a linear factor. For otherwise = ^(X)[X — a] and </>*(b) = e0(72&) = 
[e}l/(R b )](Rb — otl) = ^*(6) ■ (b — ae) = 0 which is impossible since ^*(6) 0, 

b — ae 0. However it is possible that 0(X) = ^(X)(X 2 + a\ + /}) since then 
<fo(b) = [e\l/(R h )](R\ — ocRb +-&I) W'WMfc 2 — aib + Pe) since in general 
72? t* Rbt . It then becomes of interest to ask whether or not any quantity of 
a division algebra has irreducible right minimum function. It is not easy to 
answer this but we may prove instead 

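Theorem 20. Let $\mathfrak{A}$ be a division algebra with a unity quantity $e$ over $\mathfrak{F}$ and
let $b$ in $\mathfrak{A}$ be not in $e\mathfrak{F}$. Then there exists an isotope $\mathfrak{A}_0$ of $\mathfrak{A}$ such that $\mathfrak{A}_0$ has a
unity quantity, $R(\mathfrak{A}_0) = R(\mathfrak{A})$, the right minimum function of $b$ in $\mathfrak{A}_0$ is irreducible.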
For it suffices to assume that the right minimum function $\phi(\lambda)$ of $b$ is reducible,
$\phi(\lambda) = \pi(\lambda)\psi(\lambda)$ where $\psi(\lambda)$ is irreducible and has degree $t > 1$. Then $\phi_R(b) =
e\phi(R_b) = e\pi(R_b)\cdot\psi(R_b) = f\psi(R_b) = 0$ where $f = e\pi(R_b) = \pi_R(b) \neq 0$. We pass
to the isotope $\mathfrak{A}_0$ defined as in Theorem 19 for $P = Q = I$, $R(\mathfrak{A}_0) = R(\mathfrak{A})$ and
have $f$ as the unity quantity of $\mathfrak{A}_0$. Then $f\psi(R_b) = \psi_R(b) = 0$ in $\mathfrak{A}_0$. By
Theorem 2 the right minimum function of $b$ divides $\psi(\lambda)$ and must coincide
with this irreducible polynomial.

The result just obtained implies that every division algebra of order $n > 1$
over the field $\mathfrak{R}'$ of all real numbers is isotopic to a division algebra with a
unity quantity $e$ and containing a quantity $b$ such that $b^2 = -e$. Moreover
it is clear that every division algebra $\mathfrak{A}$ of order $n > 2$ over $\mathfrak{R}'$ is central simple.
For otherwise we could write $\mathfrak{A}$ as a division algebra over its center $\mathfrak{C} \neq \mathfrak{R}'$,
$\mathfrak{C}$ must be $\mathfrak{R}'(i)$ for $i^2 = -1$, $\mathfrak{A}$ over $\mathfrak{C}$ has an isotope $\mathfrak{A}_0$ over $\mathfrak{C}$ such that $b$
in $\mathfrak{A}_0$ has $\lambda^2 + 1$ as (right) minimum function. But $\lambda^2 + 1 = (\lambda + i)(\lambda - i)$
in $\mathfrak{C}$ contrary to the proof above.

18. Subalgebras of isotopes 

The problem of finding in a division algebra a quantity whose right minimum 
function is irreducible is an instance of the problem of determining whether an 
algebra 21 has a certain type of subalgebra. In particular we may ask whether 
or not a given algebra 21 has any proper subalgebras. A criterion that this 
be the case was given in Lemma 2 and we wish now to propose the question 
as to whether a principal isotope of 21 has subalgebras of the same order as 
those of $\mathfrak{A}$. By Lemma 4 we have that $\mathfrak{B} = \mathfrak{A}E$ is a subalgebra of $\mathfrak{A}$ whose order
is the rank of $E \neq 0$ if and only if $ER_y = ER_yE$, $EL_y = EL_yE$ for $y = xE$ and
every $x$ of $\mathfrak{A}$. Now $R_x^{(0)} = PR_{xQ}$, $L_x^{(0)} = QL_{xP}$ and $\mathfrak{A}E_0$ is a subalgebra of $\mathfrak{A}_0$
if and only if $E_0R_z^{(0)} = E_0R_z^{(0)}E_0$, $E_0L_z^{(0)} = E_0L_z^{(0)}E_0$ for every $x$ of $\mathfrak{A}$ where $z = xE_0$.

The problem just proposed does not appear to have a simple solution for
arbitrary algebras. However we should observe that if $\mathfrak{A}_0$ has a unity quantity
then $P = (R_g)^{-1}$, $Q = (L_h)^{-1}$ and if $g$ and $h$ are in $\mathfrak{B}$ the linear space $\mathfrak{B}$ is a
subalgebra of $\mathfrak{A}_0$ as well as of $\mathfrak{A}$. For $P$ and $Q$ are in $T(\mathfrak{B}, \mathfrak{A}) = ET(\mathfrak{B}, \mathfrak{A})E$
and $ER_y^{(0)} = EPR_{yQ} = EPER_{yQ} = EPER_{yQ}E = ER_y^{(0)}E$ since $yQ = xEQ =
xEQE = yQE$ is in $\mathfrak{B}$. Similarly $EL_y^{(0)} = EL_y^{(0)}E$. We shall not study the
general question further except to note that if $E_0$ has the same rank as $E$ it
has the form $H^{-1}EH$ and it may be seen that $\mathfrak{B}_0 = \mathfrak{A}E_0$ is a subalgebra of $\mathfrak{A}_0$
if and only if $\mathfrak{B}$ is a subalgebra of the isotope $\mathfrak{A}_1$ defined by $R_x^{(1)} = HR_{xH}^{(0)}H^{-1}$
and equivalent to $\mathfrak{A}_0$.


19. Special properties 

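An algebra $\mathfrak{A}$ is said to be alternative if $(a\cdot x)\cdot x = a\cdot(x\cdot x)$, $x\cdot(x\cdot a) = (x\cdot x)\cdot a$
for every $x$ and $a$ of $\mathfrak{A}$. Then $\mathfrak{A}$ is alternative if and only if

(64)    $R_{x^2} = (R_x)^2, \qquad L_{x^2} = (L_x)^2$

for every $x$ of $\mathfrak{A}$.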

It follows from (64) that $x\cdot x^2 = x(R_x)^2 = (xR_x)R_x = x^2\cdot x$. Suppose then
that $R_{x^k} = (R_x)^k$ for all right powers $k = 1, 2, \ldots, t$ and that $x\cdot x^k = x^k\cdot x$
for $k = 1, \ldots, t$. Then we put $y = x + x^t$ and have $y^2 = x^2 + (x^t)^2 + x\cdot x^t +
x^t\cdot x = x^2 + (x^t)^2 + 2x^{t+1}$. But $R_y = R_x + R_{x^t} = R_x + (R_x)^t$, $(R_y)^2 =
(R_x)^2 + 2(R_x)^{t+1} + (R_x)^{2t} = R_{x^2} + R_{x^t\cdot x^t} + 2(R_x)^{t+1}$. It follows that $2R_{x^{t+1}} =
2(R_x)^{t+1}$ and that $(R_x)^{t+1} = R_{x^{t+1}}$ if the characteristic of $\mathfrak{F}$ is not two. But
then $x\cdot x^{t+1} = x(R_x)^{t+1} = (xR_{x^t})R_x = x^{t+1}\cdot x$. This completes our induction
and proves that $R_{x^k} = (R_x)^k$ for every $k$.
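The identities (64) and the power rule $R_{x^k} = (R_x)^k$ may be checked numerically in a familiar alternative algebra. The following is only a sketch: it assumes the Cayley-Dickson doubling rule $(a, b)(c, d) = (ac - \bar{d}b,\ da + b\bar{c})$ over the integers, which is a standard convention and is not taken from this paper, and every function name in it is hypothetical.

```python
# Cayley-Dickson doubling on flat integer lists of length 1, 2, 4, 8.
# Assumption (not from the paper): (a, b)(c, d) = (a c - conj(d) b, d a + b conj(c)).

def conj(x):
    if len(x) == 1:
        return list(x)
    h = len(x) // 2
    return conj(x[:h]) + [-t for t in x[h:]]

def add(x, y):
    return [s + t for s, t in zip(x, y)]

def sub(x, y):
    return [s - t for s, t in zip(x, y)]

def mul(x, y):
    if len(x) == 1:
        return [x[0] * y[0]]
    h = len(x) // 2
    a, b, c, d = x[:h], x[h:], y[:h], y[h:]
    return sub(mul(a, c), mul(conj(d), b)) + add(mul(d, a), mul(b, conj(c)))

def right_power(x, k):
    # Right powers: x^1 = x, x^(k+1) = x^k . x
    p = x
    for _ in range(k - 1):
        p = mul(p, x)
    return p

def apply_R(a, x, k):
    # a (R_x)^k, that is, k successive right multiplications by x
    for _ in range(k):
        a = mul(a, x)
    return a

def unit(i):
    return [1 if j == i else 0 for j in range(8)]

a = [1, -2, 0, 3, 1, 1, -1, 2]
x = [2, 1, -1, 0, 3, -2, 1, 1]
x2 = mul(x, x)

alt_right = mul(mul(a, x), x) == mul(a, x2)   # a (R_x)^2 = a R_{x^2}
alt_left = mul(x, mul(x, a)) == mul(x2, a)    # a (L_x)^2 = a L_{x^2}
powers_ok = all(apply_R(a, x, k) == mul(a, right_power(x, k)) for k in range(1, 5))

e1, e2, e4 = unit(1), unit(2), unit(4)
nonassociative = mul(mul(e1, e2), e4) != mul(e1, mul(e2, e4))
```

The eight-dimensional algebra obtained in this way satisfies (64) although it is not associative, so that the hypothesis of alternativity is genuinely weaker than associativity.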

We see that consequently $x^s\cdot x^t = x(R_x)^{s+t-1} = x^{s+t}$ so that all powers of $x$
are right powers, $(x^s\cdot x^t)\cdot x^k = x^{s+t+k} = x^s\cdot(x^t\cdot x^k)$. It follows that the algebra
$\mathfrak{F}[x]$ of all right polynomials $\phi_R(x)$ is the associative algebra of all polynomials
$\phi(x) = \phi_R(x) = \phi_L(x)$ and $R_{\phi(x)} = \phi(R_x)$, $L_{\phi(x)} = \phi(L_x)$.

We now propose the problem of determining the principal isotopes of an
algebra $\mathfrak{A}$ which are alternative. This occurs if and only if $(R_x^{(0)})^2 = R_z^{(0)}$,
$(L_x^{(0)})^2 = L_z^{(0)}$ where $z = xR_x^{(0)} = xL_x^{(0)}$. But then we must have

$R_{xQ}PR_{xQ} = R_{zQ}, \qquad L_{xP}QL_{xP} = L_{zP}.$

Replace $xQ$ by $x$ and thus $zQ = xQL_{xP}Q$ by $(xQ^{-1}P\cdot x)Q$ and similarly replace
$xP$ by $x$ and thus $zP = xPR_{xQ}P$ by $(xP^{-1}Q\cdot x)P$. Then we see that $\mathfrak{A}_0$ is
alternative if and only if

$R_xPR_x = R_u, \qquad L_xQL_x = L_v,$

for every $x$ of $\mathfrak{A}$, where

$u = (xQ^{-1}P\cdot x)Q, \qquad v = (xP^{-1}Q\cdot x)P,$

and the indicated products are those in $\mathfrak{A}$. It is of particular interest, of course,
to study the case where we assume also that $\mathfrak{A}$ is alternative.

An algebra $\mathfrak{A}$ is called a Lie algebra if $a\cdot x = -x\cdot a$, $a\cdot(x\cdot y) + y\cdot(a\cdot x) +
x\cdot(y\cdot a) = 0$ for every $a$, $x$, $y$ of $\mathfrak{A}$. Then $L_x = -R_x$, $aR_{x\cdot y} + aR_xL_y + aL_yL_x = 0$,
$a[R_{x\cdot y} - (R_xR_y - R_yR_x)] = 0$, $\mathfrak{A}$ is a Lie algebra if and only if

(65)    $L_x = -R_x, \qquad R_{x\cdot y} = R_xR_y - R_yR_x \qquad$ ($x$, $y$ in $\mathfrak{A}$).
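The two conditions of (65) may be verified in a concrete Lie algebra. The sketch below assumes, as an illustration not taken from the paper, the space of real three-dimensional vectors with the vector (cross) product as $a\cdot x$; the integer vectors and all names are hypothetical.

```python
# Three-dimensional vectors with the cross product as the algebra product a . x.

def cross(a, x):
    return (a[1] * x[2] - a[2] * x[1],
            a[2] * x[0] - a[0] * x[2],
            a[0] * x[1] - a[1] * x[0])

def add(a, x):
    return tuple(s + t for s, t in zip(a, x))

def sub(a, x):
    return tuple(s - t for s, t in zip(a, x))

def neg(a):
    return tuple(-s for s in a)

a, x, y = (1, 2, 3), (-1, 0, 4), (2, 5, -3)

# a . x = -(x . a), that is, L_x = -R_x
anticommutative = cross(a, x) == neg(cross(x, a))

# a.(x.y) + y.(a.x) + x.(y.a) = 0
jacobi = add(add(cross(a, cross(x, y)), cross(y, cross(a, x))),
             cross(x, cross(y, a))) == (0, 0, 0)

# a R_{x.y} = a (R_x R_y - R_y R_x), with transformations acting on the right
lhs = cross(a, cross(x, y))
rhs = sub(cross(cross(a, x), y), cross(cross(a, y), x))
derivation_form = lhs == rhs
```

The last comparison is the second relation of (65) applied to the quantity $a$: right multiplication by a product is the commutator of the right multiplications.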

We propose again the question as to whether a principal isotope of an algebra
$\mathfrak{A}$ is a Lie algebra and see that this occurs if and only if

$PR_{xQ} = -QL_{xP}, \qquad PR_{xQ}PR_{yQ} - PR_{yQ}PR_{xQ} = PR_z$

where $z = (x, y)Q = yQL_{xP}Q$. Replace $xQ$ by $x$, $yQ$ by $y$ and thus $xP$ by $xC$
for $C = Q^{-1}P$, $z$ by $yL_{xC}Q = (xC\cdot y)Q$. Then $\mathfrak{A}_0$ is a Lie algebra if and only if

(66)    $L_{xC} = -CR_x, \qquad R_xPR_y - R_yPR_x = R_{(xC\cdot y)Q}.$

The problem of determining the principal Lie isotopes of simple Lie algebras
is being studied.¹¹


11 This problem is the topic of study of a doctoral dissertation at the University of 
Chicago. We also wish to mention here that Mr. W. Carter in his Master’s dissertation 
has classified all real division algebras of order four and degree two into classes of algebras 





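Let us conclude these remarks with some observations which will be important
for the study of simple algebras. We define the center¹² of any algebra
$\mathfrak{A}$ over $\mathfrak{F}$ to be the set $\mathfrak{Z}$ of all quantities $z$ of $\mathfrak{A}$ such that $R_z = L_z$ is commutative
with every $R_x$ of $R(\mathfrak{A})$ and $L_x$ of $L(\mathfrak{A})$. Then $z$ is in $\mathfrak{Z}$ if and only if

$z\cdot a = a\cdot z, \quad z\cdot(a\cdot x) = (z\cdot a)\cdot x, \quad a\cdot(z\cdot x) = (a\cdot z)\cdot x, \quad (a\cdot x)\cdot z = a\cdot(x\cdot z)$

for every $a$ and $x$ of $\mathfrak{A}$. It is easily shown that $\mathfrak{Z}$ is zero or an associative
subalgebra of $\mathfrak{A}$. Moreover if $\mathfrak{A}$ has a unity quantity $e$ the set $\mathfrak{Y} = e\mathfrak{F}$ of all $\alpha e$
for $\alpha$ in $\mathfrak{F}$ is a subalgebra of order one over $\mathfrak{F}$ of $\mathfrak{Z}$, $\mathfrak{Y}$ is equivalent to $\mathfrak{F}$ and is
a field.¹³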

Let us define a new operation of scalar product on $\mathfrak{A}\mathfrak{Y}$ to $\mathfrak{A}$ by writing $(a, y) =
a\cdot y = a\cdot\alpha e = \alpha a$ for every $a$ of $\mathfrak{A}$ and $\alpha$ of $\mathfrak{F}$. Then $\mathfrak{A}$ is an algebra over $\mathfrak{Y}$
with respect to this operation. It is clear that this is a change in our
representation of $\mathfrak{A}$ as a linear space over a field and is not a change in $\mathfrak{A}$.

If $\mathfrak{A}$ is a simple algebra of order $n$ over $\mathfrak{F}$ we have seen that $\mathfrak{A}$ is a central
simple algebra of order $s$ over its transformation center $\mathfrak{C}$, $\mathfrak{C}$ is a field of degree
$t$ over $\mathfrak{F}$, $n = st$. Let $\mathfrak{A}$ have $e$ as its unity quantity so that, as above, $e\mathfrak{C}$ is
a subalgebra over $\mathfrak{C}$ of $\mathfrak{A}$ and is equivalent to $\mathfrak{C}$. But then $e\mathfrak{C}$ is a field of degree
$t$ over $e\mathfrak{F}$, $e\mathfrak{C}$ is a subalgebra of $\mathfrak{A}$ of order $t$ over $\mathfrak{F}$. Moreover it is easy to verify
that $e\mathfrak{C}$ is contained in the center $\mathfrak{Z}$ of $\mathfrak{A}$. But the quantities of $\mathfrak{Z}$ are quantities
$z$ such that $R_z$ is in $\mathfrak{C}$, $e\cdot z = eR_z = z$ is in $e\mathfrak{C}$. This proves that $e\mathfrak{C}$ is the center
of every simple algebra $\mathfrak{A}$ with $e$ as unity quantity and $\mathfrak{C}$ as its transformation
center. If we express $\mathfrak{A}$ as an algebra over $\mathfrak{C}$ it is central simple and consequently
we may express $\mathfrak{A}$ as a central simple algebra over the associative subalgebra
of $\mathfrak{A}$ which is its center $\mathfrak{Z}$.

It is desirable to note that if $\mathfrak{A}$ and $\mathfrak{A}_0$ are principal isotopes we have $T(\mathfrak{A}) =
T(\mathfrak{A}_0)$ and hence $\mathfrak{C} = \mathfrak{C}(\mathfrak{A}) = \mathfrak{C}(\mathfrak{A}_0)$, $\mathfrak{A}$ and $\mathfrak{A}_0$ have the same transformation
center. If $e$ and $e_0$ are corresponding unity quantities we have $e\mathfrak{C}$ equivalent
over $\mathfrak{F}$ to $e_0\mathfrak{C}$ and thus we have shown that isotopic simple algebras with unity
quantities have equivalent centers. It is important to observe that, while the
center of an associative simple algebra $\mathfrak{A}$ is its $\mathfrak{A}$-centralizer, this may not be
the case when $\mathfrak{A}$ is not associative.

University of Chicago 


with respect to isotopy. He has also shown that every real division algebra of order four is 
isotopic to an algebra of degree four , that is, containing a quantity whose right minimum 
function has degree four and is thus reducible. Moreover there exist real division algebras 
of degree and order four not isotopic to algebras of degree two. 

12 We have now used the terms center instead of centrum and central in place of normal . 
A change of terminology of this kind has long seemed very desirable to many algebraists. 

13 In particular $\mathfrak{A}$ may be commutative and may yet be a central simple algebra. For
example the set of all $r$-rowed real symmetric square matrices forms a commutative central
simple algebra with respect to the product operation $a\cdot x = \tfrac{1}{2}(ax + xa)$, $ax$ and $xa$ the
ordinary matrix products.
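The product of this footnote may be tried on two-rowed symmetric matrices. The sketch below uses exact rational arithmetic and hypothetical sample matrices; it illustrates only that the product is commutative, has the identity matrix as unity quantity, and is not associative, and it does not attempt to verify simplicity.

```python
from fractions import Fraction

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def jordan(a, x):
    # a . x = (ax + xa) / 2, with ax and xa the ordinary matrix products
    ax, xa = matmul(a, x), matmul(x, a)
    n = len(a)
    return [[Fraction(ax[i][j] + xa[i][j], 2) for j in range(n)] for i in range(n)]

e = [[1, 0], [0, 1]]
a = [[1, 0], [0, 0]]
x = [[0, 1], [1, 0]]

commutative = jordan(a, x) == jordan(x, a)
unity = jordan(e, x) == x and jordan(x, e) == x
p = jordan(a, x)
stays_symmetric = p[0][1] == p[1][0]
# (a . a) . x and a . (a . x) differ, so the product is not associative
nonassociative = jordan(jordan(a, a), x) != jordan(a, jordan(a, x))
```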



Annals of Mathematics
Vol. 43, No. 4, October, 1942 


NON-ASSOCIATIVE ALGEBRAS 
II. New Simple Algebras 1 

By A. A. Albert 
(Received January 30, 1942) 

1. Introduction 

In the second part of our study of non-associative algebras we shall give an
iterative construction of new simple algebras with a unity quantity. All
previous constructions² of this type have used groups of automorphisms or
anti-automorphisms and the great generality of our definition will lie precisely
in that we shall be able to use instead almost³ arbitrary multiplicative groups
of non-singular linear transformations.

We shall begin our exposition with a preliminary discussion of (non-associative)
separable algebras, that is algebras $\mathfrak{A}$ with a unity quantity $e$ such that
every scalar extension of $\mathfrak{A}$ is a direct sum of simple algebras. Let $\mathfrak{G}$ be any
finite multiplicative group of non-singular linear transformations $S$ on $\mathfrak{A}$ such
that $eS = e$ and $\mathfrak{H}$ be any subset of $\mathfrak{G}$ containing the identity transformation.
We define an extension set $\mathfrak{g}$ to be a set of non-singular quantities $g_{S,T}$ in $\mathfrak{A}$ for
every $S$ and $T$ in $\mathfrak{G}$. Then we shall construct a corresponding crossed extension
$\mathfrak{E} = (\mathfrak{A}, \mathfrak{G}, \mathfrak{g})$ which is a certain algebra having $e$ as its unity quantity.

For every separable $\mathfrak{A}$ we shall give conditions that the crossed extensions
shall be simple (or central simple) algebras. If $\mathfrak{H}$ is the identity group $\mathfrak{E}$ is
simple whenever $\mathfrak{A}$ is, $\mathfrak{E}$ is central simple whenever $\mathfrak{A}$ is. These latter algebras
include the so-called Cayley algebras. Our algebras are associative only when
$\mathfrak{G} = \mathfrak{H}$ is a group of automorphisms and our definition then includes that of
crossed products.⁴

The crossed products are associative central simple algebras of order $r^2$,
and for each such algebra we may use an explicit process to give a set of
corresponding central simple algebras of order $r^t$ for any integer $t > 1$. These
algebras are not associative for $t > 2$. In particular we then have generalized
cyclic algebras. Another explicit construction will connect every central simple


1 Presented to the Society February 28, 1942. 

2 Automorphisms were necessarily used in the constructions of associative algebras of
L. E. Dickson for which see my Structure of Algebras, Chapter V, Chapter XI, pp. 182-8,
and bibliographical references [141], [145]. Cayley algebras were generalized in my
"Quadratic forms permitting composition," these Annals, vol. 41, pp. 161-77, to algebras of order
$2^t$ obtained by the process given here where the group consists of the identity automorphism
and an antiautomorphism of order two.

3 The special restrictions will reduce to the property that the transformations leave the
unity quantity unaltered in the most important cases.

4 For these algebras and the Cayley algebras see the references in footnote 2. 



algebra of order $n$ with a crossed extension of it by a group $\mathfrak{G}$ equivalent to any
permutation group on $m$ letters.

We shall close our discussion with a list of fundamental unsolved problems 
in the theory of these new algebras. 

2. Decomposition of algebras with a unity quantity 

An algebra $\mathfrak{A}$ is said to be decomposable⁵ if it is expressible as the supplementary
sum $\mathfrak{A}_1 + \cdots + \mathfrak{A}_r$, of at least two subalgebras $\mathfrak{A}_i$ of $\mathfrak{A}$, such that $a_i\cdot a_j = 0$
for $a_i$ in $\mathfrak{A}_i$, $a_j$ in $\mathfrak{A}_j$ and all $i \neq j$. Then we say that $\mathfrak{A}$ decomposes into the
direct sum of its components $\mathfrak{A}_i$ (which are ideals of $\mathfrak{A}$), we write

(1)    $\mathfrak{A} = \mathfrak{A}_1 \oplus \cdots \oplus \mathfrak{A}_r,$

and we call (1) a decomposition of $\mathfrak{A}$. If $\mathfrak{A}$ has no such decomposition we say
that $\mathfrak{A}$ is indecomposable. It is clear that a decomposition with $r$ components
becomes one with $r + 1$ components if we replace a decomposable component
$\mathfrak{A}_i = \mathfrak{B} \oplus \mathfrak{C}$ by $\mathfrak{B} \oplus \mathfrak{C}$ in (1). The lowering of orders in such decomposition
implies that every $\mathfrak{A}$ has a decomposition (1) with the components indecomposable
algebras. It is then natural to ask (as in the theory of associative
algebras) whether or not such a decomposition is unique apart from the ordering
of the components. We shall give a simple solution below for algebras
with a unity quantity.

The center of an algebra $\mathfrak{A}$ has been defined⁶ to be the set $\mathfrak{Z}$ of all quantities
$z$ of $\mathfrak{A}$ such that the commutative and associative laws, for products in $\mathfrak{A}$, hold
whenever $z$ is one of the factors. Then $\mathfrak{Z}$ is zero or an associative and commutative
subalgebra of $\mathfrak{A}$. When $\mathfrak{A}$ has a unity quantity $e$ the subalgebra $e\mathfrak{F}$ of $\mathfrak{A}$
is in its center $\mathfrak{Z}$ and we call $\mathfrak{A}$ a central algebra if $\mathfrak{Z} = e\mathfrak{F}$. We may now prove

Lemma 1. Let $\mathfrak{A}$ be an algebra with a unity quantity $e$ and $\mathfrak{Z}$ be the center of $\mathfrak{A}$.
Then $\mathfrak{A}$ has the form (1) if and only if $e = e_1 + \cdots + e_r$ for pairwise orthogonal
idempotents $e_i$ in $\mathfrak{Z}$ such that $\mathfrak{A}_i = \mathfrak{A}e_i$.

For if $\mathfrak{A}$ has the form (1) we may write every quantity of $\mathfrak{A}$ in the form $a =
a_1 + \cdots + a_r$, for the $a_i$ uniquely determined quantities of $\mathfrak{A}_i$. Then if $x =
x_1 + \cdots + x_r$ we have $a\cdot x = a_1\cdot x_1 + \cdots + a_r\cdot x_r$. Take $x = e = e_1 +
\cdots + e_r$ and have $e_i\cdot e_j = 0$ for $i \neq j$, $a\cdot e = a_1\cdot e_1 + \cdots + a_r\cdot e_r = a$ if and
only if $a_i\cdot e_i = a_i$. Similarly $e_i\cdot a_i = a_i$, $\mathfrak{A}_i$ has $e_i$ as its unity quantity, $\mathfrak{A}_i =
\mathfrak{A}e_i$. Now $(a\cdot x)\cdot e_i = (a_i\cdot x_i)\cdot e_i = a_i\cdot x_i = (a\cdot e_i)\cdot x_i = a\cdot(x\cdot e_i)$ and similar
other verifications imply that $e_i$ is in $\mathfrak{Z}$. Conversely if $e = e_1 + \cdots + e_r$,
for pairwise orthogonal idempotents $e_i$ in $\mathfrak{Z}$, we have $\mathfrak{A} = \mathfrak{A}_1 + \cdots + \mathfrak{A}_r$ for
$\mathfrak{A}_i = \mathfrak{A}e_i$ and we have (1).
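The content of Lemma 1 may be seen in the simplest decomposable algebra. The sketch below assumes, purely as an illustration, the algebra of integer triples with componentwise operations, whose three coordinate lines are the components; the names and sample quantities are hypothetical.

```python
# Triples with componentwise sum and product: a direct sum of three copies of the scalars.
E = [(1, 0, 0), (0, 1, 0), (0, 0, 1)]   # candidate idempotents e_1, e_2, e_3
e = (1, 1, 1)                           # unity quantity
zero = (0, 0, 0)

def add(a, x):
    return tuple(s + t for s, t in zip(a, x))

def mul(a, x):
    return tuple(s * t for s, t in zip(a, x))

a, x = (4, -2, 7), (3, 5, -1)

idempotent = all(mul(ei, ei) == ei for ei in E)
orthogonal = all(mul(E[i], E[j]) == zero
                 for i in range(3) for j in range(3) if i != j)

total = zero
for ei in E:
    total = add(total, ei)
sum_is_unity = total == e

# a . x = a_1 . x_1 + a_2 . x_2 + a_3 . x_3 with a_i = a . e_i, x_i = x . e_i
blockwise = zero
for ei in E:
    blockwise = add(blockwise, mul(mul(a, ei), mul(x, ei)))
product_splits = blockwise == mul(a, x)
```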

As a consequence of this result we have 


5 This term seems much preferable to the term reducible which causes so much confusion
if representation theory and linear algebra theory be considered together.

6 This was defined at the end of part I of this paper. We shall use the concepts given
in that part without any reference.





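Lemma 2. The algebra $\mathfrak{A}$ of Lemma 1 has a decomposition (1) if and only if
its center $\mathfrak{Z}$ has a corresponding decomposition

(2)    $\mathfrak{Z} = \mathfrak{Z}_1 + \cdots + \mathfrak{Z}_r.$

Then $\mathfrak{Z}_i$ is the intersection of $\mathfrak{A}_i$ and $\mathfrak{Z}$ and the center of $\mathfrak{A}_i$.

For $e$ is in $\mathfrak{Z}$ and Lemma 1 implies that from (1) we have (2) with

(3)    $\mathfrak{Z}_i = \mathfrak{Z}e_i,$

and conversely (2) and (3) imply (1) with $\mathfrak{A}_i = \mathfrak{A}e_i$. Then $\mathfrak{Z}_i$ is in both $\mathfrak{Z}$
and $\mathfrak{A}_i$, and is the intersection of $\mathfrak{Z}$ and $\mathfrak{A}_i$ by (2). If $z_i$ is in the center of $\mathfrak{A}_i$
then $z_i\cdot a_j = a_j\cdot z_i = 0$ for $a_j$ in $\mathfrak{A}_j$ and $j \neq i$, $z_i$ is in $\mathfrak{Z}_i$, $\mathfrak{Z}_i$ is the center of $\mathfrak{A}_i$.
Lemma 2 clearly implies

Lemma 3. An algebra $\mathfrak{A}$ with a unity quantity is indecomposable if and only
if its center is indecomposable.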

We then have 

Lemma 4. The decomposition of an algebra with a unity quantity as a direct 
sum (1) of indecomposable components is unique apart from the arrangement of 
the components. 

This follows from Lemmas 2, 3, and the associative case of the result we are 
proving. This latter result is proved by the use of the following lemma which 
may then be used to prove Lemma 4. 

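Lemma 5. Let an algebra $\mathfrak{A}$ with a unity quantity have a decomposition (1)
so that $\mathfrak{A}_i = \mathfrak{A}e_i$ with $e_i$ an idempotent of $\mathfrak{A}$. Then every right, left or two-sided
ideal $\mathfrak{B}$ of $\mathfrak{A}$ is the direct sum

(4)    $\mathfrak{B} = \mathfrak{B}_1 \oplus \cdots \oplus \mathfrak{B}_r,$

where $\mathfrak{B}_i = \mathfrak{B}e_i$ is the intersection of $\mathfrak{B}$ and $\mathfrak{A}_i$, $\mathfrak{B}_i$ is correspondingly a right, left,
or two-sided ideal of $\mathfrak{A}_i$.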

The proof of this result involves the use of the associative law only for prod¬ 
ucts a-b-c with a factor in the center, and thus the proof which has been given 
in the associative case is valid without change. We shall not repeat it here. 

3. Absolute indecomposability 

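If $\mathfrak{Z}$ is the center of an algebra $\mathfrak{A}$ over $\mathfrak{F}$ and $\mathfrak{K}$ is any scalar extension of $\mathfrak{F}$ the
center of $\mathfrak{A}_\mathfrak{K}$ is $\mathfrak{Z}_\mathfrak{K}$. For it is clear that $\mathfrak{Z}_\mathfrak{K}$ is contained in the center $\mathfrak{Z}_0$ of $\mathfrak{A}_\mathfrak{K}$.
Let then $z_0$ be in $\mathfrak{Z}_0$ so that we may write $z_0 = z_1\xi_1 + \cdots + z_p\xi_p$ where the $z_i$
are in $\mathfrak{A}$ and $\xi_i$ in $\mathfrak{K}$ are such that a sum $a_1\xi_1 + \cdots + a_p\xi_p = 0$ for the $a_i$ in $\mathfrak{A}$
only when the $a_i$ are all zero. Then if $a$ is in $\mathfrak{A}$ we have $a\cdot z_0 - z_0\cdot a = (a\cdot z_1 -
z_1\cdot a)\xi_1 + \cdots + (a\cdot z_p - z_p\cdot a)\xi_p = 0$, and $a\cdot z_i - z_i\cdot a = 0$. If also $x$ is in $\mathfrak{A}$
we compute $a\cdot(x\cdot z_0) - (a\cdot x)\cdot z_0$ and other similar products, and see that the
$z_i$ are in $\mathfrak{Z}$, $z_0$ is in $\mathfrak{Z}_\mathfrak{K}$, $\mathfrak{Z}_0 = \mathfrak{Z}_\mathfrak{K}$.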

An algebra $\mathfrak{A}$ over $\mathfrak{F}$ may be indecomposable but there may exist a scalar
extension $\mathfrak{K}$ of $\mathfrak{F}$ such that $\mathfrak{A}_\mathfrak{K}$ is decomposable. Thus we call $\mathfrak{A}$ absolutely
indecomposable if $\mathfrak{A}_\mathfrak{K}$ is indecomposable for every $\mathfrak{K}$. Moreover a decomposition
(1) of a decomposable algebra will be called an absolute decomposition if
the components $\mathfrak{A}_i$ are all absolutely indecomposable. Lemma 2 and the
result above then imply

Lemma 6. An algebra $\mathfrak{A}$ is absolutely indecomposable if and only if its center
$\mathfrak{Z}$ is absolutely indecomposable.

Lemma 7. A decomposition (1) is an absolute decomposition if and only if
the center of each component $\mathfrak{A}_i$ is absolutely indecomposable.

We may also use Lemma 4 to obtain 

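Lemma 8. Let $\mathfrak{A}$ be an algebra with a unity quantity, $\mathfrak{K}$ and $\mathfrak{K}_0$ be scalar
extensions of $\mathfrak{F}$ such that

(5)    $\mathfrak{A}_\mathfrak{K} = \mathfrak{A}_1 + \cdots + \mathfrak{A}_r, \qquad \mathfrak{A}_{\mathfrak{K}_0} = \mathfrak{B}_1 + \cdots + \mathfrak{B}_s$

for absolutely indecomposable $\mathfrak{A}_i$ and $\mathfrak{B}_j$. Then $r = s$ and, if we imbed $\mathfrak{K}$ and
$\mathfrak{K}_0$ in a scalar extension $\mathfrak{K}_1$ of $\mathfrak{F}$, there is a permutation $j_1, \ldots, j_r$ of $1, 2, \ldots, r$
such that $(\mathfrak{A}_i)_{\mathfrak{K}_1} = (\mathfrak{B}_{j_i})_{\mathfrak{K}_1}$.

For $\mathfrak{A}_{\mathfrak{K}_1} = (\mathfrak{A}_\mathfrak{K})_{\mathfrak{K}_1} = (\mathfrak{A}_1)_{\mathfrak{K}_1} \oplus \cdots \oplus (\mathfrak{A}_r)_{\mathfrak{K}_1} = (\mathfrak{A}_{\mathfrak{K}_0})_{\mathfrak{K}_1} = (\mathfrak{B}_1)_{\mathfrak{K}_1} \oplus \cdots \oplus
(\mathfrak{B}_s)_{\mathfrak{K}_1}$. Our result then follows from Lemma 4.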

The center of a simple algebra $\mathfrak{A}$ with a unity quantity is a field and if
separable is absolutely indecomposable only if its degree is one, $\mathfrak{A}$ is central. Thus we have

Lemma 9. A simple algebra with a unity quantity over $\mathfrak{F}$ and separable center
is absolutely indecomposable if and only if it is central simple over $\mathfrak{F}$.

4. Semi-simple algebras 

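We shall call an algebra $\mathfrak{A}$ over $\mathfrak{F}$ a semi-simple algebra if it has a unity
quantity $e$ and is the direct sum (1) of simple components $\mathfrak{A}_i$. Then we have seen
in Lemmas 1, 2 that $\mathfrak{A}_i$ has a unity quantity $e_i$ such that $e = e_1 + \cdots + e_r$,
the center $\mathfrak{Z}_i$ of $\mathfrak{A}_i$ is a field, the center $\mathfrak{Z}$ of $\mathfrak{A}$ is the direct sum of the $\mathfrak{Z}_i$.
We shall call $\mathfrak{A}$ separable if $\mathfrak{A}_\mathfrak{K}$ is semi-simple for every scalar extension $\mathfrak{K}$ of $\mathfrak{F}$.

Let the center $\mathfrak{Z}$ of a simple algebra $\mathfrak{A}$ over $\mathfrak{F}$ and with a unity quantity $e$
be a separable field. Then if $\mathfrak{K}$ is any scalar extension of $\mathfrak{F}$ the algebra $\mathfrak{Z}_\mathfrak{K} =
\mathfrak{Z}_1 \oplus \cdots \oplus \mathfrak{Z}_r$, where $\mathfrak{Z}_i$ is a separable field over $\mathfrak{K}$ equivalent over $\mathfrak{F}$ to a
composite of $\mathfrak{Z}$ and $\mathfrak{K}$. If $e_i$ is its unity quantity the algebra $\mathfrak{Z}_i$ contains
$\mathfrak{Z}e_i$ which is a field equivalent over $\mathfrak{F}$ to $\mathfrak{Z}$ under the mapping $z\cdot e_i \to z$.
We let $u_1, \ldots, u_s$ be a basis of $\mathfrak{A}$ over $\mathfrak{Z}$ and $u_g\cdot u_j = \sum_k u_k z_{gjk}$ for the $z_{gjk}$
in $\mathfrak{Z}$ and see that $u_1\cdot e_i, \ldots, u_s\cdot e_i$ are a basis of $\mathfrak{A}e_i$ over $\mathfrak{Z}e_i$, $(u_g\cdot e_i)\cdot(u_j\cdot e_i) =
\sum_k (u_k\cdot e_i)(z_{gjk}\cdot e_i)$. Then we have a corresponding decomposition $\mathfrak{A}_\mathfrak{K} =
\mathfrak{A}_1 \oplus \cdots \oplus \mathfrak{A}_r$ where $\mathfrak{A}_i = (\mathfrak{A}_\mathfrak{K})e_i = (\mathfrak{A}e_i)_{\mathfrak{Z}_i}$. But the linear mapping $a \to a\cdot e_i$
is clearly an equivalence over $\mathfrak{F}$ of $\mathfrak{A}$ and $\mathfrak{A}e_i$, $\mathfrak{Z}e_i$ is the center of $\mathfrak{A}e_i$, $\mathfrak{A}_i$ is
a simple algebra with center $\mathfrak{Z}_i$ over $\mathfrak{K}$.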

Conversely let $\mathfrak{A}$ be simple and separable. If $\mathfrak{Z}$ is not separable it is known
that there exists a scalar extension $\mathfrak{K}$ of $\mathfrak{F}$ such that $\mathfrak{Z}_\mathfrak{K}$ contains a quantity


7 This too seems a desirable terminology. 





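$y \neq 0$, $y^h = 0$ for some positive integer $h$. But by hypothesis $\mathfrak{A}_\mathfrak{K} = \mathfrak{A}_1 \oplus
\cdots \oplus \mathfrak{A}_r$ where the center of $\mathfrak{A}_\mathfrak{K}$ is a direct sum of fields and is a separable
associative algebra $\mathfrak{Z}_\mathfrak{K}$. However $y$ is properly nilpotent in $\mathfrak{Z}_\mathfrak{K}$, a contradiction.
We have proved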

Lemma 10. A simple algebra with a unity quantity is separable if and only 
if its center is a separable field . 

We also clearly have 

Lemma 11. An algebra $\mathfrak{A}$ with a unity quantity is separable if and only if it is
a direct sum of separable simple algebras.

By a well known property of separable fields we have

Lemma 12. Let $\mathfrak{A}$ be separable. Then there exists a scalar extension $\mathfrak{K}$ such
that $\mathfrak{A}_\mathfrak{K}$ is a direct sum $\mathfrak{A}_1 \oplus \cdots \oplus \mathfrak{A}_r$ of central simple algebras $\mathfrak{A}_i$ over $\mathfrak{K}$. This
is an absolute decomposition of $\mathfrak{A}_\mathfrak{K}$ and $r$ is the order over $\mathfrak{F}$ of the center of $\mathfrak{A}$.

5. Extending groups of linear transformations 

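If $u_1, \ldots, u_n$ is a basis of $\mathfrak{A}$ over $\mathfrak{F}$, any linear transformation $G$ over $\mathfrak{F}$ on $\mathfrak{A}$
has a matrix $\Gamma$ such that $G$ is given by $(\alpha_1u_1 + \cdots + \alpha_nu_n)G = \beta_1u_1 + \cdots
+ \beta_nu_n$ where

(6)    $(\beta_1, \ldots, \beta_n) = (\alpha_1, \ldots, \alpha_n)\Gamma.$

Then if $\mathfrak{K}$ is any scalar extension of $\mathfrak{F}$ we may indicate by $G_\mathfrak{K}$ the linear
transformation on $\mathfrak{A}_\mathfrak{K}$ with the same matrix $\Gamma$. It is given by the equations above
for the $\alpha_i$ and $\beta_i$ now in $\mathfrak{K}$. Conversely if the matrix $\Gamma$ of a linear transformation
$G_0$ on $\mathfrak{A}_\mathfrak{K}$ with respect to a basis $u_1, \ldots, u_n$ of the original algebra $\mathfrak{A}$ has elements
in $\mathfrak{F}$ then $G_0 = G_\mathfrak{K}$ where the matrix of $G$ in $(\mathfrak{F})_n$ is also $\Gamma$.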

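As in the theory of groups with operators we shall consider algebras $\mathfrak{A}$ over
$\mathfrak{F}$ with operator sets $\mathfrak{G}$ of linear transformations $G$ over $\mathfrak{F}$. If $\mathfrak{K}$ is any scalar
extension of $\mathfrak{F}$ we shall designate by $\mathfrak{G}_\mathfrak{K}$ the set of all $G_\mathfrak{K}$ on $\mathfrak{A}_\mathfrak{K}$ for $G$ in $\mathfrak{G}$.

A linear subspace $\mathfrak{B}$ over $\mathfrak{F}$ of $\mathfrak{A}$ will be called $\mathfrak{G}$-allowable if $bG$ is in $\mathfrak{B}$ for
every $b$ of $\mathfrak{B}$ and $G$ of $\mathfrak{G}$. Then a $\mathfrak{G}$-allowable ideal of $\mathfrak{A}$ will be called a $\mathfrak{G}$-ideal
and we shall say that $\mathfrak{A}$ is $\mathfrak{G}$-simple if it has no $\mathfrak{G}$-ideals other than itself and
the zero ideal. Finally we shall say that $\mathfrak{A}$ is $\mathfrak{G}$-central if every scalar extension
$\mathfrak{A}_\mathfrak{K}$ of $\mathfrak{A}$ is $\mathfrak{G}_\mathfrak{K}$-simple.

We shall restrict all further attention to subgroups $\mathfrak{G}$ of the multiplicative
group of all non-singular linear transformations on $\mathfrak{A}$ and shall call such a group
$\mathfrak{G}$ an extending⁸ group for $\mathfrak{A}$ if the unity quantity $e$ of $\mathfrak{A}$ has the property that
$eG = e$ for every $G$ of $\mathfrak{G}$. Then every subgroup of an extending group for $\mathfrak{A}$
is an extending group for $\mathfrak{A}$. Moreover $\mathfrak{G}$ is an extending group for $\mathfrak{A}$ if and
only if $\mathfrak{G}_\mathfrak{K}$ is an extending group for $\mathfrak{A}_\mathfrak{K}$ where $\mathfrak{K}$ ranges over all scalar extensions
of $\mathfrak{F}$. We now prove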

Theorem 1. Let $\mathfrak{G}$ be an extending group for a semi-simple algebra $\mathfrak{A} = \mathfrak{A}_1 \oplus$


8 We shall use this terminology in our theorems so as to diminish the size of the statement
of the hypotheses we shall find it necessary to make.





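$\cdots \oplus \mathfrak{A}_r$ with simple components $\mathfrak{A}_i$, and let there exist an $a_i \neq 0$ in $\mathfrak{A}_1$ and a
transformation $G_i$ in $\mathfrak{G}$ for each $i = 1, \ldots, r$ such that $a_iG_i$ is in $\mathfrak{A}_i$. Then $\mathfrak{A}$
is $\mathfrak{G}$-simple, and is $\mathfrak{G}$-central if the $\mathfrak{A}_i$ are all central simple over $\mathfrak{F}$.

For Lemma 5 states that every $\mathfrak{G}$-ideal $\mathfrak{B} = \mathfrak{B}_1 \oplus \cdots \oplus \mathfrak{B}_r$ where the
intersection of $\mathfrak{B}$ and $\mathfrak{A}_i$ is the ideal $\mathfrak{B}_i$ of $\mathfrak{A}_i$. If $\mathfrak{B} \neq 0$ some $\mathfrak{B}_j \neq 0$, $\mathfrak{B}_j = \mathfrak{A}_j$
contains $a_jG_j$. But $\mathfrak{G}$ is a group and $\mathfrak{B}$ is a $\mathfrak{G}$-ideal only if $(a_jG_j)G_j^{-1} = a_j$ is
in $\mathfrak{B}$. Hence $a_j$ in $\mathfrak{A}_1$ is in $\mathfrak{B}_1$, $\mathfrak{B}_1 = \mathfrak{A}_1$. Then $\mathfrak{B}$ contains every $a_i$ of our
hypothesis and also every $a_iG_i$. These are non-zero quantities since $a_iG_i = 0$
implies that $a_iG_iG_i^{-1} = a_i = 0$. They are in $\mathfrak{B}$ and in $\mathfrak{A}_i$ and hence in $\mathfrak{B}_i$,
$\mathfrak{B}_i \neq 0$, $\mathfrak{B}_i = \mathfrak{A}_i$, $\mathfrak{B} = \mathfrak{A}$ is $\mathfrak{G}$-simple. If every $\mathfrak{A}_i$ is central every $(\mathfrak{A}_i)_\mathfrak{K}$ is
simple and our proof implies that $\mathfrak{A}_\mathfrak{K}$ is $\mathfrak{G}_\mathfrak{K}$-simple, $\mathfrak{A}$ is $\mathfrak{G}$-central.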
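The mechanism of this proof may be observed by exhaustion in a very small case. The sketch below assumes, as an illustration not taken from the paper, the direct sum of two copies of the field of two elements, with the group consisting of the identity and the interchange of the two components; the interchange leaves the unity quantity $(1, 1)$ unaltered. All names are hypothetical.

```python
from itertools import combinations, product

# The algebra: pairs over the field of two elements, componentwise operations.
elements = list(product((0, 1), repeat=2))

def add(a, b):
    return tuple((s + t) % 2 for s, t in zip(a, b))

def mul(a, b):
    return tuple((s * t) % 2 for s, t in zip(a, b))

def swap(a):
    # The non-identity transformation of the group; it fixes the unity (1, 1).
    return (a[1], a[0])

def is_ideal(B):
    closed_sum = all(add(b, c) in B for b in B for c in B)
    closed_mul = all(mul(b, a) in B and mul(a, b) in B for b in B for a in elements)
    return (0, 0) in B and closed_sum and closed_mul

subsets = [frozenset(c) for r in range(1, 5) for c in combinations(elements, r)]
ideals = [B for B in subsets if is_ideal(B)]

# An ideal is allowable for the group when the interchange maps it into itself.
allowable_ideals = [B for B in ideals if all(swap(b) in B for b in B)]
```

Four ideals are found, namely zero, the two components and the whole algebra, and only the first and last are carried into themselves by the interchange. The algebra is therefore simple relative to this group although it is not simple.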

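Theorem 2. Let $\mathfrak{H}$ be a subgroup of an extending group $\mathfrak{G}$ for $\mathfrak{A}$. Then if $\mathfrak{A}$
is $\mathfrak{H}$-simple it is $\mathfrak{G}$-simple, and if $\mathfrak{A}$ is $\mathfrak{H}$-central it is $\mathfrak{G}$-central.

The next result may be regarded as the trivial case $\mathfrak{H} = [I]$ of Theorem 2.

Theorem 3. A simple algebra $\mathfrak{A}$ is $\mathfrak{G}$-simple for every $\mathfrak{G}$. If $\mathfrak{A}$ is central simple
it is $\mathfrak{G}$-central for every $\mathfrak{G}$.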

Every $G$ of an extending group $\mathfrak{G}$ for an algebra $\mathfrak{A}$ induces a linear mapping
$b \to bG$ of a linear subspace $\mathfrak{B}$ of $\mathfrak{A}$ on $\mathfrak{B}G$. If $\mathfrak{H}$ is any subgroup of $\mathfrak{G}$ such that
$\mathfrak{B}H$ is a subset of $\mathfrak{B}$ for every $H$ of $\mathfrak{H}$ then the mappings above are a group of
non-singular linear transformations on $\mathfrak{B}$ induced by $\mathfrak{H}$. We shall use this
terminology in the formulation of

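Theorem 4. Let the center of a simple algebra $\mathfrak{A}$ over $\mathfrak{F}$ be a (separable) normal
field $\mathfrak{Z}$ and let an extending group $\mathfrak{G}$ for $\mathfrak{A}$ have a subgroup $\mathfrak{H}$ inducing in $\mathfrak{Z}$ its
automorphism group. Then $\mathfrak{A}$ is $\mathfrak{G}$-central.

For if $\mathfrak{B}$ is any $\mathfrak{G}_\mathfrak{K}$-ideal of $\mathfrak{A}_\mathfrak{K}$ the set $\mathfrak{B}_\mathfrak{N}$ is a $\mathfrak{G}_\mathfrak{N}$-ideal of $\mathfrak{A}_\mathfrak{N}$, where $\mathfrak{N}$ is
any scalar extension of $\mathfrak{F}$ containing $\mathfrak{K}$. But it is well known that $\mathfrak{N}$ may be
so chosen that $\mathfrak{Z}_\mathfrak{N} = e_1\mathfrak{N} + \cdots + e_t\mathfrak{N}$ for pairwise orthogonal idempotents $e_i$,
$\mathfrak{A}_\mathfrak{N} = \mathfrak{A}_1 \oplus \cdots \oplus \mathfrak{A}_t$ as in Lemma 12, $\mathfrak{A}_i = (\mathfrak{A}_\mathfrak{N})e_i$ central simple. Moreover
$e_i = e_1H_i$ for $H_i$ in $\mathfrak{H}_\mathfrak{N}$ and hence $e_i = e_1G_i$ for $G_i$ in $\mathfrak{G}_\mathfrak{N}$. By Theorem 1 $\mathfrak{A}_\mathfrak{N}$ is
$\mathfrak{G}_\mathfrak{N}$-simple, $\mathfrak{B}_\mathfrak{N} = 0$ or $\mathfrak{A}_\mathfrak{N}$, $\mathfrak{B} = 0$ or $\mathfrak{A}_\mathfrak{K}$, $\mathfrak{A}$ is $\mathfrak{G}$-central.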

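Corollary I. A normal field $\mathfrak{A}$ over $\mathfrak{F}$ is $\mathfrak{G}$-central with respect to its (extending)
automorphism group $\mathfrak{G}$.

Theorem 4 may be extended to direct sums and the result stated as

Theorem 5. Let $\mathfrak{A}$ be a $\mathfrak{G}$-simple algebra of Theorem 1 such that the center
$\mathfrak{Z}_i$ of $\mathfrak{A}_i$ is a normal field over $\mathfrak{F}$. Then if $\mathfrak{G}$ has subgroups $\mathfrak{G}_i$ for $i = 1, \ldots, r$
such that $\mathfrak{G}_i$ induces in $\mathfrak{Z}_i$ its automorphism group, the algebra $\mathfrak{A}$ is $\mathfrak{G}$-central.

This generalization is a corollary of Theorem 4 and the following

Theorem 6. Let $\mathfrak{A}$ be a $\mathfrak{G}$-simple algebra of Theorem 1 and $\mathfrak{G}$ have subgroups
$\mathfrak{G}_i$ inducing in $\mathfrak{A}_i$ an extending group $\mathfrak{H}_i$ such that $\mathfrak{A}_i$ is $\mathfrak{H}_i$-central for every $i =
1, \ldots, r$. Then $\mathfrak{A}$ is $\mathfrak{G}$-central.

To prove this result we see that every $\mathfrak{G}_\mathfrak{K}$-ideal of $\mathfrak{A}_\mathfrak{K}$ has the form $\mathfrak{B} = \mathfrak{B}_1 \oplus
\cdots \oplus \mathfrak{B}_r$ where $\mathfrak{B}_i$ is the ideal of $(\mathfrak{A}_i)_\mathfrak{K}$ which is the intersection of $\mathfrak{B}$ and $(\mathfrak{A}_i)_\mathfrak{K}$.
Then $\mathfrak{B}_i(\mathfrak{H}_i)_\mathfrak{K}$ is in $\mathfrak{B}$ and in $(\mathfrak{A}_i)_\mathfrak{K}$ and is in $\mathfrak{B}_i$, $\mathfrak{B}_i$ is an $(\mathfrak{H}_i)_\mathfrak{K}$-ideal. Since $\mathfrak{A}_i$
is $\mathfrak{H}_i$-central $\mathfrak{B}_i = 0$ or $(\mathfrak{A}_i)_\mathfrak{K}$. The remainder of our proof is exactly as in the
proof of Theorem 1.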





6. Crossed extensions 

A quantity $g$ of an algebra $\mathfrak{A}$ has been called non-singular if it is neither a
right nor a left-divisor of zero. Then the right multiplication $R_g$ and the left
multiplication $L_g$ are non-singular and we have

Lemma 13. Let $a$ and $g$ be quantities of an algebra $\mathfrak{A}$ such that $g$ is non-singular.
Then if an ideal $\mathfrak{B}$ of $\mathfrak{A}$ contains either $a\cdot g$ or $g\cdot a$ it contains $a$.

For $a\cdot g = aR_g$ is in $\mathfrak{B}$ and so is $(aR_g)\cdot g = a(R_g)^2$. If $a(R_g)^k$ is in $\mathfrak{B}$ so is
$[a(R_g)^k]\cdot g = a(R_g)^{k+1}$. Hence $\mathfrak{B}$ contains $a(R_g)^k$ for every positive integer $k$.
Since $\mathfrak{B}$ is a linear space it contains every $a\phi(R_g)$ for $\phi(R_g) = \alpha_1R_g + \alpha_2(R_g)^2 +
\cdots + \alpha_r(R_g)^r$ and the $\alpha_i$ in $\mathfrak{F}$. But the constant term of the characteristic
function $\psi(\lambda) = \lambda^n + \beta_1\lambda^{n-1} + \cdots + \beta_n$ of $R_g$ is not zero, $\psi(R_g) = 0$, the identity
transformation $I = -\beta_n^{-1}[\beta_{n-1}R_g + \cdots + (R_g)^n]$ is a $\phi(R_g)$, $\mathfrak{B}$ contains $aI = a$.
Similarly if $\mathfrak{B}$ contains $g\cdot a$ it contains every $a\phi(L_g)$ and $a$.
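The step of this proof which expresses the identity transformation as a polynomial in $R_g$ without constant term may be illustrated for $n = 2$. The sketch below takes a hypothetical two-rowed non-singular matrix in place of $R_g$, so that $\psi(\lambda) = \lambda^2 + \beta_1\lambda + \beta_2$ with $\beta_1$ the negative of the trace and $\beta_2$ the determinant; the matrix and the row vector $a$ are assumptions chosen only for the example.

```python
from fractions import Fraction

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def vecmat(v, m):
    # Row vector times matrix: transformations act on the right, as in the text.
    return [sum(v[k] * m[k][j] for k in range(len(v))) for j in range(len(m[0]))]

R = [[2, 1], [1, 3]]                           # stands in for R_g
beta1 = -(R[0][0] + R[1][1])                   # coefficient of lambda
beta2 = R[0][0] * R[1][1] - R[0][1] * R[1][0]  # constant term, not zero

R2 = matmul(R, R)
# I = -(1 / beta2) * (beta1 * R + R^2), a polynomial in R without constant term
phi_R = [[-Fraction(beta1 * R[i][j] + R2[i][j], beta2) for j in range(2)]
         for i in range(2)]

a = [5, -7]
recovered = vecmat(a, phi_R)                   # a phi(R) should equal a
```

Since every power $a(R_g)^k$ with $k \geq 1$ lies in the ideal, so does the combination computed here, and that combination is $a$ itself.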

We now make the 

Definition. Let $\mathfrak{A}$ be an algebra with a unity quantity $e$ and $\mathfrak{G}$ be an
extending group for $\mathfrak{A}$. Then a set

(7)    $\mathfrak{g} = \{g_{S,T}\}$

of quantities $g_{S,T}$ of $\mathfrak{A}$ will be called an extending set⁹ for $\mathfrak{A}$ by $\mathfrak{G}$ if $\mathfrak{g}$ contains one
and only one $g_{S,T}$ for every pair of transformations $S$ and $T$ of $\mathfrak{G}$, the $g_{S,T}$ are
non-singular quantities of $\mathfrak{A}$, and

(8)    $g_{I,S} = g_{S,I} = e$

for every $S$ of $\mathfrak{G}$.

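We shall now proceed to our definition of new classes of algebras. We let $\mathfrak{A}$
be any algebra with a unity quantity $e$, $n$ be the order over $\mathfrak{F}$ of $\mathfrak{A}$, $\mathfrak{G}$ be a finite¹⁰
extending group of order $m$ for $\mathfrak{A}$, $\mathfrak{g}$ be an extending set for $\mathfrak{A}$ by $\mathfrak{G}$. We let $\mathfrak{H}$
be a subset of $\mathfrak{G}$ containing the identity transformation. Then we shall define
an algebra

(9)    $\mathfrak{E} = (\mathfrak{A}, \mathfrak{G}, \mathfrak{g})$

of order $\nu = nm$ over $\mathfrak{F}$ which we shall call the crossed extension of $\mathfrak{A}$ by $\mathfrak{H}$ and
$\mathfrak{G}$ with extension set $\mathfrak{g}$. We shall also call the integer $m$ the extension index
of $\mathfrak{A}$ under $\mathfrak{E}$.

We let $\mathfrak{R}$ be a linear space of order $\nu$ over $\mathfrak{F}$ so that $\mathfrak{R}$ is the supplementary sum

(10)    $\mathfrak{R}_1 + \cdots + \mathfrak{R}_m,$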

9 The term extending set is preferable to that of factor set which we reserve for extending
sets restricted so that the algebras we construct will be associative.

10 It seems clear that if we take $\mathfrak{G}$ to be an infinite group our construction will be valid
if we take the corresponding linear space $\mathfrak{R}$ to consist of vectors with finitely many
non-zero coordinates. Moreover it seems that our hypotheses insuring that the result is a simple
algebra will be also sufficient for the algebras of infinite order. It would be interesting to
take the case where $\mathfrak{G}$ consists of the non-zero quantities of a division algebra, as well as
that where $\mathfrak{R}$ is a Hilbert space.





of linear subspaces $\mathfrak{R}_i$ each of order $n$ over $\mathfrak{F}$. Then there exist corresponding
non-singular linear transformations $C_1, \ldots, C_m$ in $(\mathfrak{F})_\nu$ such that $C_1 = I_\nu$
is the identity transformation on $\mathfrak{R}$,

(11)    $\mathfrak{R}_i = \mathfrak{R}_1C_i \qquad (i = 1, \ldots, m).$

Thus every quantity of $\mathfrak{R}$ is uniquely expressible in the form

(12)    $a = a_1C_1 + \cdots + a_mC_m \qquad$ ($a_i$ in $\mathfrak{R}_1$).

Observe next that the linear spaces $\mathfrak{A}$ and $\mathfrak{R}_1$ have the same order and thus
that it is possible to take $\mathfrak{A} = \mathfrak{R}_1$. We do this and have thus imbedded the
algebra $\mathfrak{A}$ in $\mathfrak{R}$ as a linear subspace of $\mathfrak{R}$. We shall actually define $\mathfrak{E}$ to be an
algebra whose quantities are the vectors of $\mathfrak{R}$ and we shall formulate our definition
so that $\mathfrak{A}$ will be a subalgebra of $\mathfrak{E}$.

Let us order the transformations of $\mathfrak{G}$ in any order such that the first
transformation is $I$ and thus have the notations

(13)    $S_1 = I, S_2, \ldots, S_m$

for these transformations. We have then defined a one-to-one mapping

(14)    $S_i \to C_i = C_{S_i}$

of $\mathfrak{G}$ on the set of $C_i$ such that $C_1 = C_I = I_\nu$, the identity transformation on the
space of order $\nu$. If $S = S_i$ we designate by $a_S$ the coefficient $a_i$ of $C_i = C_S$
and may thus write every $a$ of $\mathfrak{R}$ uniquely in the form

(15)    $a = \sum_S a_SC_S$

for the $a_S$ in $\mathfrak{A}$ where the sum is taken over all $S$ of $\mathfrak{G}$. Write

(16)    $x = \sum_T x_TC_T,$

and define

$a\cdot x = \sum_U y_UC_U,$

for the $x_T$ and $y_U$ in $\mathfrak{A}$ where the $y_U$ are to be determined. Then the
distributive law holds only if

(17)    $a\cdot x = \sum_{S,T} (a_SC_S)\cdot(x_TC_T).$

We now let

(18)    $a_SC_S\cdot x_TC_T = y_{S,T}C_{ST},$

so that if we write $U = ST$, $T = S^{-1}U$, then we have

(19)    $y_U = \sum_S y_{S,S^{-1}U}.$





We shall then complete our definition when we express the $y_{S,T}$ in terms of
$a_S$, $x_T$ and $\mathfrak{g}$, and we do this by defining the function

(20)    $w(S, T, a, x) = aT\cdot x \qquad$ ($S$ in $\mathfrak{H}$),

(21)    $w(S, T, a, x) = x\cdot aT \qquad$ ($S$ not in $\mathfrak{H}$),

for every ordered pair $a$, $x$ of quantities of $\mathfrak{A}$ where the products indicated are
products in $\mathfrak{A}$ of its quantities. We are then able to write the desired formulas

(22)    $y_{S,T} = w(ST, I, g_{S,T}, w_{S,T}), \qquad w_{S,T} = w(S, T, a_S, x_T).$

In particular $e = g_{I,T} = g_{S,I}$ for every $S$ and hence we have

(23)    $y_{I,T} = w_{I,T} = a_IT\cdot x_T, \qquad y_{S,I} = w_{S,I} = w(S, I, a_S, x_I).$

Conversely let $y_{S,T}$ be defined by (20), (21), (22), so that the $y_U$ are defined
uniquely by (19). Since $\mathfrak{A}$ is a linear algebra it is clear that the $y_{S,T}$ are linear
in the $x_T$ and thus also in the coordinates of $x$, the $y_U$ are linear in $x$, $a\cdot x$ is
linear in $x$. Also every $T$ is a linear transformation, the $y_{S,T}$ are linear in $a_ST$
and hence in $a_S$, $a\cdot x$ is linear in $a$. It follows that $\mathfrak{E}$ is a linear algebra.
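The formulas (19) to (22) may be traced through the smallest non-trivial case. The sketch below is an illustration under stated assumptions and is not an example from the paper: the algebra is the field of complex numbers over the reals, the group consists of the identity $I$ and complex conjugation $J$ (encoded as 0 and 1), and the extending set has $g_{J,J} = -1$ with all other $g_{S,T} = 1$, so that (8) holds. All names are hypothetical.

```python
# Group of order two: 0 stands for the identity I, 1 for conjugation J.
G = (0, 1)

def compose(S, T):
    return S ^ T

def act(a, T):
    # The quantity aT: conjugate when T = J, unchanged when T = I.
    return a.conjugate() if T == 1 else a

# Extending set with g_{I,S} = g_{S,I} = 1 and the assumed value g_{J,J} = -1.
g = {(0, 0): complex(1), (0, 1): complex(1), (1, 0): complex(1), (1, 1): complex(-1)}

def w(S, T, a, x, H):
    # Formulas (20), (21): aT . x when S is in H, x . aT otherwise.
    aT = act(a, T)
    return aT * x if S in H else x * aT

def times(a, x, H):
    # a and x map each S of the group to the coefficients a_S, x_S.
    y = {U: complex(0) for U in G}
    for S in G:
        for T in G:
            w_ST = w(S, T, a[S], x[T], H)             # second formula of (22)
            U = compose(S, T)
            y[U] += w(U, 0, g[(S, T)], w_ST, H)       # first formula of (22), summed as in (19)
    return y

H = {0, 1}
e = {0: complex(1), 1: complex(0)}      # unity quantity
u_J = {0: complex(0), 1: complex(1)}    # the quantity e C_J
a = {0: 1 + 2j, 1: 3 - 1j}
b = {0: -2 + 1j, 1: 1 + 1j}
c = {0: 0 + 3j, 1: 2 - 2j}

square_of_u_J = times(u_J, u_J, H)      # expected to have coefficient g_{J,J} at I
associative_here = times(times(a, b, H), c, H) == times(a, times(b, c, H), H)
independent_of_H = times(a, b, {0}) == times(a, b, {0, 1})
```

Because the complex numbers are commutative the choice of the subset $\mathfrak{H}$ has no effect in this case, and the resulting algebra of order four behaves like the real quaternions. The distinction between (20) and (21) matters only when the algebra being extended is not commutative.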

We note that $y_{S,T} = 0$ if either $a_S$ or $x_T$ is zero. Then if $a$ and $x$ are in $\mathfrak{A}$
we have $a = a_I$, $x = x_I$, all the $y_{S,T}$ are zero except $y_{I,I} = a_I\cdot x_I$ by (23), $\mathfrak{A}$ is a
subalgebra of $\mathfrak{E}$. In fact we may prove

Theorem 7. The algebra $\mathfrak{E}$ of (7)-(21) contains $\mathfrak{A}$ as a subalgebra and the
unity quantity $e$ of $\mathfrak{A}$ is the unity quantity of $\mathfrak{E}$. Every quantity of $\mathfrak{E}$ is uniquely
expressible in the form

(24)    $a = \sum_S u_S\cdot a_S \qquad$ ($S$ in $\mathfrak{G}$, $a_S$ in $\mathfrak{A}$),

where $u_S = eC_S$, $u_I = e$. The quantities $u_S$ are non-singular quantities of $\mathfrak{E}$
such that $u_S\cdot a_S = a_SC_S$ and

(25)    $u_S\cdot u_T = u_{ST}\cdot g_{S,T}$

for every $S$ and $T$ of $\mathfrak{G}$. Then the definitive properties of $\mathfrak{E}$ are completely given
by (20), (21), (25) and

(26)    $(u_S\cdot a_S)\cdot(u_T\cdot x_T) = (u_S\cdot u_T)\cdot[w(S, T, a_S, x_T)].$

For by (23) we have $y_{I,T} = x_T = y_T$ if $\mathfrak{a} = e$, $e \cdot \mathfrak{x} = \mathfrak{x}$. Similarly
$y_{S,I} = w(S, I, a_S, e) = a_S$ if $\mathfrak{x} = e$, $y_{S,T} = 0$ unless $S^{-1}U = T = I$, $S = U$. Then
$y_U = y_{U,I} = a_U$, $\mathfrak{y} = \mathfrak{a} = \mathfrak{a} \cdot e$, $e$ is the unity quantity of $\mathfrak{E}$. Now
$u_S \cdot a_S = eC_S \cdot a_S C_I = y_{S,I} C_S = [w(S, I, e, a_S)] C_S = a_S C_S$ so that (15) is equivalent to
(24). Also (18) becomes

(27) $(u_S \cdot a_S) \cdot (u_T \cdot x_T) = u_{ST} \cdot y_{S,T}$.

But (25) follows from (27) if we put $a_S = x_T = e$ and use the property that
$eT = e$. The definition (22) then states that (27) is equivalent to

(28) $(u_S \cdot a_S) \cdot (u_T \cdot x_T) = u_{ST} \cdot [w(ST, I, g_{S,T}, w_{S,T})]$.



NON-ASSOCIATIVE ALGEBRAS, II




But $(u_S \cdot u_T) \cdot w_{S,T} = (u_{ST} \cdot g_{S,T}) \cdot (u_I \cdot w_{S,T}) = u_{ST} \cdot [w(ST, I, g_{S,T}, w_{S,T})]$ and
(28) implies (26). Conversely (26) and (25) imply that $(u_S \cdot a_S) \cdot (u_T \cdot x_T) =
(u_{ST} \cdot g_{S,T}) \cdot w_{S,T} = (u_{ST} \cdot g_{S,T}) \cdot (u_I \cdot w_{S,T})$ and the fact that $u_{ST} \cdot u_I = u_{ST}$ used
with (26) implies (27).

Since $g_{S,T}$ is non-singular we have $u_S \cdot (u_T \cdot x_T) = u_{ST} \cdot y_{S,T}$ where $y_{S,T} =
g_{S,T} \cdot x_T$ or $x_T \cdot g_{S,T}$ is not zero unless $x_T = 0$. Then $u_S \cdot \mathfrak{x} = \sum_T u_{ST} \cdot y_{S,T} \neq 0$
unless $\mathfrak{x} = 0$, $u_S$ is not a left divisor of zero. Similarly $u_S$ is not a right divisor
of zero and is a non-singular quantity of $\mathfrak{E}$. This proves our theorem.

7. A non-simple crossed extension 

The crossed extension $\mathfrak{E}$ need not be simple even when $\mathfrak{A}$ is $\mathfrak{G}$-simple, nor
need $\mathfrak{E}$ be central when $\mathfrak{A}$ is $\mathfrak{G}$-central. Let us give an example here of such an
algebra. We take $\mathfrak{F}$ to be a field of real numbers, $\mathfrak{A} = \mathfrak{A}_1 \oplus \mathfrak{A}_2$ where $e_1$ and $e_2$
are the respective unity quantities of the quadratic fields $\mathfrak{A}_1$ and $\mathfrak{A}_2$ defined by

$u_1^2 = -e_1$, $u_2^2 = -e_2$, $\mathfrak{A}_1 = e_1\mathfrak{F} + u_1\mathfrak{F}$, $\mathfrak{A}_2 = e_2\mathfrak{F} + u_2\mathfrak{F}$.

Then every quantity of $\mathfrak{A}$ is uniquely expressible in the form

(29) $a = a_1u_1 + a_2u_2 + a_3u_3 + a_4u_4$ ($a_i$ in $\mathfrak{F}$),

where $u_3 = e_1 - u_1$, $u_4 = e_2 - u_2$. We let $\mathfrak{G}$ be the group of linear transformations
on $\mathfrak{A}$ obtained by applying the permutations (13), (24), (13)(24), (12)(34),
(14)(23), (1432), (1234) and the identity to the subscripts $i$ on the $u_i$ in
(29). Then $\mathfrak{G}$ is an extending group of order eight for $\mathfrak{A}$ and is known 11 to be
generated by the transformation $S$ obtained by applying (13) and the transformation
$P$ obtained by applying (12)(34). We let $T$ be the transformation
obtained by applying (24) and have

(30) $ST = TS$, $S^2 = T^2 = I$, $SP = PT$, $PS = TP$.

Here $TP$ is the transformation obtained by applying the cycle (1234) to the
subscripts of the basal quantities in (29), and $PT$ is its inverse obtained by
applying (4321).
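
The relations (30) can be checked mechanically. The following is a minimal sketch, assuming only that products are read from left to right (so that $TP$ means "apply $T$, then $P$", in keeping with the notation $xS$ for images); the helper names `perm`, `then`, `key` and `generate` are illustrative and do not come from the paper.

```python
# Permutations of the subscripts {1, 2, 3, 4}, stored as dicts i -> image of i.
def perm(*cycles):
    """Build a permutation of {1, 2, 3, 4} from disjoint cycles."""
    p = {i: i for i in range(1, 5)}
    for cyc in cycles:
        for k, i in enumerate(cyc):
            p[i] = cyc[(k + 1) % len(cyc)]
    return p

def then(p, q):
    """Right-action product pq: apply p first, then q."""
    return {i: q[p[i]] for i in p}

def key(p):
    """Hashable form of a permutation."""
    return tuple(p[i] for i in range(1, 5))

I = perm()
S = perm((1, 3))
T = perm((2, 4))
P = perm((1, 2), (3, 4))

def generate(gens):
    """Close the generators under multiplication, starting from the identity."""
    seen = {key(I): I}
    frontier = [I]
    while frontier:
        g = frontier.pop()
        for h in gens:
            gh = then(g, h)
            if key(gh) not in seen:
                seen[key(gh)] = gh
                frontier.append(gh)
    return seen

G = generate([S, P])
relations_hold = (
    then(S, T) == then(T, S)
    and then(S, S) == I and then(T, T) == I
    and then(S, P) == then(P, T)
    and then(P, S) == then(T, P)
)
print(len(G), relations_hold)
```

Under this reading the group generated by $S$ and $P$ has the eight elements listed above, and $TP$, $PT$ come out as the cycles (1234) and (4321).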

The algebra 21 has the property that 21# = 93i © 332 © 93 3 © 934 where 33* 
has order one over the field $ of all complex numbers, 33, = i such that 
2vi = Uz + (1 + i) U \, 2t; 3 = w 3 + (1 — i) U \, 2v 2 = Ua + (1 + i) u 2y 2v K = Ui 
+ (1 — i) in are pairwise orthogonal idempotents. Any non-zero ideal 93 of 
21# contains one of the Vi . If 93 is a ©#-ideal it contains with v\ the quantity 
uz and then we apply P and its powers to get all the w,, 93 = 21# . If 93 con¬ 
tains t> 3 and hence 2ui + t/ 3 we apply S to get 2w 3 + Ui and hence the quan¬ 
tity 2(2wi + u 3 ) - (2uz + ui) = Zui . Again 93 = 21# . Similarly if 93 
contains either v 2 or va it is equal to 21# , 21# is ©-simple, 21 is ©-central. 

Observe that $S$ carries quantities of $\mathfrak{A}_1$ into other quantities and leaves $e_1$
as well as every quantity of $\mathfrak{A}_2$ unaltered, $T$ carries quantities of $\mathfrak{A}_2$ into other

11 cf. L. E. Dickson, Modern Algebraic Theories , pp. 145-6. 





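quantities such that $e_2$ is unaltered and $T$ leaves all quantities of $\mathfrak{A}_1$ unaltered.
We form the crossed extension $\mathfrak{E}$ defined above for all quantities in the extension
set equal to the unity quantity of $\mathfrak{A}$ and for $\mathfrak{H}$ the identity group. Let $\mathfrak{z} =
u_S \cdot e_2 + u_T \cdot e_1$. Now $\mathfrak{z} \cdot a = u_S \cdot (a_2 \cdot e_2) + u_T \cdot (a_1 \cdot e_1)$ for every $a = a_1 + a_2$
such that $a_1$ is in $\mathfrak{A}_1$ and $a_2$ in $\mathfrak{A}_2$. Also $a \cdot \mathfrak{z} = u_S \cdot (aS \cdot e_2) + u_T \cdot (aT \cdot e_1) = \mathfrak{z} \cdot a$
since $(aS) \cdot e_2 = (a_1 S + a_2) \cdot e_2 = a_2 \cdot e_2$, $aT \cdot e_1 = (a_2 T + a_1) \cdot e_1 = a_1 \cdot e_1$. Moreover
$u_S \cdot \mathfrak{z} = u_{ST} \cdot e_1 + e_2 = \mathfrak{z} \cdot u_S = u_{TS} \cdot e_1 + e_2$, $u_T \cdot \mathfrak{z} = u_{TS} \cdot e_2 + e_1 = \mathfrak{z} \cdot u_T$.
Finally $u_P \cdot \mathfrak{z} = u_{PS} \cdot e_2 + u_{PT} \cdot e_1$, $\mathfrak{z} \cdot u_P = u_{SP} \cdot e_1 + u_{TP} \cdot e_2 = u_P \cdot \mathfrak{z}$ by (30).
That $(\mathfrak{x} \cdot \mathfrak{y}) \cdot \mathfrak{z} = \mathfrak{x} \cdot (\mathfrak{y} \cdot \mathfrak{z})$ when one of the factors is $\mathfrak{z}$ follows from the fact that $\mathfrak{A}$
is a commutative associative algebra and that $u_Q \cdot u_R = u_{QR}$. Then $\mathfrak{z}$ is in the
center of our crossed extension $\mathfrak{E}$. But $\mathfrak{z}^2 = (u_S \cdot e_2)^2 + (u_T \cdot e_1)^2 +
(u_S \cdot e_2)(u_T \cdot e_1) + (u_T \cdot e_1)(u_S \cdot e_2) = e$ since $(u_S \cdot e_2)^2 = e_2$, $(u_T \cdot e_1)^2 = e_1$, and the
other two terms are equal to $u_{ST} \cdot (e_1 \cdot e_2) = 0$. It follows that $\mathfrak{E}$ is a direct sum
$\mathfrak{E} = \mathfrak{E}\mathfrak{z}_1 \oplus \mathfrak{E}\mathfrak{z}_2$, $2\mathfrak{z}_1 = e - \mathfrak{z}$, $2\mathfrak{z}_2 = e + \mathfrak{z}$, $\mathfrak{E}$ is neither simple nor central.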

8. Simple crossed extensions 

We have just seen that $\mathfrak{A}$ may be $\mathfrak{G}$-simple but its extension $\mathfrak{E}$ not a simple
algebra. Thus we shall have to make additional hypotheses if we wish every $\mathfrak{E}$
defined for the given $\mathfrak{A}$, $\mathfrak{G}$, $\mathfrak{H}$ to be simple. These conditions are really a part
of the usual associative crossed product definition, but are hidden in the more
explicit and special nature of those algebras.

Let us call a linear transformation $S$ on the linear space $\mathfrak{A}$ an inner 12 or an
outer transformation for this algebra according as there is or is not a quantity
$b \neq 0$ in $\mathfrak{A}$ such that

(31) $b \cdot x = xS \cdot b$.

The identity transformation $I$ is one of a class of inner transformations on $\mathfrak{A}$
which have the property that (31) holds for $b$ in the center of $\mathfrak{A}$. We shall call
any such transformation a semi-identity transformation for $\mathfrak{A}$. If $S$ is semi-identical
and $\mathfrak{A}$ is semi-simple we may write $b = b_1 + \cdots + b_s$, $\mathfrak{A} = \mathfrak{A}_1 \oplus
\cdots \oplus \mathfrak{A}_s \oplus \mathfrak{C}$ for simple algebras $\mathfrak{A}_i$ such that $b_i \neq 0$ is in the center of $\mathfrak{A}_i$ and
(31) becomes $b \cdot (x - xS) = 0$. But there exist $d_1, \dots, d_s$ in the center of
$\mathfrak{A}_1, \dots, \mathfrak{A}_s$ such that $d_i \cdot b_i = e_i$, $f = e_1 + \cdots + e_s$ is the unity quantity of the
ideal $\mathfrak{B} = \mathfrak{A}_1 \oplus \cdots \oplus \mathfrak{A}_s$ of $\mathfrak{A}$, $f \cdot (x - xS) = 0$. Then $\mathfrak{A} = \mathfrak{B} \oplus \mathfrak{C}$ such
that $y - yS$ is in $\mathfrak{C}$ for every $y$ of $\mathfrak{B}$, $\mathfrak{C}S = \mathfrak{C}$, $b \cdot x = xS \cdot b$ for every $b$ in the center
of $\mathfrak{B}$.

We now have a terminology for the hypotheses we shall require, and we 
shall prove 

Theorem 8. Let $\mathfrak{A}$ be a semi-simple algebra, $\mathfrak{G}$ be an extending group for $\mathfrak{A}$
such that $\mathfrak{A}$ is $\mathfrak{G}$-simple and $I$ is the only semi-identity transformation for $\mathfrak{A}$ in $\mathfrak{G}$.


12 If $S$ is an automorphism it is inner in the ordinary sense only when the quantity $b$ is
non-singular. However we shall require the (only slightly) more general hypothesis we
give here.





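Then if there is any subset $\mathfrak{H}$ of $\mathfrak{G}$ such that $\mathfrak{H}$ consists of $I$ and outer 13 transformations
for $\mathfrak{A}$ the crossed extensions $\mathfrak{E} = (\mathfrak{A}, \mathfrak{G}, \mathfrak{H}, \mathfrak{g})$ are simple algebras.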

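For let $p(\mathfrak{a})$ be the number of non-zero coefficients $a_S$ in the unique expression
(24) of any $\mathfrak{a}$ in $\mathfrak{E}$ for $a_S$ in $\mathfrak{A}$. Then $p(0) = 0$, $p(\mathfrak{a})$ is a positive integer for
every $\mathfrak{a} \neq 0$. Let $\mathfrak{B}$ be a non-zero ideal of $\mathfrak{E}$ and $p$ be the least $p(\mathfrak{y})$ for any
$\mathfrak{y} \neq 0$ in $\mathfrak{B}$. Then some $\mathfrak{b}$ in $\mathfrak{B}$ has the property $p(\mathfrak{b}) = p$ and if we write $\mathfrak{b} =
\sum u_S \cdot b_S$ as in (24) there is an $S_0$ such that $b_{S_0} \neq 0$. Then $u_T \cdot \mathfrak{b} = \sum u_{TS} \cdot c_{TS}$
where $c_{TS}$ is the product of $b_S$ by a non-singular quantity $g_{T,S}$ of $\mathfrak{A}$ and is not
zero if $b_S \neq 0$. Take $T = S_0^{-1}$ and have a quantity $\mathfrak{c}$ in $\mathfrak{B}$ such that $p(\mathfrak{c}) = p$,
$\mathfrak{c} = \sum_S u_S \cdot c_S$, $c_I \neq 0$. We now let $\mathfrak{D}$ be the set of all finite sums of terms of
the form $(x \cdot \mathfrak{c}) \cdot y$, $x \cdot (y \cdot \mathfrak{c})$, for $x$ and $y$ in $\mathfrak{A}$. It follows that every $\mathfrak{d}$ of $\mathfrak{D}$ has
the form

$\mathfrak{d} = d_I + \sum_{i=2}^{p} u_{S_i} \cdot d_{S_i}$,

for a fixed set $I, S_2, \dots, S_p$ in $\mathfrak{G}$ and with the $d_S$ in $\mathfrak{A}$. Then $\mathfrak{c}$ is in the set
$\mathfrak{D}$ and $\mathfrak{D}$ is a non-zero linear subspace of the ideal $\mathfrak{B}$ such that $p(\mathfrak{d}) = p$ for every
non-zero $\mathfrak{d}$ of $\mathfrak{D}$. Moreover the quantities $d_I$ consist of all finite sums of the
form $(x \cdot c_I) \cdot y$ or $x \cdot (c_I \cdot y)$ for $x$ and $y$ in $\mathfrak{A}$, the set of all the $d_I$ is a non-zero
ideal of $\mathfrak{A}$. Let $f_I$ be its unity quantity so that $f_I$ is in the center of $\mathfrak{A}$ and
there is a quantity $\mathfrak{f}$ in $\mathfrak{D}$ such that

$\mathfrak{f} = f_I + \sum_{i=2}^{p} u_{S_i} \cdot f_{S_i}$ ($f_{S_i}$ in $\mathfrak{A}$).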

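If $p > 1$ and some $T = S_i$ is not in $\mathfrak{H}$ it is not a semi-identical transformation
and there exists a quantity $x$ in $\mathfrak{A}$ such that $h_I = x \cdot f_I - f_I \cdot xT =
f_I \cdot (x - xT) \neq 0$. The corresponding $\mathfrak{d} = x \cdot \mathfrak{f} - \mathfrak{f} \cdot xT \neq 0$ is in $\mathfrak{D}$ so that
$p(\mathfrak{d}) = p$. But the term of $\mathfrak{d}$ involving $u_T$ is $x \cdot (u_T \cdot f_T) - (u_T \cdot f_T) \cdot xT =
u_T \cdot (xT \cdot f_T - xT \cdot f_T) = 0$, a contradiction. Hence every $S_i$ is in $\mathfrak{H}$ and if $p > 1$
there is an $S_i = T$ which is an outer transformation, there must exist a quantity
$x$ in $\mathfrak{A}$ such that $h_T = xT \cdot f_T - f_T \cdot x \neq 0$, $x \cdot (u_T \cdot f_T) - (u_T \cdot f_T) \cdot x = u_T \cdot h_T \neq 0$
is the term involving $u_T$ in $\mathfrak{d} = x \cdot \mathfrak{f} - \mathfrak{f} \cdot x$. Then $\mathfrak{d} \neq 0$ is in $\mathfrak{D}$ and $p(\mathfrak{d}) = p$.

However $f_I$ is in the center of $\mathfrak{A}$ and $h_I = x \cdot f_I - f_I \cdot x = 0$, a contradiction.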

This proves that $p = 1$ and that the intersection $\mathfrak{U}$ of $\mathfrak{B}$ and $\mathfrak{A}$ is a non-zero
ideal of $\mathfrak{A}$. If $\mathfrak{U}$ contains $b$ and $S$ is in $\mathfrak{G}$ we write $T = S^{-1}$ and have
$u_T \cdot (b \cdot u_S) = d$ where $d = g_{T,S} \cdot bS$ or $bS \cdot g_{T,S}$ is in $\mathfrak{U}$. By Lemma 13 so is $bS$.
Thus $\mathfrak{U}$ is a $\mathfrak{G}$-ideal of $\mathfrak{A}$ and $\mathfrak{U} = \mathfrak{A}$ since $\mathfrak{A}$ is $\mathfrak{G}$-simple. Then $\mathfrak{B}$ contains $e$
and is the unit ideal, $\mathfrak{E}$ is a simple algebra.

We note that the example in Section 6 of a crossed extension which is not a
simple algebra failed to satisfy our hypotheses precisely in that $\mathfrak{G}$ contained
semi-identical transformations $S \neq I$.

We shall not try to compute the center of a crossed extension $\mathfrak{E}$ and so to

13 In the case of ordinary crossed products $\mathfrak{G} = \mathfrak{H}$ and $\mathfrak{A}$ is a field. Our hypotheses are
then satisfied.





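prove that a given $\mathfrak{E}$ is central simple but we shall rather try to see that our
hypotheses are in a form such that they hold also when the field $\mathfrak{F}$ is extended. 14
Thus we prove

Lemma 14. A linear transformation $S$ on a separable algebra $\mathfrak{A}$ is a semi-identical
transformation or an inner transformation for $\mathfrak{A}$ if and only if $S_{\mathfrak{K}}$ has the
corresponding property for $\mathfrak{A}_{\mathfrak{K}}$, where $\mathfrak{K}$ is any scalar extension of $\mathfrak{F}$.

For if (31) holds for a quantity $b$ in $\mathfrak{A}$ and every $x$ of $\mathfrak{A}$ we will also have
$b \cdot x = xS_{\mathfrak{K}} \cdot b$ for every $x$ in $\mathfrak{A}_{\mathfrak{K}}$, $S_{\mathfrak{K}}$ is inner when $S$ is. Also $S_{\mathfrak{K}}$ is semi-identical
when $S$ is since if $\mathfrak{Z}$ is the center of $\mathfrak{A}$ the center of $\mathfrak{A}_{\mathfrak{K}}$ is $\mathfrak{Z}_{\mathfrak{K}}$. Conversely let
$y \cdot x = xS_{\mathfrak{K}} \cdot y$ for every $x$ of $\mathfrak{A}_{\mathfrak{K}}$ and a fixed quantity $y$ in $\mathfrak{A}_{\mathfrak{K}}$. Then we may
write $y = y_1\xi_1 + \cdots + y_t\xi_t$ for $y_j$ in $\mathfrak{A}$ where the $\xi_j$ are in $\mathfrak{K}$ and are such that
$a_1\xi_1 + \cdots + a_t\xi_t = 0$ for $a_j$ in $\mathfrak{A}$ if and only if the $a_j$ are all zero. Take $x$ in $\mathfrak{A}$
and so obtain $xS_{\mathfrak{K}} = xS$ in $\mathfrak{A}$, $y \cdot x - xS_{\mathfrak{K}} \cdot y = \sum_j (y_j \cdot x - xS \cdot y_j)\xi_j = 0$,
$y_j \cdot x = xS \cdot y_j$. Hence if $S_{\mathfrak{K}}$ is inner so is $S$. If $S_{\mathfrak{K}}$ is semi-identical the quantity
$y$ may be taken to be in $\mathfrak{Z}_{\mathfrak{K}}$, $y_j$ is in $\mathfrak{Z}$, $S$ is semi-identical.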

We may thus apply Lemma 14 to Theorem 8 and obtain 

Theorem 9. Let $\mathfrak{A}$ be a separable algebra, 15 $\mathfrak{G}$ be an extending group for $\mathfrak{A}$
such that $\mathfrak{A}$ is $\mathfrak{G}$-central and $I$ is the only semi-identity transformation for $\mathfrak{A}$ in $\mathfrak{G}$.
Then if $\mathfrak{H}$ is any subset of $\mathfrak{G}$ consisting of $I$ and outer transformations for $\mathfrak{A}$ the
crossed extensions $\mathfrak{E} = (\mathfrak{A}, \mathfrak{G}, \mathfrak{H}, \mathfrak{g})$ are central simple algebras.

If $\mathfrak{A}$ is a simple algebra the only semi-identity transformation for $\mathfrak{A}$ is $I$ and
we have

Theorem 10. Let $\mathfrak{G}$ be an extending group for a simple algebra $\mathfrak{A}$, $\mathfrak{H}$ consist
of $I$ and outer transformations for $\mathfrak{A}$ in $\mathfrak{G}$. Then every crossed extension of $\mathfrak{A}$
is a simple algebra.

If $\mathfrak{H}$ consists of $I$ alone we shall write

$\mathfrak{E} = (\mathfrak{A}, \mathfrak{G}, \mathfrak{g})$

for the corresponding crossed extensions. These are surely the most interesting
of our new algebras and we shall state the results of Theorems 8 and 9 for such
algebras as

Theorem 11. Let $\mathfrak{G}$ be an extending group for a simple algebra $\mathfrak{A}$. Then
every crossed extension $\mathfrak{E} = (\mathfrak{A}, \mathfrak{G}, \mathfrak{g})$ is a simple algebra. Moreover if $\mathfrak{A}$ is
central simple so is $\mathfrak{E}$.

9. Associativity 

A crossed extension $\mathfrak{E}$ is associative if and only if $\mathfrak{A}$ is associative and
$[(u_S \cdot a_S) \cdot (u_T \cdot x_T)] \cdot (u_P \cdot w_P) = (u_S \cdot a_S) \cdot [(u_T \cdot x_T) \cdot (u_P \cdot w_P)]$ for every $a_S$, $x_T$, $w_P$ of $\mathfrak{A}$
and $S$, $T$, $P$ of $\mathfrak{G}$. If $\mathfrak{G} \neq \mathfrak{H}$ we take $S$ not in $\mathfrak{H}$, $a_S = e$, $T = P = I$ and
have $u_S \cdot (x \cdot w) = (u_S \cdot x) \cdot w = u_S \cdot (w \cdot x)$ which is possible in an associative

14 This seems to be the best possible method of proof even for ordinary crossed products.

15 In the associative case the simple components of $\mathfrak{A}$ are necessarily equivalent, since
$\mathfrak{A}$ is $\mathfrak{G}$-simple and $\mathfrak{G} = \mathfrak{H}$ is composed of automorphisms. However this does not appear to
be necessary here and this question should be studied.





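algebra $\mathfrak{E}$ if and only if $\mathfrak{A}$ is commutative. But when $\mathfrak{A}$ is commutative the
algebras $\mathfrak{E} = (\mathfrak{A}, \mathfrak{G}, \mathfrak{H}, \mathfrak{g})$ are the same for every $\mathfrak{H}$ and we may take $\mathfrak{E} =
(\mathfrak{A}, \mathfrak{G}, \mathfrak{G}, \mathfrak{g})$. Hence we have $\mathfrak{H} = \mathfrak{G}$ in every case. We now compute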
$(u_S \cdot a_S) \cdot (u_T \cdot x_T) = u_{ST} \cdot (g_{S,T} \cdot a_S T \cdot x_T)$. Similarly $(u_T \cdot x_T) \cdot (u_P \cdot w_P) =
u_{TP} \cdot (g_{T,P} \cdot x_T P \cdot w_P)$. Multiply the first of these products on the right by $u_P \cdot w_P$
and the second on the left by $u_S \cdot a_S$. The resulting products each have the
non-singular left factor $u_{STP}$, and if $\mathfrak{E}$ is associative we have

(32) $g_{ST,P} \cdot [g_{S,T} \cdot (a_S T \cdot x_T)]P \cdot w_P = g_{S,TP} \cdot (a_S TP) \cdot [g_{T,P} \cdot (x_T P \cdot w_P)]$.

Put $a_S = x_T = w_P$ equal to the unity quantity $e$ of $\mathfrak{A}$ and obtain

(33) $g_{ST,P} \cdot (g_{S,T} P) = g_{S,TP} \cdot g_{T,P}$ ($S$, $T$, $P$ in $\mathfrak{G}$).

Next put $S = T = I$ and $w_P = e$ and so obtain

(34) $(a \cdot x)P = aP \cdot xP$,

that is, $\mathfrak{G}$ is a group of automorphisms of $\mathfrak{A}$. Put $x_T = w_P = e$, $P = I$, to
see that the $g_{S,T}$ are in the center of $\mathfrak{A}$. Conversely if (33) holds for an automorphism
group $\mathfrak{G}$ and the $g_{S,T}$ in the center of $\mathfrak{A}$ we have (32) and $\mathfrak{E}$ is
associative.

We shall call an extension set satisfying (33) for an automorphism group $\mathfrak{G}$
of $\mathfrak{A}$ a factor set. Then we have proved

Theorem 12. A crossed extension $\mathfrak{E} = (\mathfrak{A}, \mathfrak{G}, \mathfrak{H}, \mathfrak{g})$ is associative if and only
if $\mathfrak{A}$ is associative, $\mathfrak{G}$ is a group of automorphisms of $\mathfrak{A}$, $\mathfrak{g}$ is a factor set of $\mathfrak{A}$ whose
quantities are in the center of $\mathfrak{A}$, and $\mathfrak{G} = \mathfrak{H}$.

We now have the consequent

Corollary I. Let $\mathfrak{E} = (\mathfrak{A}, \mathfrak{G}, \mathfrak{g})$. Then if $\mathfrak{A}$ is not commutative the algebra
$\mathfrak{E}$ is not associative.

Corollary II. Let $\mathfrak{E} = (\mathfrak{A}, \mathfrak{G}, \mathfrak{g})$ where $\mathfrak{A}$ is a central simple algebra. Then
$\mathfrak{E}$ is a non-associative central simple algebra.

We have tacitly assumed in all of our work that the order $n$ of $\mathfrak{A}$ is not one
and that the order $m$ of $\mathfrak{G}$ is also not one. Then $\mathfrak{E}$ has order $nm$ and $\mathfrak{A}$ is a
proper subalgebra of $\mathfrak{E}$.
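
Corollary I can be seen concretely in a very small case. The sketch below is a hypothetical toy model, not an example from the paper: the algebra is that of 2-by-2 integer matrices, the group has the two elements $I$ and $S$ with $S$ taken to be transposition (assuming that a finite group of non-singular linear transformations fixing $e$ qualifies as an extending group), the subset is $I$ alone, and every quantity of the extension set is $e$. The product is computed from (19), (20), (21), (22); the function names are illustrative.

```python
# Hypothetical toy model: A = 2x2 integer matrices, G = {I, S}, S = transposition,
# H = {I}, and every g_{S,T} equal to the unity quantity e.
E2 = ((1, 0), (0, 1))          # unity quantity e of A
ZERO = ((0, 0), (0, 0))

def mul(a, b):
    """Matrix product in A."""
    return tuple(tuple(sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2))
                 for i in range(2))

def add(a, b):
    return tuple(tuple(a[i][j] + b[i][j] for j in range(2)) for i in range(2))

def act(a, t):
    """Image aT for T in G: index 0 is I, index 1 is S (transpose)."""
    return a if t == 0 else tuple(zip(*a))

def w(s, t, a, x):
    """Formulas (20), (21): aT.x if S is in H = {I}, otherwise x.aT."""
    return mul(act(a, t), x) if s == 0 else mul(x, act(a, t))

def product(a, x):
    """Product of a = (a_I, a_S) and x = (x_I, x_S), using (19) and (22) with g = e."""
    out = [ZERO, ZERO]
    for s in range(2):
        for t in range(2):
            u = (s + t) % 2                  # U = ST
            w_st = w(s, t, a[s], x[t])       # w_{S,T}
            y_st = w(u, 0, E2, w_st)         # y_{S,T} = w(ST, I, g_{S,T}, w_{S,T})
            out[u] = add(out[u], y_st)
    return tuple(out)

e = (E2, ZERO)                               # unity quantity of the extension
u_S = (ZERO, E2)                             # the quantity u_S
X = (((0, 1), (0, 0)), ZERO)                 # x in A
W = (((0, 0), (1, 0)), ZERO)                 # w in A, not commuting with x

left = product(product(u_S, X), W)           # (u_S . x) . w
right = product(u_S, product(X, W))          # u_S . (x . w)
print(left != right)
```

In this model $(u_S \cdot x) \cdot w$ carries the coefficient $w \cdot x$ while $u_S \cdot (x \cdot w)$ carries $x \cdot w$, so the two differ exactly because the chosen matrices do not commute.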


10. Explicit construction 

Let us indicate some of the types of algebras included under our definition.
The first of these are the ordinary crossed products and the special case of cyclic
algebras. These are given by $\mathfrak{E} = (\mathfrak{A}, \mathfrak{G}, \mathfrak{H}, \mathfrak{g})$ for $\mathfrak{G} = \mathfrak{H}$ the automorphism
group of a normal field, $\mathfrak{g}$ a factor set. Then our construction Theorem 9
implies that $\mathfrak{E}$ is central simple and this seems to be a proof of that result which
has been overlooked in the literature. The other algebras of Theorem 12 have
been considered only in a rather special case.

Let us restrict all further attention to the case where $\mathfrak{A}$ is central simple and
$\mathfrak{H}$ is the identity group so that $\mathfrak{E} = (\mathfrak{A}, \mathfrak{G}, \mathfrak{g})$ is not associative and is central
simple. Then we shall define one very interesting type of algebra which is the





crossed extension of $\mathfrak{A}$ by what is, essentially, a permutation group. We shall
call such algebras permutation algebras and define them as follows. We let
$e, u_2, \dots, u_n$ be any basis of $\mathfrak{A}$ over $\mathfrak{F}$ and let $u_1$ be determined by

(35) $u_1 + \cdots + u_n = e$.

Then $u_1, \dots, u_n$ are a basis of $\mathfrak{A}$ over $\mathfrak{F}$ and we have a unique expression

(36) $a = a_1u_1 + \cdots + a_nu_n$ ($a_i$ in $\mathfrak{F}$)

for every $a$ of $\mathfrak{A}$. We define

(37) $aS(P) = a_1u_{i_1} + \cdots + a_nu_{i_n}$

for every permutation

(38) $P = \begin{pmatrix} 1 & 2 & \cdots & n \\ i_1 & i_2 & \cdots & i_n \end{pmatrix}$

and thus have defined a group $\mathfrak{G}$ of non-singular linear transformations $S = S(P)$
on $\mathfrak{A}$ such that $eS = e$ for every permutation group $\mathfrak{G}_0$ of permutations $P$.
Clearly $\mathfrak{G}$ is equivalent to $\mathfrak{G}_0$ and the algebra $\mathfrak{E} = (\mathfrak{A}, \mathfrak{G}, \mathfrak{g})$ is central simple
for every $\mathfrak{g}$. Moreover, this type of algebra is special since, while every finite
group $\mathfrak{G}$ may be represented as a permutation group, $\mathfrak{G}$ may not 16 permute any
set of basal quantities of $\mathfrak{A}$.
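
The action (37) on coordinates is easy to sketch. The snippet below assumes the reading of (37) in which the coefficient of $u_k$ is moved to $u_{i_k}$, and uses hypothetical sample permutations; `apply_perm` and `compose` are illustrative names. It checks that the unity quantity, whose coordinates are all 1 by (35), is left fixed and that applying two permutations in turn agrees with applying their product.

```python
from fractions import Fraction

def apply_perm(coords, images):
    """Formula (37): send a = sum a_k u_k to sum a_k u_{i_k}.

    coords[k-1] is the coefficient a_k; images[k-1] is i_k (1-based)."""
    out = [Fraction(0)] * len(coords)
    for k, a_k in enumerate(coords):
        out[images[k] - 1] += a_k
    return out

def compose(p, q):
    """Permutation 'p then q', matching the right action a S(P) S(Q)."""
    return [q[i - 1] for i in p]

n = 4
e = [Fraction(1)] * n            # by (35), e = u_1 + ... + u_n
P = [2, 1, 4, 3]                 # hypothetical sample permutations
Q = [3, 2, 1, 4]
a = [Fraction(5), Fraction(-1), Fraction(0), Fraction(7, 2)]

fixed_unity = apply_perm(e, P) == e
homomorphic = apply_perm(apply_perm(a, P), Q) == apply_perm(a, compose(P, Q))
print(fixed_unity, homomorphic)
```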

Let us give an iterative process next for the construction of a family of central
simple algebras defined for what is essentially a single group. We let $\mathfrak{E} =
(\mathfrak{A}, \mathfrak{G}, \mathfrak{g})$ be a given central simple crossed extension of order $nm$ over $\mathfrak{F}$
defined for an algebra $\mathfrak{A}$ of order $n$ and a group $\mathfrak{G}$ of order $m$. Then every
quantity $\mathfrak{x}$ of $\mathfrak{E}$ is uniquely expressible in the form

$\mathfrak{x} = u_1 \cdot a_1 + \cdots + u_m \cdot a_m$

for the $a_i$ in $\mathfrak{A}$ and $u_i = u_{S_i}$, $S_1 = I, S_2, \dots, S_m$ the transformations of $\mathfrak{G}$.
Since $\mathfrak{E}$ is central simple any

$\mathfrak{E}_0 = (\mathfrak{E}, \mathfrak{G}_0, \mathfrak{g}_0)$

will be central simple for any extending group $\mathfrak{G}_0$ and extension set $\mathfrak{g}_0$. We let
$\mathfrak{G}_0$ be the set of linear transformations $S_0$,

$\mathfrak{x}S_0 = u_1 \cdot a_1S + \cdots + u_m \cdot a_mS$ ($S$ in $\mathfrak{G}$).

Then $\mathfrak{G}_0$ is a finite group equivalent to $\mathfrak{G}$ and is clearly an extending group
for $\mathfrak{E}$. The algebra $\mathfrak{E}_0 = (\mathfrak{E}, \mathfrak{G}_0, \mathfrak{g}_0)$ is central simple for every $\mathfrak{g}_0$ and we may


16 It would be desirable now to prove the existence of examples of a simple algebra $\mathfrak{A}$
and an extending group $\mathfrak{G}$ such that no basis of $\mathfrak{A}$ exists for which $\mathfrak{G}$ may be regarded as a
permutation group. Observe also that for every field $\mathfrak{A}$ of order $n$ over $\mathfrak{F}$ we have a simple
permutation algebra. This is then a generalization of the crossed product concept where $\mathfrak{A}$
is a normal field, the crossed product is a permutation algebra defined for a normal
basis of $\mathfrak{A}$.





indeed choose $\mathfrak{g}_0$ in $\mathfrak{A}$. This process may be repeated to obtain central simple
algebras of order $nm^t$ for every $\mathfrak{G}$ of order $m$.

In particular we have the generalized crossed products

$(\mathfrak{N}, \mathfrak{G}, \mathfrak{g}_1, \dots, \mathfrak{g}_t)$,

where $\mathfrak{N}$ is a normal field of order $r$, $\mathfrak{G}$ is its automorphism group, the $\mathfrak{g}_i$ are all
factor sets (or merely any extension sets). This algebra has order $r^{t+1}$ over $\mathfrak{F}$.
We thus have the generalized cyclic algebras

$(\mathfrak{N}, S, \gamma_1, \dots, \gamma_t)$

for the $\gamma_i \neq 0$ in $\mathfrak{F}$.

Another process of iteration is that where we define $S_0$ by $\mathfrak{x}S_0 =
u_1 \cdot a_1S + u_2 \cdot a_2 + \cdots + u_m \cdot a_m$ and there are other obvious variations. However
these may possibly give corresponding algebras obtained from the type
given above by the use of a different $\mathfrak{G}$ and extension set. Thus we are led to
the problem of determining when two crossed extensions defined for the same $\mathfrak{A}$
but with distinct groups and extension sets are equivalent, and also when they
are isotopic. The solution of this problem will require a study of automorphisms
of our algebras and also a solution of the simpler problem of determining conditions
that $(\mathfrak{A}, \mathfrak{G}, \mathfrak{H}, \mathfrak{g})$ shall equal $(\mathfrak{A}, \mathfrak{G}, \mathfrak{H}, \mathfrak{f})$ for given distinct extension sets
$\mathfrak{g}$ and $\mathfrak{f}$.

The associative algebra theory suggests for study many other fundamental
problems regarding our new classes of algebras. For example let us call any
algebra equivalent to a generalized cyclic algebra $(\mathfrak{N}, S, 1, \dots, 1)$ a generalized
total matric algebra over $\mathfrak{F}$. Then we seek to study the nature of the simple
subalgebras of all such algebras (as well as of all other crossed extensions) and
in particular to prove that they are all generalized cyclic algebras if $\mathfrak{F}$ is a p-adic
or an algebraic number field. Such a study would probably require a study of
splitting fields, direct products, the $\mathfrak{E}$-centralizer of a simple subalgebra of $\mathfrak{E}$,
further extension of the concept of total matric algebra, similarity for crossed
extensions, a theory of division algebras and a theory of exponents. It seems
clear that our crossed extension definition includes many new varieties of simple
algebras and it should lead to a host of new applications of modern algebraic
techniques.


University of Chicago 



Annals of Mathematics
Vol. 43, No. 4, October, 1942 


EXTENSIONS OF DIFFERENTIAL FIELDS, I 

By E. R. Kolchin 1 
(Received May 6, 1942) 

Introduction 

It is a well-known theorem of algebra that a finite algebraic extension of a
field of characteristic zero $K$ always contains a primitive element $\omega$:

$K(\alpha_1, \dots, \alpha_n) = K(\omega)$.

Moreover, by means of the theory of Galois, it is possible to characterize those
elements of the extension which are primitive. 2 The present paper treats the
analogous problems for differential fields (ordinary or partial).

A simple example shows that the precise analog is not true without further
restriction. Let $\mathfrak{F}_0$ be the ordinary differential field of rational numbers, and
let $\alpha_1$ and $\alpha_2$ be two algebraically independent complex constants. Since $\alpha_1$ and
$\alpha_2$ both have zero derivatives, $\mathfrak{F}_0\langle \alpha_1, \alpha_2 \rangle$ is set-theoretically identical with
$\mathfrak{F}_0(\alpha_1, \alpha_2)$, 3 whence it is clear that there exists no number $\beta \in \mathfrak{F}_0\langle \alpha_1, \alpha_2 \rangle$ such
that $\mathfrak{F}_0\langle \alpha_1, \alpha_2 \rangle = \mathfrak{F}_0\langle \beta \rangle$. However, for the theorem in question to hold it suffices
to place a mild condition on the differential field. In the ordinary case the condition
reduces to the requirement that the differential field contain a non-constant
(that is, an element whose derivative is different from zero); in the general
(partial) case, the condition is that the differential field contain a set of elements
whose Jacobian does not vanish.

In studying those elements of an extension $\mathfrak{G}$ of a differential field $\mathfrak{F}$ which
are primitive, a theorem presents itself which bears a similarity to results from
Galois' theory. However any attempt in this direction seems destined to but
fragmentary results, as the concept analogous to a normal extension of a field
is lacking, so that one must speak of isomorphisms instead of automorphisms,
thereby abandoning the concept of group.

1 . Generic solutions 

Throughout this paper $\mathfrak{F}$ will denote a differential field of characteristic zero
with $m$ types of differentiation $\delta_1, \dots, \delta_m$, 4 and $y_1, \dots, y_n$ will denote unknowns
($m$ and $n$ are positive integers).

Let $\Sigma$ be a system of differential polynomials in $\mathfrak{F}\{y_1, \dots, y_n\}$ with manifold
$\mathfrak{M}$. A set $\eta_1, \dots, \eta_n$ of elements of a differential extension field of $\mathfrak{F}$ will
be called a generic solution of $\Sigma$ (or of $\mathfrak{M}$, with respect to $\mathfrak{F}$) if a necessary and
sufficient condition for a differential polynomial $F(y_1, \dots, y_n)$ in $\mathfrak{F}\{y_1, \dots, y_n\}$
to belong to $\Sigma$ is

$F(\eta_1, \dots, \eta_n) = 0$.

1 National Research Fellow.

2 This is not, of course, the simplest characterization.

3 $\mathfrak{F}\langle u, \dots \rangle$ means the result of the differential field adjunction to $\mathfrak{F}$ of the elements
$u, \dots$. $\mathfrak{F}(u, \dots)$ means, as usual, the result of the field adjunction to $\mathfrak{F}$ (considered as
a field) of the elements $u, \dots$. The result of differential ring adjunction is indicated by
curled brackets: $\mathfrak{F}\{u, \dots\}$.

4 This concept has been discussed by H. W. Raudenbush, Bulletin of the American
Mathematical Society, vol. 40 (1934), pp. 714-720.

It is easy to see that if $\Sigma$ has a generic solution, then $\Sigma$ is a prime differential
ideal in $\mathfrak{F}\{y_1, \dots, y_n\}$, so that $\mathfrak{M}$ is irreducible over $\mathfrak{F}$. Conversely if $\Sigma$ is a
prime differential ideal other than the whole ring $\mathfrak{F}\{y_1, \dots, y_n\}$, then $\Sigma$ has a
generic solution. For example, if, in the differential ring of remainder classes
$\mathfrak{F}\{y_1, \dots, y_n\}/\Sigma$, $\bar{y}_i$ is the remainder class containing $y_i$, then $\bar{y}_1, \dots, \bar{y}_n$ are
elements of a differential field containing $\mathfrak{F}$ (namely, the differential field of
quotients of $\mathfrak{F}\{y_1, \dots, y_n\}/\Sigma$), and $F(y_1, \dots, y_n)$ is in $\Sigma$ if and only if
$F(\bar{y}_1, \dots, \bar{y}_n) = 0$. It is not hard to see moreover, that any generic solution
$\eta_1, \dots, \eta_n$ of $\Sigma$ is equivalent to $\bar{y}_1, \dots, \bar{y}_n$, that is, $\eta_i \to \bar{y}_i$ ($i = 1, \dots, n$)
generates an isomorphism:

$\mathfrak{F}\langle \eta_1, \dots, \eta_n \rangle \cong \mathfrak{F}\langle \bar{y}_1, \dots, \bar{y}_n \rangle$. 5

Now, a prime differential ideal $\Sigma$ in $\mathfrak{F}\{y_1, \dots, y_n\}$ may very well decompose,
over an extension $\mathfrak{G}$ of $\mathfrak{F}$, into several essential prime differential ideals:

(1) $\{\Sigma\} = \Lambda_1 \cap \cdots \cap \Lambda_s$, in $\mathfrak{G}\{y_1, \dots, y_n\}$.

Let $\zeta_1, \dots, \zeta_n$ be a generic solution of some $\Lambda_i$, say of $\Lambda_h$. Then $\zeta_1, \dots, \zeta_n$
is a generic solution of $\Sigma$. Indeed, it is clear that $F(y_1, \dots, y_n) \in \Sigma$
implies $F(\zeta_1, \dots, \zeta_n) = 0$, as $\Sigma \subseteq \Lambda_h$. Conversely, suppose that $F =
F(y_1, \dots, y_n) \in \mathfrak{F}\{y_1, \dots, y_n\}$, and that $F(\zeta_1, \dots, \zeta_n) = 0$. Let

$G \in \Lambda_1 \cap \cdots \cap \Lambda_{h-1} \cap \Lambda_{h+1} \cap \cdots \cap \Lambda_s$, $G \notin \Lambda_h$.

Then $FG$ vanishes for all solutions of $\Sigma$, so that, by the Ritt analog of the
Nullstellensatz, some power $(FG)^k$ is a linear combination, with coefficients in
$\mathfrak{G}\{y_1, \dots, y_n\}$, of differential polynomials in $\Sigma$:

$F^kG^k = C_1S_1 + \cdots + C_tS_t$ ($S_i \in \Sigma$).

The coefficients of $G^k, C_1, \dots, C_t$ are in $\mathfrak{G}$. Letting $\omega_1, \dots, \omega_g$ be, with
respect to $\mathfrak{F}$, a linearly independent linear basis of these coefficients, we find a
relation

$F^k(H_1\omega_1 + \cdots + H_g\omega_g) = T_1\omega_1 + \cdots + T_g\omega_g$,

where each $T_i \in \Sigma$, each $H_i \in \mathfrak{F}\{y_1, \dots, y_n\}$, and $H_1\omega_1 + \cdots + H_g\omega_g = G^k$.
Equating coefficients, on both sides, of the linearly independent elements $\omega_i$,
we see that

$F^kH_i = T_i \in \Sigma$ ($i = 1, \dots, g$).

But not every $H_i$ is in $\Sigma$, for otherwise $G$ would be in $\Lambda_h$. Hence, since $\Sigma$ is a
prime ideal, $F \in \Sigma$.

5 The isomorphism indicated by the symbol $\cong$ maps not only the sum and product of
two elements onto the sum and product, respectively, of their images, but also the various
derivatives of an element onto the corresponding derivatives of its image.

We use this result to prove that if the prime differential ideal $\Sigma$ in $\mathfrak{F}\{y_1, \dots, y_n\}$
has a generic solution $\eta_1, \dots, \eta_n$, if $\mathfrak{G}$ is a differential extension field of
$\mathfrak{F}\langle \eta_1, \dots, \eta_n \rangle$, and if no extension of $\mathfrak{G}$ contains another generic solution of $\Sigma$,
then each $\eta_i \in \mathfrak{F}$.

For, let (1) be the decomposition of $\{\Sigma\}$ into essential prime differential ideals
in $\mathfrak{G}\{y_1, \dots, y_n\}$. Any generic solution of $\Lambda_1$ is a generic solution of $\Sigma$ and
therefore is identical with $\eta_1, \dots, \eta_n$. The same holds for every $\Lambda_i$, so that
$s = 1$, $\{\Sigma\} = [y_1 - \eta_1, \dots, y_n - \eta_n]$, and the only solution of $\Sigma$ is $\eta_1, \dots, \eta_n$.
Assume, now, that $\eta_1 \notin \mathfrak{F}$. For some $k$,

$(y_1 - \eta_1)^k = C_1S_1 + \cdots + C_tS_t$ ($S_i \in \Sigma$).

We suppose that $k$ has been chosen as low as possible, so that $1, \eta_1, \dots, \eta_1^k$
are linearly independent over $\mathfrak{F}$. Letting $1, \eta_1, \dots, \eta_1^k, \omega_1, \dots, \omega_g$ be a
linearly independent linear basis, with respect to $\mathfrak{F}$, of the coefficients in
$(y_1 - \eta_1)^k, C_1, \dots, C_t$, and equating coefficients of $\eta_1^k$, we arrive at the contradiction
that $1 \in \Sigma$. Hence $\eta_1 \in \mathfrak{F}$; similarly, every $\eta_i \in \mathfrak{F}$.

2 . Relative isomorphisms 

Let $\mathfrak{G}$ be a differential extension field of $\mathfrak{F}$. By an isomorphism of $\mathfrak{G}$ with
respect to $\mathfrak{F}$ we shall mean an isomorphic mapping of $\mathfrak{G}$ onto a differential field
$\mathfrak{G}'$ such that

(a) $\mathfrak{G}'$ is an extension of $\mathfrak{F}$,

(b) the isomorphic mapping leaves each element of $\mathfrak{F}$ invariant,

(c) $\mathfrak{G}$ and $\mathfrak{G}'$ have a common extension.

By means of well-ordering methods it is easy to show that an isomorphism
of $\mathfrak{G}$ with respect to $\mathfrak{F}$ can be extended to an automorphism of the common
extension of $\mathfrak{G}$ and its map under the isomorphism.

Concerning such relative isomorphisms we prove the following theorem:

Let $\mathfrak{G}$ be an extension of $\mathfrak{F}$, and let $\gamma \in \mathfrak{G}$. A necessary and sufficient condition
that $\gamma \in \mathfrak{F}$ is that every isomorphism of $\mathfrak{G}$ with respect to $\mathfrak{F}$ leaves $\gamma$ invariant. A
necessary and sufficient condition that $\gamma$ be a primitive element, that is, that
$\mathfrak{G} = \mathfrak{F}\langle \gamma \rangle$, is that no isomorphism of $\mathfrak{G}$ with respect to $\mathfrak{F}$ other than the identity
leaves $\gamma$ invariant.

Proof: A. If $\gamma \in \mathfrak{F}$, then by condition (b), every isomorphism of $\mathfrak{G}$ with
respect to $\mathfrak{F}$ leaves $\gamma$ invariant. Now let $\gamma \notin \mathfrak{F}$, and denote by $T$ the prime
differential ideal of all differential polynomials in $\mathfrak{F}\{y\}$ which vanish for $y = \gamma$.
$\gamma$ is a generic solution of $T$. Since $\gamma \notin \mathfrak{F}$, we know by §1 that there exists a
differential field $\mathfrak{H} \supseteq \mathfrak{G}$ in which $T$ has another generic solution $\gamma'$. Now,
$\gamma \to \gamma'$ generates an isomorphism between $\mathfrak{F}\langle \gamma \rangle$ and $\mathfrak{F}\langle \gamma' \rangle$ which leaves invariant
every element of $\mathfrak{F}$. This isomorphism can be extended to an automorphism
of $\mathfrak{H}$, which automorphism in turn can be contracted to produce an isomorphism
of $\mathfrak{G}$ with respect to $\mathfrak{F}$ which does not leave $\gamma$ invariant.

B. If $\mathfrak{G} = \mathfrak{F}\langle \gamma \rangle$, every element of $\mathfrak{G}$ is a rational function, with coefficients
in $\mathfrak{F}$, of $\gamma$ and its various derivatives, so that an isomorphism of $\mathfrak{G}$ with respect
to $\mathfrak{F}$ which leaves $\gamma$ invariant leaves every element of $\mathfrak{G}$ invariant, that is, is the
identity isomorphism. Conversely, if $\mathfrak{G} \neq \mathfrak{F}\langle \gamma \rangle$, there is an element $\alpha \in \mathfrak{G}$ such
that $\alpha \notin \mathfrak{F}\langle \gamma \rangle$. By the part of the theorem already proved there is an isomorphism
of $\mathfrak{G}$ with respect to $\mathfrak{F}\langle \gamma \rangle$ which does not leave $\alpha$ invariant. This is
an isomorphism of $\mathfrak{G}$ with respect to $\mathfrak{F}$, other than the identity, which leaves $\gamma$
invariant.

The existence, in certain general cases, of a primitive element will be demon¬ 
strated in §4, after the proof of a preparatory result in §3. 

3. Non-vanishing of nonzero differential polynomials 

The following lemma will be used in §4. 

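A necessary and sufficient condition that, for an arbitrary nonzero differential
polynomial $A = A(y_1, \dots, y_n) \in \mathfrak{F}\{y_1, \dots, y_n\}$, there exist elements $\eta_1, \dots,
\eta_n \in \mathfrak{F}$ such that $A(\eta_1, \dots, \eta_n) \neq 0$, is that $\mathfrak{F}$ contain $m$ elements $\xi_1, \dots, \xi_m$
whose Jacobian is different from zero:

$J = \begin{vmatrix} \delta_1\xi_1 & \cdots & \delta_m\xi_1 \\ \vdots & & \vdots \\ \delta_1\xi_m & \cdots & \delta_m\xi_m \end{vmatrix} \neq 0$.

Proof: Necessity. If $\mathfrak{F}$ has the property in question, then, in particular,
there are elements $\xi_1, \dots, \xi_m$ which do not annul

$J(y_1, \dots, y_m) = \begin{vmatrix} \delta_1y_1 & \cdots & \delta_my_1 \\ \vdots & & \vdots \\ \delta_1y_m & \cdots & \delta_my_m \end{vmatrix}$.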
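Sufficiency. It obviously suffices to consider the case $n = 1$: $A =
A(y) \in \mathfrak{F}\{y\}$. Now, since $J \neq 0$ there exists an $m \times m$ matrix $(\alpha_{ij})$, with elements
in $\mathfrak{F}$, such that $(\alpha_{ij})(\delta_j\xi_k)$ is the unit matrix. Hence, if we introduce the
operators

$\delta'_i = \alpha_{i1}\delta_1 + \cdots + \alpha_{im}\delta_m$ ($i = 1, \dots, m$)

in terms of which, in turn, the operators $\delta_j$ may be expressed

$\delta_j = \beta_{j1}\delta'_1 + \cdots + \beta_{jm}\delta'_m$ ($j = 1, \dots, m$),

we shall have

$\delta'_i\xi_k = 1$ if $i = k$, $\delta'_i\xi_k = 0$ if $i \neq k$.

Moreover, since

$\delta'_p\delta'_q = \sum_i \alpha_{pi}\delta_i \sum_j \alpha_{qj}\delta_j = \sum_i \sum_j \alpha_{pi}\alpha_{qj}\delta_i\delta_j + \sum_j \bigl(\sum_i \alpha_{pi}\delta_i\alpha_{qj}\bigr)\delta_j$,

we see that

$\delta'_p\delta'_q = \delta'_q\delta'_p + \sum_r \gamma_{pqr}\delta'_r$ ($\gamma_{pqr} \in \mathfrak{F}$).

Hence $A(y)$ may be expressed as a polynomial, with coefficients in $\mathfrak{F}$, in the
quantities $\delta_1'^{\,i_1} \cdots \delta_m'^{\,i_m} y$:

$A(y) = P(\dots, \delta_1'^{\,i_1} \cdots \delta_m'^{\,i_m} y, \dots)$.

Letting the symbols $c_{i_1 \cdots i_m}$ denote constants in $\mathfrak{F}$ such that

$P(\dots, c_{i_1 \cdots i_m}, \dots) \neq 0$,

and letting $a_1, \dots, a_m$ be unknown constants (that is, indeterminates all of
whose derivatives are zero), form the expression

$\bar{\eta} = \sum \frac{c_{i_1 \cdots i_m}}{i_1! \cdots i_m!} (\xi_1 - a_1)^{i_1} \cdots (\xi_m - a_m)^{i_m}$.

By the above, $\bar{\eta}$ satisfies the congruences

$\delta_1'^{\,i_1} \cdots \delta_m'^{\,i_m} \bar{\eta} \equiv c_{i_1 \cdots i_m}$ $(\xi_1 - a_1, \dots, \xi_m - a_m)$.

Hence

$A(\bar{\eta}) \equiv P(\dots, c_{i_1 \cdots i_m}, \dots)$ $(\xi_1 - a_1, \dots, \xi_m - a_m)$,

that is, $A(\bar{\eta})$ is a polynomial in the indeterminates $d_j = \xi_j - a_j$ ($j = 1, \dots, m$)
with coefficients in $\mathfrak{F}$, and these coefficients are not all zero. Therefore we may
choose rational values $\alpha_j$ for the unknown constants $a_j$ so that, for

$\eta = \sum \frac{c_{i_1 \cdots i_m}}{i_1! \cdots i_m!} (\xi_1 - \alpha_1)^{i_1} \cdots (\xi_m - \alpha_m)^{i_m}$,

we have $A(\eta) \neq 0$, q.e.d.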
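
The sufficiency argument can be followed numerically in the simplest ordinary case. The sketch below assumes $m = 1$, the field of rational functions of $x$ with differentiation $d/dx$, and the non-constant $\xi = x$, so that the new operator coincides with the given one. The differential polynomial $A(y) = y\,y'' - (y')^2$, the constants `c` and the value of `a` are hypothetical samples, and all function names are illustrative. It builds the Taylor-type expression of the proof and checks that $A$ of it, evaluated at $x = a$, equals the polynomial $P$ at the chosen constants.

```python
from fractions import Fraction
from math import factorial

# Polynomials in x are coefficient lists [p0, p1, ...] with Fraction entries.
def padd(p, q):
    n = max(len(p), len(q))
    return [(p[i] if i < len(p) else 0) + (q[i] if i < len(q) else 0) for i in range(n)]

def pmul(p, q):
    out = [Fraction(0)] * (len(p) + len(q) - 1)
    for i, pi in enumerate(p):
        for j, qj in enumerate(q):
            out[i + j] += pi * qj
    return out

def pderiv(p):
    return [i * p[i] for i in range(1, len(p))] or [Fraction(0)]

def peval(p, x):
    return sum(c * x**i for i, c in enumerate(p))

def taylor_eta(c, a):
    """eta = sum_k c_k / k! * (x - a)^k, the expression formed in the proof (m = 1)."""
    eta, power = [Fraction(0)], [Fraction(1)]
    for k, c_k in enumerate(c):
        eta = padd(eta, [Fraction(c_k, factorial(k)) * t for t in power])
        power = pmul(power, [Fraction(-a), Fraction(1)])
    return eta

def A(y):
    """Hypothetical sample differential polynomial A(y) = y*y'' - (y')^2."""
    d1 = pderiv(y)
    d2 = pderiv(d1)
    return padd(pmul(y, d2), [-t for t in pmul(d1, d1)])

def P(c0, c1, c2):
    """The same expression as an ordinary polynomial in the values of y, y', y''."""
    return c0 * c2 - c1 * c1

c = (1, 0, 1)            # constants chosen so that P(c) != 0
a = Fraction(3)
eta = taylor_eta(c, a)
value_at_a = peval(A(eta), a)
print(value_at_a, P(*c))
```

Since the value at $x = a$ is not zero, $A(\eta)$ is not the zero polynomial, which is the conclusion of the lemma for this sample. The proof itself keeps $a$ as an indeterminate constant and specializes it only at the end; the sketch fixes it from the start.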

4. Existence of a primitive element 


We are now in a position to prove our principal 

Theorem. Let gf contain m elements whose Jacobian is different from zero. 
If \S(ai ,••• ,a n ) is a differential extension field of 5 such that each a t - is a solution 
of a nonzero differential polynomial in g{?/}, then there exists a primitive element y: 

3(«i »•••>“») = 

By §2 we must show that there exists aye g(ai ,•••,«„) which is invariant 



EXTENSIONS OF DIFFERENTIAL FIELDS, I 


729 


under no isomorphism of , • • • , a n ) with respect to 5* We shall prove, 
as a lemma, a stronger result. 

Let $A_i(y_i) \in \mathfrak{F}\{y_i\}$ have the solution $y_i = \alpha_i$ $(i = 1, \dots, n)$. We shall show
that there exist elements $r_1, \dots, r_n \in \mathfrak{F}$ such that $r_1 y_1 + \cdots + r_n y_n$ assumes
different values for different solutions of $\{A_1(y_1), \dots, A_n(y_n)\}$.$^{6}$ Then certainly
the element $r_1\alpha_1 + \cdots + r_n\alpha_n$ will satisfy our requirements on $\gamma$.

To prove this lemma, let $z_1, \dots, z_n, t_1, \dots, t_n$ be new unknowns, and, in
$\mathfrak{F}\{y_1, \dots, y_n, z_1, \dots, z_n, t_1, \dots, t_n\}$, consider the perfect differential ideal

$$\Omega = \{A_1(y_1), \dots, A_n(y_n), A_1(z_1), \dots, A_n(z_n), t_1(y_1 - z_1) + \cdots + t_n(y_n - z_n)\}.$$

Let $\Omega = \Omega_1 \cap \cdots \cap \Omega_s$ be the decomposition of $\Omega$ into essential prime differential
ideals, and suppose the subscripts have been assigned so that $\Omega_1, \dots, \Omega_r$ each
contains every $y_i - z_i$, whereas $\Omega_{r+1}, \dots, \Omega_s$ each fails to contain some $y_i - z_i$.
Consider an $\Omega_j$ with $j > r$. Let $\eta_1, \dots, \eta_n, \zeta_1, \dots, \zeta_n, \tau_1, \dots, \tau_n$ be a
generic solution of $\Omega_j$. Since $\tau_1(\eta_1 - \zeta_1) + \cdots + \tau_n(\eta_n - \zeta_n) = 0$, and some
$\eta_i - \zeta_i$ is different from zero, $\tau_1, \dots, \tau_n$ are dependent$^{7}$ over $\mathfrak{F}(\eta_1, \dots, \eta_n,
\zeta_1, \dots, \zeta_n)$. But each $\eta_i$ and each $\zeta_i$ annul a nonzero differential polynomial
with coefficients in $\mathfrak{F}$. Hence $\tau_1, \dots, \tau_n$ are dependent$^{8}$ over $\mathfrak{F}$, so that $\Omega_j$
contains a nonzero differential polynomial $L_j \in \mathfrak{F}\{t_1, \dots, t_n\}$. Now let
$M(t_1, \dots, t_n) = L_{r+1} \cdots L_s$. By the authority of §3 choose elements $r_1,
\dots, r_n$ for which $M(r_1, \dots, r_n) \neq 0$. For any two distinct solutions $y_i = \eta_i$
$(i = 1, \dots, n)$ and $y_i = \zeta_i$ $(i = 1, \dots, n)$ of $\{A_1(y_1), \dots, A_n(y_n)\}$, the $3n$
elements

$$\eta_1, \dots, \eta_n, \zeta_1, \dots, \zeta_n, r_1, \dots, r_n$$

cannot be a solution of $\Omega$. For, these elements cannot be a solution of any $\Omega_j$
with $j \le r$ as each such $\Omega_j$ contains every $y_i - z_i$, and they cannot be a solution
of an $\Omega_j$ with $j > r$ as each such $\Omega_j$ contains $M(t_1, \dots, t_n)$. Consequently

$$r_1(\eta_1 - \zeta_1) + \cdots + r_n(\eta_n - \zeta_n) \neq 0.$$

Since $\eta_1, \dots, \eta_n$ and $\zeta_1, \dots, \zeta_n$ were chosen as any two distinct solutions of
$\{A_1(y_1), \dots, A_n(y_n)\}$, the proof of the lemma, and therefore of the theorem,
is complete.

Institute for Advanced Study

6 We lean heavily here on the proof for the ordinary case given by J. F. Ritt, Differential
equations from the algebraic standpoint, American Mathematical Society Colloquium
Publications, vol. XIV, New York, 1932. See especially pp. 26-31.

7 See Raudenbush, loc. cit. 

8 See Raudenbush, loc. cit. 



Annals of Mathematics
Vol. 43, No. 4, October, 1942


THE TOPOLOGICAL COUNTERPART OF UNIQUENESS IN
THE THEORY OF DIFFERENTIAL EQUATIONS 1

By N. Aronszajn

(Received December 13, 1940; revised July 25, 1942) 

In the theory of differential equations, ordinary as well as partial, both
existence theorems and uniqueness theorems have been established. It has turned
out in many cases that, while for the existence theorems it suffices to impose very
weak regularity hypotheses on the members of the equation, sometimes reducing to
continuity alone (as in the case of systems of ordinary differential equations),
stronger regularity hypotheses are needed to ensure uniqueness.

The question arises of characterizing, in the cases of multiplicity that result
from weakening the regularity hypotheses, the set of multiple solutions.
It is immediately apparent that this characterization must take into account the
topological properties of the set in question, and that for this purpose a topology
must be introduced in that set.

Along these lines we have succeeded in establishing a class of sets to which
belong all the sets of multiple solutions corresponding to the equations in
question. It seems probable to us that every set of this class is homeomorphic to
the set of multiple solutions of an equation of the type considered. If this
conjecture were true, we would thereby have a complete topological characterization
of these sets of multiplicity and, at the same time, the topological counterpart of
uniqueness in the case of certain types of equations admitting multiple solutions.

The sets of the class mentioned will be denoted by $R_\delta$. They are
limits of decreasing sequences of sets $R$, where by $R$ we denote the
absolute retracts of K. Borsuk.$^{2}$ The $R_\delta$ retain many properties of
absolute retracts.

Our main result may be stated intuitively (but not very precisely) as follows:
If the members of the equation in question can be approximated as closely as one
wishes by the members of a more regular equation admitting a unique solution, then
the set of solutions of the first equation is an $R_\delta$.

Let us remark that, in the particular cases we have been able to treat, the
hypothesis of our theorem concerning approximation could be verified thanks
to the Weierstrass theorem on the approximation of a continuous function by
polynomials, or thanks to similar theorems. In this connection, let us note that
this theorem of Weierstrass has already been applied by M. Müller$^{3}$ in
the case of systems of ordinary differential equations, to prove a
theorem of H. Kneser. This latter theorem, which concerns the continuum character
of the set of solutions, is a simple consequence of our theorem (for an
$R_\delta$ is always a continuum).

1 This article is an expanded version of a lecture that the author gave on April 19
in Paris, at a meeting of the Société Math. de France. The present abnormal circumstances
have not allowed this article to be developed as fully as the author
would have wished. The bibliographical side in particular is deficient, but the author
was unable to do better and apologizes for it.

2 On retracts, see the articles of K. Borsuk in Fundamenta Math., beginning with
vol. 17 (1931), pp. 152-170.

The paper consists of four sections. In §1 we recall
certain essential results and definitions. In §2 we prove an auxiliary
theorem concerning sequences of absolute retracts. §3 is devoted
to the fundamental result of the paper. Applications to systems of ordinary
differential equations form the content of §4.

1. Preliminary Results

According to Borsuk,$^{2}$ an absolute retract ($R$) is a separable metric space which
is a retract of every metric space containing it. Absolute retracts
have the fixed point property, that is, every continuous mapping of $R$ into itself
possesses an invariant point. We introduce the notation $R_\delta$ to designate
any homeomorph of the intersection of a decreasing sequence of absolute retracts.
One can easily show that an $R_\delta$ is a continuum whose homology and
fundamental groups are those of a point. Of course, these properties are known
to belong to $R$ as well. However, $R_\delta$ and $R$ may differ in
their local properties. For example, an $R_\delta$ may fail to be locally
connected, as is clearly shown by the classical example $y = \sin(2\pi/x)$
for $0 < x \le 1$ and $-1 \le y \le 1$ for $x = 0$. We note in passing
that an $R_\delta$ in the Euclidean plane does not separate the plane.

With a view to obtaining general conclusions, our results are formulated
for certain operator equations of the form

$$w = T(z)$$

in Banach spaces.$^{4}$ If $T$ is continuous and maps the bounded sets
of $E$ into (conditionally) compact sets of $E$, then $T$ is said to be
completely continuous. Our main contribution, Theorem C, is
based on a general existence theorem due to Schauder.$^{5}$

THEOREM A. If $T$ is completely continuous and maps $K$ into $K$, where $K$
is bounded, convex and closed in $E$, then its set of fixed points is a nonempty subset,
compact in itself, of an $R$.

The equivalence with Schauder's formulation results from the fact that a convex
compactum (here the closed convex hull$^{6}$ of $T(K)$) in a
Banach space is an $R$. It would be interesting to know whether Theorem C can be
extended to the cases where the underlying (fundamental) existence theorems are
proved by the methods of Leray-Schauder.$^{7}$

3 See M. Müller, Math. Zeitschrift, 28 (1928), pp. 619-645.

4 S. Banach, Théorie des Opérations Linéaires, Warsaw, 1932.

5 See Math. Zeitschrift, 26 (1927), pp. 46-65, and Studia Math., 1 and 2. Theorems of
this type had already been given by G. D. Birkhoff and O. D. Kellogg, Transactions Amer. Math.
Soc., 23 (1925), pp. 96-115; Lefschetz, Topology (New York, 1930), p. 358, and Annals of
Mathematics, vol. 38 (1937), pp. 819-822.

6 S. Mazur, Studia, 2 (1930), pp. 7-10.

2. Sequences of Absolute Retracts

THEOREM B. Let $\{R^{(n)}\}$ be a sequence of absolute retracts, subsets of one and the
same space, and let $M$ be a set contained in all the $R^{(n)}$. If the $R^{(n)}$
converge to $M$, then this latter set is an $R_\delta$.

Proof. Let $\mathcal{E}$ be the space containing all the $R^{(n)}$ and let $\varphi_n$ be a function
retracting $\mathcal{E}$ onto $R^{(n)}$.

We may always suppose that the space $\mathcal{E}$ is metrizable and that a distance
$\rho(x, y)$ bounded above has been chosen for it (otherwise we
could replace $\mathcal{E}$ by the union of all the $R^{(n)}$, which certainly has these
properties).

We shall choose a subsequence $\{R^{(k)}\}$ of $\{R^{(n)}\}$ such that, denoting
by $\varphi_k$ the function corresponding to $R^{(k)}$, and by $\psi_i^{(k)}$, for $i < k$, the
composite function

(1) $\psi_i^{(k)} = \varphi_i \varphi_{i+1} \cdots \varphi_{k-1}$,

we have, for all $k$, $i < k$ and $x \in R^{(k)}$,

(2) $\rho(x, \psi_i^{(k)}(x)) \le 1/k$.

To define the $R^{(k)}$ we begin by taking for $R^{(1)}$ the first term of the given sequence.
Suppose now that $R^{(1)}, R^{(2)}, \dots, R^{(k)}$ are already defined. By (1), $\psi_i^{(k+1)}$ is then
defined, and for every $x$ of $M$ and every $i < k + 1$ we have

$$\psi_i^{(k+1)}(x) = x,$$

because $M \subset R^{(r)}$ for every $r$, and consequently $\varphi_r(x) = x$. It follows that there exists
a neighborhood $V$ of $M$ such that, for $x \in V$ and every $i < k + 1$, we have

$$\rho(x, \psi_i^{(k+1)}(x)) \le \frac{1}{k+1}.$$

We set $R^{(k+1)}$ equal to the first $R^{(n)}$ following $R^{(k)}$ in the sequence $\{R^{(n)}\}$ and
contained in $V$. It is clear that such an $R^{(n)}$ exists, since the $R^{(n)}$ converge
to $M$. Thus the $R^{(k)}$ are defined successively and property (2) is
satisfied.

Consider now the infinite Cartesian product $\mathcal{E}^\infty = \mathcal{E} \times \mathcal{E} \times \cdots$
with elements $X = (x_1, x_2, \dots, x_n, \dots)$, where $x_n \in \mathcal{E}$. A distance in the manner of
Fréchet is defined in $\mathcal{E}^\infty$ by

$$\rho(X, Y) = \sum_{n=1}^{\infty} 2^{-n} \rho(x_n, y_n).$$

The corresponding notion of limit is as follows: the sequence $\{X^{(i)}\}$
converges to $X$ if, for each $n$, the sequence $\{x_n^{(i)}\}$ converges to $x_n$.

7 See J. Leray and J. Schauder, Annales Scient. École Norm. Sup., 51 (1934), pp. 45-78.

Consider in $\mathcal{E}^\infty$ the subsets $Q^{(k)}$ defined as follows:
$Q^{(k)}$ consists of all points $X = (x_1, x_2, \dots, x_k, \dots)$ such that
$x_n \in R^{(n)}$ for $n \ge k$, while $x_n = \psi_n^{(k)}(x_k)$ for $n < k$.

It is clear that $Q^{(k)}$ is homeomorphic to the set of all sequences
$(x_k, x_{k+1}, \dots)$ where $x_n$ ranges over $R^{(n)}$, $n = k, k + 1, \dots$. This set forms
the Cartesian product $R^{(k)} \times R^{(k+1)} \times \cdots$ of absolute retracts $R^{(n)}$; it is therefore
an absolute retract.$^{8}$ It follows that $Q^{(k)}$ is an absolute retract.

Note next that $Q^{(k)} \supset Q^{(k+1)}$. Indeed, if $X = (x_1, x_2, \dots, x_k,
x_{k+1}, \dots)$ belongs to $Q^{(k+1)}$, then by the definition of $Q^{(k+1)}$ we have: $x_n \in R^{(n)}$
for $n \ge k + 1$; $x_k = \psi_k^{(k+1)}(x_{k+1}) = \varphi_k(x_{k+1}) \in R^{(k)}$; and finally, for $n < k$,
$x_n = \psi_n^{(k+1)}(x_{k+1}) = \varphi_n \varphi_{n+1} \cdots \varphi_k(x_{k+1}) = \psi_n^{(k)}(\varphi_k(x_{k+1})) = \psi_n^{(k)}(x_k)$; hence $X \in Q^{(k)}$.

Let us now prove that the decreasing sequence $Q^{(1)}, Q^{(2)}, \dots$ has as its
intersection the set $M'$ consisting of all $X = (x_1, x_2, \dots)$ with $x_1 = x_2 = x_3 =
\cdots = x \in M$. Indeed, if $X = (x_1, x_2, \dots)$ belongs to all the $Q^{(k)}$, then
by (2) we have, for every $k$ and every $n < k$,

$$\rho(x_k, x_n) = \rho(x_k, \psi_n^{(k)}(x_k)) \le \frac{1}{k}.$$

It follows that every $x_n$, $n = 1, 2, \dots$, is the limit of the sequence $\{x_k\}$, which is
necessarily convergent. It follows on the one hand that $x_1 = x_2 = \cdots = x$.
On the other hand, $x_k \in R^{(k)}$ and, the $R^{(k)}$ converging to $M$, the limit $x$ of $\{x_k\}$
belongs to $M$. Thus $M' \supset Q^{(1)} \cap Q^{(2)} \cap \cdots$. Conversely, if $X \in M'$, it belongs to
every $Q^{(k)}$, for $x_n = x \in M \subset R^{(n)}$ when $n \ge k$ and, when $n < k$, $x_n = x =
\psi_n^{(k)}(x) = \psi_n^{(k)}(x_k)$, since every $\psi$ transforms an $x \in M$ into itself. It is
thus proved that $M' = Q^{(1)} \cap Q^{(2)} \cap \cdots$.

Finally, it is evident that the set $M'$ is homeomorphic to $M$ by means of
the correspondence which assigns to an $X = (x, x, x, \dots)$ of $M'$ the point $x$ of $M$
as image.

Thus $M$ is homeomorphic to $M'$ which is, by the above, an $R_\delta$.
$M$ is therefore itself an $R_\delta$, q.e.d.

3. The Main Theorem

In order to pursue our reasoning we shall assume that the transformation
$T$ can be approximated as closely as one wishes by a "more regular"
transformation.

To make this hypothesis precise, let us return to the notation of §1. We
shall suppose that to every $\varepsilon > 0$ there can be made to correspond a transformation $T_\varepsilon$,
completely continuous, of the space $E$ into itself such that

1°. $\| T_\varepsilon(z) - T(z) \| \le \varepsilon$ for every element $z$ of $K$, $\| \ \|$ denoting the norm
in $E$;

2°. The transformation $z_1 = z - T_\varepsilon(z) = H_\varepsilon(z)$ maps the set $K$ in a one-to-one manner
onto a set containing a sphere $\| z \| \le \rho$, with $\rho$ independent of $\varepsilon$.

8 N. Aronszajn and K. Borsuk, Fundamenta, 18 (1932), pp. 193-197.

While the first condition specifies the manner in which $T$ is approximated
by $T_\varepsilon$, the second condition may be called the "uniqueness condition," for it has
as a consequence the existence and uniqueness (if one restricts oneself to solutions
belonging to $K$) of the solution of the equation in $z$

$$z - T_\varepsilon(z) = z_1,$$

for $z_1$ of sufficiently small norm. As we have already remarked,
in order to have uniqueness of the solution one must in general assume supplementary
regularity conditions; this is why we say that $T_\varepsilon$ is "more regular"
than $T$.

Under these conditions we can prove

THEOREM C. The set of solutions of the equation $T(z) = z$ is an $R_\delta$.

Proof. Denote this set of solutions by $S$. Consider
the transformations $T_n = T_{\varepsilon_n}$ for a sequence $\{\varepsilon_n\}$ tending to 0, all
the $\varepsilon_n$ being $\le \rho$.

Consider first the transformations $T_n$ and $H_n$ for a fixed $n$. The set $S$
being contained in $K$, we have by 1°, for every element $\zeta$ of $S$,

$$\| H_n(\zeta) \| = \| H_{\varepsilon_n}(\zeta) \| = \| \zeta - T_n(\zeta) \| = \| T(\zeta) - T_n(\zeta) \| \le \varepsilon_n.$$

Consequently, the transformed set $H_n(S)$ is contained in the sphere of
radius $\varepsilon_n \le \rho$. This set, obtained by a continuous transformation from a set
compact in itself, is compact in itself (see Theorem A). The smallest
convex body containing it is also compact in itself and contained in the sphere
of radius $\varepsilon_n \le \rho$.

Let $Q_n$ be this convex body. By condition 2°, the transformation $H_n^{-1}$,
inverse to the transformation $H_n$, is defined on $Q_n$. It gives an image
$R^{(n)} = H_n^{-1}(Q_n)$ of $Q_n$ contained in the set $K$.

The inverse transformation $H_n^{-1}$ is not in general continuous, but we shall
show that it is continuous on $Q_n$. Indeed, if a sequence of elements $h_k$ of $Q_n$ tends
to $h$ (which also belongs to $Q_n$, the latter being compact in itself), we have for the
elements $z_k = H_n^{-1}(h_k)$ the equations

$$z_k - T_n(z_k) = h_k.$$

Since the $z_k$ belong to the bounded set $K$, they form a bounded sequence, and the
completely continuous transformation $T_n$ transforms this sequence into a
compact sequence. If the $z_k$ did not tend to the element $z = H_n^{-1}(h)$ given by
the equation

$$z - T_n(z) = h,$$

one could extract from the $z_k$ a sequence $\{z_{k_i}\}$ not admitting $z$ as a limit
element and such that the $T_n(z_{k_i})$ converge to an element $g$. But then the
elements $z_{k_i} = T_n(z_{k_i}) + h_{k_i}$ would converge to $g + h$ and the $T_n(z_{k_i})$ would
converge to $T_n(g + h)$, which would be equal to $g$, and we would have

$$g + h = T_n(g + h) + h;$$

$g + h$ would thus be the solution $z$ of $z - T_n(z) = h$ and the $z_{k_i}$ would converge to it,
a contradiction.

This contradiction proves that, on $Q_n$, the transformation $H_n^{-1}$ is continuous.
Since its inverse $H_n$ is, by 2°, one-to-one and continuous, the transformation
$H_n^{-1}$ maps $Q_n$ homeomorphically onto $R^{(n)} = H_n^{-1}(Q_n)$.

The set $Q_n$, being compact in itself and convex, is an absolute retract.
Consequently, its homeomorph $R^{(n)}$ is one also. On the other hand, $Q_n$ contained the
transform $H_n(S)$ of $S$. It follows that $R^{(n)} = H_n^{-1}(Q_n)$ contains $S$. Hence,
to prove our theorem, it remains for us to prove that the $R^{(n)}$ converge
to $S$ as $n$ tends to $\infty$.

To this end take an arbitrary sequence $\{z_j\}$ such that each $z_j$ belongs to
an $R^{(n_j)}$, the $n_j$ tending to $\infty$. As we have seen above, all the
$R^{(n)}$ are contained in the bounded set $K$. It follows that $\{z_j\}$ is a bounded
sequence and that the $T(z_j)$ form a compact sequence from which one can extract
a subsequence $\{T(z_{j_k})\}$ converging to an element $z$. The $z_j$ belonging
to $K$, we have by 1°

$$\| T_{n_{j_k}}(z_{j_k}) - T(z_{j_k}) \| \le \varepsilon_{n_{j_k}} \to 0,$$

hence the $T_{n_{j_k}}(z_{j_k})$ also converge to $z$. By the definition of the $R^{(n)}$, we have
for every $z_{j_k}$ the equation

$$z_{j_k} = T_{n_{j_k}}(z_{j_k}) + h_{j_k},$$

where $h_{j_k}$ belongs to $Q_{n_{j_k}}$, hence to a sphere of radius $\varepsilon_{n_{j_k}} \to 0$. There follow
successively: $\lim z_{j_k} = \lim T_{n_{j_k}}(z_{j_k}) = z$, $\lim T(z_{j_k}) = T(z)$, hence $T(z) = z$.

Thus, from every sequence $\{z_j\}$ with $z_j$ belonging to $R^{(n_j)}$ one can extract a sequence
$\{z_{j_k}\}$ converging to a solution $z$ of the equation $T(z) = z$, hence to an
element of $S$. This proves that the $R^{(n)}$ converge to $S$.

Our theorem is thus proved.

4. Application

As an application of our general theorem we shall consider a system
of ordinary differential equations. Without essentially restricting
generality we may confine ourselves to the case of a system of two equations with
two unknown functions

(3) $\dfrac{dx}{dt} = u(x, y, t), \qquad \dfrac{dy}{dt} = v(x, y, t),$

with the initial conditions

$$x(0) = 0, \qquad y(0) = 0.$$



It has been known since Peano that this system certainly admits solutions,
provided only that $u$ and $v$ are continuous. In general these solutions will exist in an
interval of $t$ surrounding $t = 0$. If we suppose, as we shall do
in what follows, that the functions $u$ and $v$ are continuous and bounded for all
values of $x$, $y$ and $t$, the solutions will exist on the whole $t$-axis.

On the other hand, it is known that if the functions $u$ and $v$ are supposed to satisfy a
Lipschitz condition with respect to $x$ and $y$, uniformly in $t$ in every
finite interval, the solution is unique. It will therefore be so a fortiori if $u$ and $v$ are
analytic in $x$, $y$ and $t$, or differ from such a function only by a function
of $t$ alone.

To apply our general theorem, we consider the vector space
of all pairs of functions $[x(t), y(t)]$ admitting continuous derivatives $x'(t)$ and $y'(t)$
and satisfying the conditions $x(0) = y(0) = 0$. We shall consider
these functions in a fixed finite interval $\alpha \le t \le \beta$, $\alpha < 0 < \beta$, arbitrarily
chosen.

In this vector space we take as the norm of a pair of
functions

$$z = [x(t), y(t)]$$

the number

$$\| z \| = \max_{\alpha \le t \le \beta} \bigl[x'(t)^2 + y'(t)^2\bigr]^{1/2}.$$

Consider in this space the transformation

$$z_1 = T(z) = [x_1(t), y_1(t)], \qquad x_1(t) = \int_0^t u(x, y, t)\, dt, \qquad y_1(t) = \int_0^t v(x, y, t)\, dt,$$

where $z = [x(t), y(t)]$.
It is clear that, if we put

$$m = \sup\, (u^2 + v^2)^{1/2} \quad \text{for all } x, y, t,$$

the sphere of our vector space,

$$\| z \| \le 2m,$$

may be taken as the set $K$ of the general theory, for the transformation
$T$ maps it into itself. On the other hand, one easily proves that $T$
is completely continuous. This already permits the application of the existence
Theorem A.

To apply Theorem C, it is necessary to define the transformations $T_\varepsilon$ in conformity
with conditions 1° and 2° of §3. To this end let us first remark that, if
for $z = [x(t), y(t)]$ we have $\| z \| \le 2m$, there result for $x(t)$ and $y(t)$, $\alpha \le t \le \beta$,
the inequalities

$$| x(t) | \le 2m(\beta - \alpha), \qquad | y(t) | \le 2m(\beta - \alpha).$$

Consequently, to satisfy conditions 1° and 2°, it will suffice to approximate
each of the functions $u(x, y, t)$ and $v(x, y, t)$ by analytic functions $u_\varepsilon$ and $v_\varepsilon$
of the three real variables $x$, $y$ and $t$, satisfying the inequalities

$$| u_\varepsilon - u | \le \frac{\varepsilon}{2}, \qquad | v_\varepsilon - v | \le \frac{\varepsilon}{2}, \quad \text{for}$$
$$| x | \le 2m(\beta - \alpha), \quad | y | \le 2m(\beta - \alpha), \quad \alpha \le t \le \beta,$$
$$| u_\varepsilon | \le m\sqrt{2}, \qquad | v_\varepsilon | \le m\sqrt{2} \quad \text{for all } x, y, t.$$

On the basis of the Weierstrass approximation theorem one easily constructs
the functions $u_\varepsilon$ and $v_\varepsilon$. The existence and uniqueness theorems indicated
at the beginning of this § allow condition 2° to be verified immediately.
Thus Theorem C is applicable.

Let us indicate some consequences of this theorem in the present case. As
is known, to each solution of our system with the initial conditions $x(0) =
y(0) = 0$ there corresponds in the space of the variables $x$, $y$, $t$ an integral curve of the
system passing through the origin $x = y = t = 0$. If there are several solutions,
a whole bundle of integral curves passes through the origin. Each of these
curves cuts the plane $t = t_0$ at the point $x_0 = x(t_0)$, $y_0 = y(t_0)$, which varies
continuously as the integral curve ranges over the bundle. It follows that the trace
of the bundle on the plane $t = t_0$ is a continuous image of the bundle, that is, of
the set $S$ of solutions of the system (3). This set being an $R_\delta$, hence a
fortiori a continuum, its image is likewise a continuum. Consequently, the
trace on the plane $t = t_0$ of the bundle of integral curves passing through the origin
(or through an arbitrary point) is a continuum. This is the theorem of Kneser; it
thus appears as an immediate consequence of our theorem.
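The statement that the trace of the integral bundle on the plane $t = t_0$ is a continuum can be seen numerically on a standard textbook example of non-uniqueness. The following is only a minimal sketch: the choice $u = 2\sqrt{\min(|x|, 1)}$, $v = 0$ and the function names are assumptions made for illustration and do not come from the paper. For $0 \le c \le t \le 1$ the functions $x_c(t) = (t - c)^2$ (and $x_c(t) = 0$ for $t \le c$) all solve $x' = u(x)$, $x(0) = 0$, and their values at $t_0 = 1$ fill the interval $[0, 1]$.

```python
import math

def u(x):
    # Hypothetical right-hand side: continuous and bounded, but not Lipschitz at x = 0.
    return 2.0 * math.sqrt(min(abs(x), 1.0))

def solution(c, t):
    # One member of the bundle: stays at 0 until time c, then follows (t - c)^2.
    return (t - c) ** 2 if t > c else 0.0

def residual(c, t, h=1e-6):
    # |x'(t) - u(x(t))| with x' estimated by a central difference.
    derivative = (solution(c, t + h) - solution(c, t - h)) / (2.0 * h)
    return abs(derivative - u(solution(c, t)))

def trace(t0, samples=101):
    # Sorted values x_c(t0) for c on a uniform grid in [0, t0].
    return sorted(solution(t0 * k / (samples - 1), t0) for k in range(samples))

worst = max(residual(k / 10.0, j / 20.0) for k in range(11) for j in range(1, 20))
values = trace(1.0)
largest_gap = max(b - a for a, b in zip(values, values[1:]))
print(worst, values[0], values[-1], largest_gap)
```

Refining the grid of values of $c$ makes the largest gap between neighbouring trace points shrink, which is the discrete shadow of the trace being the whole segment $0 \le x \le t_0^2$.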

In particular cases we can make the nature of this trace more precise.
For example, if the functions $u(x, y, t)$ and $v(x, y, t)$ satisfy throughout the space
of $x$, $y$, $t$ one and the same Lipschitz condition except at the point $x = y = t = 0$, the
origin will be the only point in the whole space through which several
integral curves can pass. In this case, each point of the trace on the plane
$t = t_0 \neq 0$ comes from only one integral curve. Consequently, the
trace is a homeomorphic image of the set $S$ and is an $R_\delta$. By the
characteristic property of plane $R_\delta$, this trace does not separate the plane $t = t_0$.

It is very probable that every plane $R_\delta$ can be obtained as the trace of the
integral bundle for a suitable choice of the functions $u$ and $v$ conforming to the conditions
above. This is related to the conjecture that we put forward in
the introduction.

It would be interesting to study the nature of the integral bundle and of its traces
for different classes of functions. In particular, one could study the
topological nature of the integral bundle for functions $u$ and $v$ continuous and
bounded in the whole space $(x, y, t)$ and analytic everywhere except at the origin.

Editors' Note. Owing to present circumstances it was impossible to communicate
freely with the author regarding certain necessary revisions in the paper.
With the authorization of the author and some information conveyed by him,
this was accomplished by Professor D. G. Bourgin, to whom the Editors
wish to express their personal thanks and also those of the author. S. L.

London 



Annals of Mathematics
Vol. 43, No. 4, October, 1942


A PROOF THAT THERE EXISTS A CIRCUMSCRIBING CUBE AROUND 
ANY BOUNDED CLOSED CONVEX SET IN $R^3$

By Shizuo Kakutani 
(Received June 8, 1942) 

1 

The following problem was proposed by Professor Rademacher: Given a
bounded closed convex set in a three-space $R^3$, is it always possible to find a
circumscribing cube around it? It is easy to see (cf. §3) that this problem
can be reduced to the following one: Given a real-valued continuous function
$f(P)$ defined on a two-sphere $S^2$, is it possible to find a triple of points $P_1, P_2,
P_3 \in S^2$, perpendicular to one another (this means that the three vectors $OP_1$,
$OP_2$, $OP_3$ from the center $O$ of $S^2$ to these three points $P_1, P_2, P_3$ are
perpendicular to one another), such that $f(P_1) = f(P_2) = f(P_3)$? The purpose of
the present note is to answer these questions in the affirmative.

2 

Theorem 1. Let $f(P)$ be a real-valued continuous function defined on a two-sphere
$S^2$. Then there exists a triple of points $P_1, P_2, P_3 \in S^2$, perpendicular to
one another, such that $f(P_1) = f(P_2) = f(P_3)$.

Proof. Let us consider $S^2$ as a sphere of radius 1 in a three-space $R^3$, with
the origin $O = (0, 0, 0)$ of $R^3$ as a center. Let us put $P_1^0 = (1, 0, 0)$, $P_2^0 =
(0, 1, 0)$, $P_3^0 = (0, 0, 1)$. Let further $G = \{\sigma\}$ be the group of all rotations of
$S^2$ (or equivalently, rotations of $R^3$ around its origin $O = (0, 0, 0)$). $G$ is a three
dimensional compact manifold.

For any $\sigma \in G$, consider the point $\varphi(\sigma) = (x, y, z) \in R^3$ defined by $x = f(\sigma^{-1}(P_1^0))$,
$y = f(\sigma^{-1}(P_2^0))$, $z = f(\sigma^{-1}(P_3^0))$. It is clear that $\sigma \to \varphi(\sigma)$ is a continuous
mapping of $G$ into $R^3$. In order to prove our theorem, it suffices to show that there
exists a rotation $\sigma \in G$ such that $\varphi(\sigma)$ lies on the straight line $l: x = y = z$ in $R^3$.

We assume the contrary, and shall draw a contradiction from it. Let $p$ be
the projection of $R^3$ onto the plane $\pi: x + y + z = 0$, which is perpendicular to
the line $l$. Then $\sigma \to \psi(\sigma) \equiv p(\varphi(\sigma))$ is a continuous mapping of $G$ into $\pi$.
By assumption, the image $\psi(G)$ of $G$ by this mapping $\sigma \to \psi(\sigma)$ does not contain
the origin $O = (0, 0, 0)$.

Let $H$ be the subgroup of $G$ consisting of all rotations around the line $l$. $H$
is isomorphic to the group of rotations of the plane $\pi$ around the origin $O =
(0, 0, 0)$, and we may denote elements of $H$ by $\sigma_\theta$ $(0 \le \theta \le 2\pi)$, where $\theta$ denotes
the angle of rotation around the axis $l$, measured in such a sense that we have
$\sigma_{2\pi/3}(P_i^0) = P_{i+1}^0$, $i = 1, 2, 3$, mod 3.

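The normalization $\sigma_{2\pi/3}(P_i^0) = P_{i+1}^0$ can be checked directly. The sketch below is an illustration only, not part of the original proof: it assumes the axis $l$ is oriented along $(1, 1, 1)$, builds the rotation with Rodrigues' formula, and uses helper names (`rotation_about_axis`, `apply`, `close`) that are hypothetical.

```python
import math

def rotation_about_axis(axis, theta):
    # Rodrigues' formula: 3x3 matrix of the rotation by theta about the given axis.
    norm = math.sqrt(sum(c * c for c in axis))
    kx, ky, kz = (c / norm for c in axis)
    c, s = math.cos(theta), math.sin(theta)
    v = 1.0 - c
    return [
        [c + kx * kx * v, kx * ky * v - kz * s, kx * kz * v + ky * s],
        [ky * kx * v + kz * s, c + ky * ky * v, ky * kz * v - kx * s],
        [kz * kx * v - ky * s, kz * ky * v + kx * s, c + kz * kz * v],
    ]

def apply(matrix, point):
    # Matrix-vector product.
    return tuple(sum(matrix[i][j] * point[j] for j in range(3)) for i in range(3))

def close(p, q, tol=1e-12):
    return all(abs(a - b) <= tol for a, b in zip(p, q))

# The three mutually perpendicular base points P_1^0, P_2^0, P_3^0.
P0 = [(1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, 1.0)]

# Rotation by 2*pi/3 about the line x = y = z, oriented along (1, 1, 1).
sigma = rotation_about_axis((1.0, 1.0, 1.0), 2.0 * math.pi / 3.0)
images = [apply(sigma, p) for p in P0]
cycles = all(close(images[i], P0[(i + 1) % 3]) for i in range(3))
print(cycles)
```

With this orientation the rotation permutes the three base points cyclically, which is why moving along $H$ by $2\pi/3$ only permutes the coordinates of $\varphi(\sigma_\theta)$.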
Let us denote the rotation of the plane $\pi$ around its origin $O = (0, 0, 0)$, which
corresponds to $\sigma_\theta$, by $\tau_\theta$ $(0 \le \theta \le 2\pi)$. It is then easy to see that we have

$$\psi(\sigma_{\theta + 2\pi/3}) = \tau_{2\pi/3}(\psi(\sigma_\theta)), \qquad \psi(\sigma_{\theta + 4\pi/3}) = \tau_{4\pi/3}(\psi(\sigma_\theta))$$

for any $\theta$ $(0 \le \theta \le 2\pi)$. Let $C_{\theta_1, \theta_2}$ be the curve traced on $\pi$ by $\psi(\sigma_\theta)$ when $\theta$
runs over the interval $\theta_1 \le \theta \le \theta_2$. Then the fact stated above means that the
curves $C_{2\pi/3, 4\pi/3}$ and $C_{4\pi/3, 2\pi}$ are obtained by applying the rotations $\tau_{2\pi/3}$ and
$\tau_{4\pi/3}$ to the curve $C_{0, 2\pi/3}$.

Let $\alpha$ be the increment of the angle around the origin $O = (0, 0, 0)$ in the
plane $\pi$, when the point $\psi(\sigma_\theta)$ runs over the curve $C_{0, 2\pi/3}$ from $\psi(\sigma_0)$ to $\psi(\sigma_{2\pi/3})$, or
equivalently, when $\theta$ runs from 0 to $2\pi/3$. Then $\alpha$ must be of the form: $\alpha =
2m\pi + 2\pi/3$, where $m$ is an integer ($m = 0, \pm 1, \pm 2, \dots$). Hence as $\theta$ runs
from 0 to $2\pi$, the total increment of the angle of $\psi(\sigma_\theta)$ around the origin $O =
(0, 0, 0)$ in the plane $\pi$ is $6m\pi + 2\pi = (3m + 1) \cdot 2\pi$.

On the other hand, consider $H$ as a closed curve on the manifold of the topological
group $G$. Then it is well known that $2H$ is homotopic to zero on $G$.
Consequently, the curve $2C_{0, 2\pi}$, which is the image of $2H$ by the mapping
$\sigma \to \psi(\sigma)$, must also be homotopic to zero on $\pi^*$, where by $\pi^*$ we mean the open
set which is obtained by taking away the origin $O = (0, 0, 0)$ from the plane $\pi$.
This is, however, impossible, since the total increment of the angle on the
curve $2C_{0, 2\pi}$ is $2(3m + 1) \cdot 2\pi \neq 0$.

Thus we arrive at a contradiction, and the proof of Theorem 1 is completed. 

3 

Theorem 2. Let $K$ be a bounded closed convex set in a three space $R^3$. Then
there exists a circumscribing cube around $K$.

Proof. Let $S^2$ be a two sphere in $R^3$ with the origin $O = (0, 0, 0)$ of $R^3$ as a
center. For any point $P \in S^2$, consider two tangent planes to $K$ (parallel to
each other) which are perpendicular to the vector $OP$. These two planes may
coincide if $K$ is a flat convex set. Let $f(P)$ be the perpendicular distance between these two
planes. $f(P)$ is clearly a real-valued continuous function defined on $S^2$. (Moreover,
$f(P)$ takes the same value at two antipodal points of $S^2$; but we do not
need this fact in our proof.) By Theorem 1, there exists a triple of points
$P_1, P_2, P_3 \in S^2$, perpendicular to one another, such that $f(P_1) = f(P_2) = f(P_3)$.
It is then clear that the corresponding six tangent planes form a cube which is
circumscribing around the convex set $K$.
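For a convex polytope the function $f(P)$ used in this proof is easy to compute: it is the difference between the largest and smallest values of the scalar product of the vertices with the unit vector $OP$. The following is a minimal sketch under that assumption (a polytope given by its vertex list); the names `width` and `equal_widths` are hypothetical helpers, and the code only evaluates $f$ and tests a given orthogonal triple, it does not search for the triple whose existence Theorem 1 guarantees.

```python
import math

def width(vertices, direction):
    # f(P): distance between the two supporting planes perpendicular to `direction`.
    norm = math.sqrt(sum(c * c for c in direction))
    unit = [c / norm for c in direction]
    heights = [sum(v[i] * unit[i] for i in range(3)) for v in vertices]
    return max(heights) - min(heights)

def equal_widths(vertices, frame, tol=1e-9):
    # True when the three directions of `frame` give the same width,
    # i.e. the six supporting planes bound a cube rather than a general box.
    widths = [width(vertices, d) for d in frame]
    return max(widths) - min(widths) <= tol

axes = [(1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, 1.0)]

# A 1 x 2 x 3 box: the coordinate frame circumscribes a box, not a cube.
box = [(x, y, z) for x in (0.0, 1.0) for y in (0.0, 2.0) for z in (0.0, 3.0)]

# The unit cube: the coordinate frame already gives a circumscribing cube.
unit_cube = [(x, y, z) for x in (0.0, 1.0) for y in (0.0, 1.0) for z in (0.0, 1.0)]

print([width(box, d) for d in axes], equal_widths(box, axes), equal_widths(unit_cube, axes))
```

The antipodal symmetry noted in the proof shows up here as `width(K, d) == width(K, -d)`, since reversing the direction only swaps the two supporting planes.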

4. Remarks 

There are two problems related to our results. The first one is to investigate
whether it is possible to inscribe a cube in a given bounded open convex set
in $R^3$. The answer to this question is negative, and a counter-example to this
is given by a tetrahedron in $R^3$ which is extremely flat. In fact, if we take a
convex quadrangle $ABCD$ on the $(x, y)$-plane, such that the two diagonals
$AC$ and $BD$ are not perpendicular to each other, and if we shift the vertex $A$
in a direction of the $z$-axis by a small distance, then the tetrahedron $A'BCD$ thus
obtained is a required one. It is easy to see that there is no inscribing cube in
this tetrahedron.

The second problem concerns the possibility of generalizations to higher dimensional
cases. It is not yet known whether or not it is possible to find a
circumscribing $n$-dimensional cube around any given bounded closed convex
set in $R^n$ $(n \ge 4)$. We may also ask: Given a real-valued continuous function
$f(P)$ defined on an $(n - 1)$-sphere $S^{n-1}$, is it possible to find $n$ points $P_1, \dots, P_n$
on $S^{n-1}$, perpendicular to one another, such that $f(P_1) = \cdots = f(P_n)$ $(n \ge 4)$?
These problems are still unsolved.

Institute for Advanced Study 



Annals of Mathematics
Vol. 43, No. 4, October, 1942


AN EXTREMUM PROBLEM IN PRODUCT MEASURE 


By Shizuo Kakutani 
(Received June 19, 1942) 


I. The problem and results 

The following problem was proposed by P. R. Halmos: Let $\Phi$ be the collection
of all real valued measurable functions $\varphi(x, y)$ defined on the unit square $I_x \times I_y$:
$0 \le x, y \le 1$ such that $0 \le \varphi(x, y) \le 1$ for any $(x, y) \in I_x \times I_y$. Let us put

(1) $A(\varphi) = \int_0^1 \int_0^1 \varphi(x, y)\, dx\, dy,$

(2) $V(\varphi) = \int_0^1 \int_0^1 \int_0^1 \varphi(x, y)\, \varphi(y, z)\, dx\, dy\, dz,$

and

(3) $\lambda(\alpha) = \sup_{A(\varphi) = \alpha,\ \varphi \in \Phi} V(\varphi),$

(4) $\mu(\alpha) = \inf_{A(\varphi) = \alpha,\ \varphi \in \Phi} V(\varphi),$

where $\alpha$ is a real number $(0 \le \alpha \le 1)$. Then what are the exact values of
$\lambda(\alpha)$ and $\mu(\alpha)$ as functions of $\alpha$ in the interval $0 \le \alpha \le 1$?

Consider the special case when $\varphi(x, y)$ is the characteristic function $\varphi_E(x, y)$
of a measurable set $E \subset I_x \times I_y$. Then $A(\varphi_E) = A(\varphi)$ is clearly the area
(= two dimensional Lebesgue measure) of the set $E$, while the meaning of
$V(\varphi_E) = V(\varphi)$ may be interpreted as follows: Take the unit cube $I_x \times I_y \times I_z$:
$0 \le x, y, z \le 1$, and consider $E$ as a subset of its face $I_x \times I_y \times (0)$. Let $E'$
be the set on the face $(0) \times I_y \times I_z$ which is obtained from $E$ by the mapping
$(x, y, 0) \to (0, x, y)$. Then $V(\varphi_E)$ is the volume (= three dimensional Lebesgue
measure) of the intersection of two cylindrical sets $E \times I_z$ and $I_x \times E'$, i.e.,
the set of all points $(x, y, z) \in I_x \times I_y \times I_z$ such that $(x, y) \in E$ and $(y, z) \in E'$.
The purpose of the present note is to prove the following
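Theorem.

(5) $\lambda(\alpha) = 2\alpha - 1 + (1 - \alpha)^{3/2}, \qquad 0 \le \alpha \le \tfrac{1}{2},$

(6) $\lambda(\alpha) = \alpha^{3/2}, \qquad \tfrac{1}{2} \le \alpha \le 1,$

(7) $\mu(\alpha) = \dfrac{n - 2}{3n^2} \left\{ (3\alpha - 1)n + 1 - \dfrac{((1 - 2\alpha)n - 1)^{3/2}}{(n - 1)^{1/2}} \right\}, \qquad \dfrac{1}{2}\left(1 - \dfrac{1}{n - 1}\right) \le \alpha \le \dfrac{1}{2}\left(1 - \dfrac{1}{n}\right),$

(8) $\mu(\tfrac{1}{2}) = \tfrac{1}{6},$

(9) $\mu(\alpha) = 2\alpha - 1 + \dfrac{n - 2}{3n^2} \left\{ (2 - 3\alpha)n + 1 - \dfrac{((2\alpha - 1)n - 1)^{3/2}}{(n - 1)^{1/2}} \right\}, \qquad \dfrac{1}{2}\left(1 + \dfrac{1}{n}\right) \le \alpha \le \dfrac{1}{2}\left(1 + \dfrac{1}{n - 1}\right),$

$n = 2, 3, \cdots$.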




These extreme values of $V(\varphi)$ are attained by the characteristic functions $\varphi_E(x, y)$ of the sets $E$ of Figs. 1, 2, 3, 4, and 5 respectively, where

$a = a' = 1 - (1 - \alpha)^{1/2}$, $b = b' = (1 - \alpha)^{1/2}$ (in Fig. 1);

$a = a' = 1 - \alpha^{1/2}$, $b = b' = \alpha^{1/2}$ (in Fig. 2);

$a_n = a_n' = \frac{1}{n}\left(1 - ((n - 1)(n\beta - 1))^{1/2}\right)$

where $n$ is a positive integer satisfying

$\frac{1}{n} \leq \beta = |1 - 2\alpha| \leq \frac{1}{n-1}$ (in Figs. 3, 5).



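The graphs of $\lambda(\alpha)$ and $\mu(\alpha)$ are given in Figs. 6 and 7. It is to be remarked that $\lambda''(\alpha) > 0$ in the intervals $0 < \alpha < \frac{1}{2}$ and $\frac{1}{2} < \alpha < 1$, while at $\alpha = \frac{1}{2}$ we have $\lambda'(\frac{1}{2} - 0) = 0.94 \cdots < \lambda'(\frac{1}{2} + 0) = 1.06 \cdots$. $\mu(\alpha)$ is linear in the intervals $0 \leq \alpha \leq \frac{1}{4}$ and $\frac{3}{4} \leq \alpha \leq 1$. Further we have $\mu''(\alpha) < 0$ in the intervals

$\frac{1}{2}\left(1 - \frac{1}{n-1}\right) < \alpha < \frac{1}{2}\left(1 - \frac{1}{n}\right)$ and $\frac{1}{2}\left(1 + \frac{1}{n}\right) < \alpha < \frac{1}{2}\left(1 + \frac{1}{n-1}\right)$, $n = 3, 4, \cdots$.

Finally, at $\alpha = \frac{1}{2}\left(1 \pm \frac{1}{n}\right)$ we have

$\mu'\left(\frac{1}{2}\left(1 - \frac{1}{n}\right) - 0\right) = \frac{n-2}{n} < \mu'\left(\frac{1}{2}\left(1 - \frac{1}{n}\right) + 0\right) = \frac{n-1}{n},$

$\mu'\left(\frac{1}{2}\left(1 + \frac{1}{n}\right) - 0\right) = \frac{n+1}{n} < \mu'\left(\frac{1}{2}\left(1 + \frac{1}{n}\right) + 0\right) = \frac{n+2}{n}.$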


















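The exact values of $\mu(\alpha)$ at $\alpha = \frac{1}{2}\left(1 \pm \frac{1}{n}\right)$, $n = 1, 2, 3, \cdots$, are given by

$\mu\left(\frac{1}{2}\left(1 - \frac{1}{n}\right)\right) = \frac{(n - 1)(n - 2)}{6n^2} = \frac{1}{2}\left(1 - \frac{1}{n}\right) - \frac{1}{3} + \frac{1}{3n^2},$

$\mu\left(\frac{1}{2}\left(1 + \frac{1}{n}\right)\right) = \frac{(n + 1)(n + 2)}{6n^2} = \frac{1}{2}\left(1 + \frac{1}{n}\right) - \frac{1}{3} + \frac{1}{3n^2}.$

Thus the graph of $\mu(\alpha)$ lies entirely above the straight line $y = \alpha - \frac{1}{3}$ except at the point $\alpha = \frac{1}{2}$ where $\mu(\frac{1}{2}) = \frac{1}{6}$.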
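
As a numerical illustration (not part of the original note), the closed forms (5) to (9) can be evaluated directly. The Python sketch below assumes the formulas exactly as stated in the theorem; the names `lam` and `mu` are ours, and the small slack used when choosing $n$ is an implementation assumption that guards against rounding at the endpoints of the intervals.

```python
import math


def lam(alpha):
    """Formulas (5) and (6): the supremum of V(phi) subject to A(phi) = alpha."""
    if alpha <= 0.5:
        return 2 * alpha - 1 + (1 - alpha) ** 1.5
    return alpha ** 1.5


def mu(alpha):
    """Formulas (7), (8), (9): the infimum of V(phi) subject to A(phi) = alpha."""
    if alpha > 0.5:
        # (9) is (7) reflected by the symmetry mu(1 - a) = 1 - 2a + mu(a).
        return 2 * alpha - 1 + mu(1 - alpha)
    if alpha == 0.5:
        return 1 / 6
    beta = 1 - 2 * alpha
    # Smallest integer n >= 2 with n * beta >= 1 (slack 1e-12 is an assumption).
    n = max(2, math.ceil(1 / beta - 1e-12))
    root = max(n * beta - 1, 0.0) ** 1.5 / math.sqrt(n - 1)
    return (n - 2) / (3 * n * n) * ((3 * alpha - 1) * n + 1 - root)
```

For instance, `mu(0.5 * (1 - 1 / n))` may be compared with $(n-1)(n-2)/6n^2$ for small $n$, and `mu(a) <= lam(a)` may be checked on a grid of values of `a`.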

The author is indebted to Drs. W. Ambrose, D. Blackwell, R. H. Fox and 
P. R. Halmos for the conversations we have had in the course of this work. 


The values of $\mu(\alpha)$ for $\alpha = \frac{1}{2}\left(1 \pm \frac{1}{n}\right)$, $n = 1, 2, 3, \cdots$ were obtained by R. H. Fox and P. R. Halmos.


II. Preliminary considerations

Lemma 1.

(10) $\lambda(1 - \alpha) = 1 - 2\alpha + \lambda(\alpha),$

(11) $\mu(1 - \alpha) = 1 - 2\alpha + \mu(\alpha).$

Proof. These are the direct consequences of the fact that $\varphi(x, y) \in \Phi$ implies $1 - \varphi(x, y) \in \Phi$, and that

(12) $A(1 - \varphi) = 1 - A(\varphi),$

(13) $V(1 - \varphi) = 1 - 2A(\varphi) + V(\varphi),$

which follow easily from the definitions of $A(\varphi)$ and $V(\varphi)$.

It is clear that the functions $\lambda(\alpha)$ and $\mu(\alpha)$ defined by (5), (6), (7), (8) and (9) satisfy (10) and (11). Hence it suffices to discuss either the case $0 \leq \alpha \leq \frac{1}{2}$ or the case $\frac{1}{2} \leq \alpha \leq 1$. This fact is needed in the following discussions.
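
The identities (12) and (13) can be checked numerically on a function that is constant on the cells of a regular grid. The following Python sketch is only an illustration under that assumption: the grid size, the random seed and the helper names `area`, `volume` and `complement` are ours, and the integrals (1) and (2) are replaced by the corresponding finite averages.

```python
import random


def area(phi):
    """Discrete analogue of A(phi): the average of phi over the grid."""
    n = len(phi)
    return sum(sum(row) for row in phi) / n ** 2


def volume(phi):
    """Discrete analogue of V(phi): the average of phi(x, y) * phi(y, z)."""
    n = len(phi)
    total = 0.0
    for x in range(n):
        for y in range(n):
            for z in range(n):
                total += phi[x][y] * phi[y][z]
    return total / n ** 3


def complement(phi):
    """The function 1 - phi, which again takes values in [0, 1]."""
    return [[1 - value for value in row] for row in phi]


rng = random.Random(0)
phi = [[rng.random() for _ in range(6)] for _ in range(6)]
print(area(complement(phi)), 1 - area(phi))
print(volume(complement(phi)), 1 - 2 * area(phi) + volume(phi))
```

The two numbers on each printed line should agree up to rounding, which is the content of (12) and (13) for this grid function.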

Definition 1. A real valued function $f(x)$ defined on the interval $I_x$: $0 \leq x \leq 1$ is an elementary function if it is a finite linear combination of the characteristic functions of the intervals contained in $I_x$. (We do not care whether each of these intervals is closed or open, as we are only interested in elementary functions defined modulo null sets.) A set in the square $I_x \times I_y$: $0 \leq x, y \leq 1$ is a rectangular set, if it is a direct product of two intervals each contained in $I_x$ and $I_y$ respectively. Finally, a real valued function defined on $I_x \times I_y$ is an elementary function if it is a finite linear combination of the characteristic functions of rectangular sets in $I_x \times I_y$.

Let $\Phi^0$ be the subcollection of $\Phi$ consisting of all elementary functions $\varphi(x, y)$ in $\Phi$. It is clear that in the definitions (3), (4) of $\lambda(\alpha)$ and $\mu(\alpha)$, we may replace the condition $\varphi \in \Phi$ by $\varphi \in \Phi^0$ and yet we obtain the same sup or inf. Hence, in order to prove our theorem, it suffices to show that $\mu(\alpha) \leq V(\varphi) \leq \lambda(\alpha)$ for any $\varphi(x, y) \in \Phi^0$ with $A(\varphi) = \alpha$, where $\lambda(\alpha)$ and $\mu(\alpha)$ are defined by (5), (6), (7), (8) and (9).

Let now $\varphi(x, y) \in \Phi^0$. We shall put

(14) $f_\varphi(x) = \int_0^1 \varphi(x, y)\, dy,$

(15) $g_\varphi(y) = \int_0^1 \varphi(x, y)\, dx.$

It is clear that $f_\varphi(x)$ and $g_\varphi(y)$ are elementary functions and we have

(16) $A(\varphi) = \int_0^1 f_\varphi(t)\, dt = \int_0^1 g_\varphi(t)\, dt,$

(17) $V(\varphi) = \int_0^1 f_\varphi(t)\, g_\varphi(t)\, dt.$
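
Formula (17) reduces the triple integral (2) to a single integral of the two marginal functions. A minimal Python sketch of this reduction for a function that is constant on the cells of a regular grid is given below; the grid size, the seed and the names `row_means`, `col_means`, `volume_triple` and `volume_marginals` are assumptions made for the illustration.

```python
import random


def row_means(phi):
    """f_phi(x): the average of phi(x, y) over y, as in (14)."""
    return [sum(row) / len(row) for row in phi]


def col_means(phi):
    """g_phi(y): the average of phi(x, y) over x, as in (15)."""
    n = len(phi)
    return [sum(phi[x][y] for x in range(n)) / n for y in range(n)]


def volume_triple(phi):
    """V(phi) computed from the definition (2) as a triple average."""
    n = len(phi)
    total = 0.0
    for x in range(n):
        for y in range(n):
            for z in range(n):
                total += phi[x][y] * phi[y][z]
    return total / n ** 3


def volume_marginals(phi):
    """V(phi) computed from (17) as the average of f_phi(t) * g_phi(t)."""
    f = row_means(phi)
    g = col_means(phi)
    return sum(ft * gt for ft, gt in zip(f, g)) / len(phi)


rng = random.Random(1)
phi = [[rng.random() for _ in range(7)] for _ in range(7)]
print(volume_triple(phi), volume_marginals(phi))
```

The second computation needs of the order of $N^2$ operations on an $N \times N$ grid instead of $N^3$, which is the practical gain of (17).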


Lemma 2. For any $\varphi(x, y) \in \Phi^0$, there exists a $\varphi'(x, y) \in \Phi^0$ such that $A(\varphi') = A(\varphi)$, $V(\varphi') = V(\varphi)$, and such that $f_{\varphi'}(x)$ is monotone non-increasing or monotone non-decreasing in $x$.

Proof. Since $f_\varphi(x)$ is an elementary function, there exists a measure preserving transformation $x' = h(x)$ of the interval $I_x$ onto itself, such that $f_\varphi(h(x))$ is monotone non-increasing or monotone non-decreasing. In fact, we can choose $h(x)$ as a permutation of subintervals of $I_x$. It is then clear that the function $\varphi'(x, y) = \varphi(h(x), h(y))$ satisfies all the conditions required in Lemma 2.

Definition 2. A set $E \subset I_x \times I_y$ is an elementary set if it is a union of a finite number of rectangular sets. A set $E \subset I_x \times I_y$ is a corner set if $(x, y) \in E$, $0 \leq x' \leq x$, $0 \leq y' \leq y$ imply $(x', y') \in E$. Further, a set $E \subset I_x \times I_y$ is a corner* set if $(x, y) \in E$, $x \leq x' \leq 1$, $0 \leq y' \leq y$ imply $(x', y') \in E$.

Lemma 3. Let $\varphi(x, y) \in \Phi^0$ be an elementary function such that $f_\varphi(x)$ is monotone non-increasing in $x$. Then there exists an elementary corner set $E \subset I_x \times I_y$ such that $A(\varphi_E) = A(\varphi)$, $V(\varphi_E) \geq V(\varphi)$.

Proof. Let $E$ be the set of all points $(x, y) \in I_x \times I_y$ such that $0 \leq y \leq f_\varphi(x)$. It is clear that $E$ is an elementary corner set satisfying

(18) $f_{\varphi_E}(t) = f_\varphi(t)$, for $0 \leq t \leq 1$.

From this follows easily that $A(\varphi_E) = A(\varphi)$. Moreover, it is easy to see that

(19) $G_{\varphi_E}(t) \geq G_\varphi(t)$, for $0 \leq t \leq 1$,

(20) $G_{\varphi_E}(0) = G_\varphi(0)\ (= 0)$, $G_{\varphi_E}(1) = G_\varphi(1)\ (= A(\varphi_E) = A(\varphi))$,

where $G_{\varphi_E}(t)$ and $G_\varphi(t)$ are defined by

(21) $G_{\varphi_E}(t) = \int_0^t g_{\varphi_E}(s)\, ds$, $G_\varphi(t) = \int_0^t g_\varphi(s)\, ds$, for $0 \leq t \leq 1$

respectively.



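Consequently,

(22) $V(\varphi_E) = \int_0^1 f_{\varphi_E}(t)\, g_{\varphi_E}(t)\, dt$

$= [f_\varphi(t)\, G_{\varphi_E}(t)]_0^1 - \int_0^1 G_{\varphi_E}(t)\, df_\varphi(t)$

$\geq [f_\varphi(t)\, G_\varphi(t)]_0^1 - \int_0^1 G_\varphi(t)\, df_\varphi(t)$

$= \int_0^1 f_\varphi(t)\, g_\varphi(t)\, dt = V(\varphi),$

which proves Lemma 3.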
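
The rearrangements of Lemma 2 and Lemma 3 can be imitated on a grid. The Python sketch below is an illustration only, under the assumption that $\varphi$ is the characteristic function of a union of grid cells (a 0/1 matrix with rows indexed by $x$ and columns by $y$); the helper names are ours. Integer counts are used, so that $N^2 A$ and $N^3 V$ are compared exactly.

```python
import random


def row_sums(m):
    return [sum(row) for row in m]


def col_sums(m):
    return [sum(col) for col in zip(*m)]


def area_count(m):
    """N^2 * A(phi) for a 0/1 matrix."""
    return sum(row_sums(m))


def volume_count(m):
    """N^3 * V(phi) for a 0/1 matrix, using (17)."""
    return sum(f * g for f, g in zip(row_sums(m), col_sums(m)))


def sort_rows_decreasing(m):
    """Lemma 2: permute both coordinates by the same permutation h."""
    order = sorted(range(len(m)), key=lambda i: -sum(m[i]))
    return [[m[i][j] for j in order] for i in order]


def corner_set(m):
    """Lemma 3: the set 0 <= y <= f(x), built from the row sums of m."""
    n = len(m)
    return [[1 if j < k else 0 for j in range(n)] for k in row_sums(m)]


rng = random.Random(2)
m = [[rng.randint(0, 1) for _ in range(8)] for _ in range(8)]
s = sort_rows_decreasing(m)
e = corner_set(s)
print(area_count(m), area_count(e), volume_count(m), volume_count(e))
```

The areas agree, and the last count should be at least as large as the one before it, in accordance with Lemma 3.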

In the same way we can prove 

Lemma 3'. Let $\varphi(x, y) \in \Phi^0$ be an elementary function such that $f_\varphi(x)$ is monotone non-decreasing in $x$. Then there exists an elementary corner* set $E \subset I_x \times I_y$ such that $A(\varphi_E) = A(\varphi)$, $V(\varphi_E) \leq V(\varphi)$.

We omit the proof. 

Definition 3. Let $E$ be an elementary corner set in $I_x \times I_y$, and consider the graphs $y = f_{\varphi_E}(x)$ and $x = g_{\varphi_E}(y)$. These two graphs together will compose a polygonal line $\Gamma_E$, consisting only of horizontal and vertical segments, which connects the points (0, 1) and (1, 0). This polygonal line is called the characteristic graph of $E$. Similarly, we can define the characteristic graph of an elementary corner* set $E \subset I_x \times I_y$. This is a polygonal line connecting two points (0, 0) and (1, 1).

We shall divide our further arguments into two parts, namely, the discussion of $\lambda(\alpha)$ and that of $\mu(\alpha)$.


III. Discussion of $\lambda(\alpha)$

Definition 4. A set $E \subset I_x \times I_y$ is symmetric if $(x, y) \in E$ implies $(y, x) \in E$.

Lemma 4. For any elementary corner set $E \subset I_x \times I_y$, there exists a symmetric elementary corner set $E' \subset I_x \times I_y$ such that $A(\varphi_{E'}) = A(\varphi_E)$, $V(\varphi_{E'}) \geq V(\varphi_E)$.

Proof. Let $\Gamma_E$ be the characteristic graph of $E$. $\Gamma_E$ has a unique intersection with the diagonal $x = y$ of the unit square $I_x \times I_y$. Let $(\xi, \xi)$ be this point of intersection. Then the required set $E'$ is defined as the set of all points $(x, y) \in I_x \times I_y$ satisfying one of the following three conditions:

(23) $0 \leq x \leq \xi$, $0 \leq y \leq \xi$,

(24) $0 \leq x \leq \frac{1}{2}\{f_{\varphi_E}(y) + g_{\varphi_E}(y)\}$, $\xi < y \leq 1$,

(25) $\xi < x \leq 1$, $0 \leq y \leq \frac{1}{2}\{f_{\varphi_E}(x) + g_{\varphi_E}(x)\}$.

It is clear that $E'$ is a symmetric elementary corner set, and that $A(\varphi_{E'}) = A(\varphi_E)$. In order to prove that $V(\varphi_{E'}) \geq V(\varphi_E)$, we put



(26) $p(t) = f_{\varphi_E}(t)$, $q(t) = g_{\varphi_E}(t)$, $r(t) = f_{\varphi_{E'}}(t)$, for $0 \leq t \leq 1$,

(27) $P(t) = \int_0^t p(s)\, ds$, $Q(t) = \int_0^t q(s)\, ds$, $R(t) = \int_0^t r(s)\, ds$, for $0 \leq t \leq 1$.

It is then easy to see that

(28) $2R(t) \geq P(t) + Q(t)$, for $0 \leq t \leq \xi$,

(29) $2r(t) = p(t) + q(t)$, for $\xi < t \leq 1$.

Consequently,

(30) $\int_0^\xi \{r(t)^2 - p(t)q(t)\}\, dt \geq \frac{1}{4} \int_0^\xi \{4r(t)^2 - (p(t) + q(t))^2\}\, dt$

$= \frac{1}{4} \int_0^\xi \{2r(t) - p(t) - q(t)\}\{2r(t) + p(t) + q(t)\}\, dt$

$= \frac{1}{4} [\{2R(t) - P(t) - Q(t)\}\{2r(t) + p(t) + q(t)\}]_0^\xi - \frac{1}{4} \int_0^\xi \{2R(t) - P(t) - Q(t)\}\, d\{2r(t) + p(t) + q(t)\} \geq 0$

and

(31) $\int_\xi^1 \{r(t)^2 - p(t)q(t)\}\, dt = \int_\xi^1 \{\frac{1}{4}(p(t) + q(t))^2 - p(t)q(t)\}\, dt = \frac{1}{4} \int_\xi^1 (p(t) - q(t))^2\, dt \geq 0,$

which together imply

(32) $V(\varphi_{E'}) - V(\varphi_E) = \int_0^1 \{r(t)^2 - p(t)q(t)\}\, dt \geq 0.$

The proof of Lemma 4 is completed.

Thus in order to discuss $\lambda(\alpha)$, it suffices to consider symmetric elementary corner sets $E \subset I_x \times I_y$.

Now let $E$ be a symmetric elementary corner set in $I_x \times I_y$, and let $\Gamma_E$ be its characteristic graph. Let us assume that $\Gamma_E$ contains at least three segments above the diagonal $x = y$.¹ Let $a, b, c$ be three consecutive segments of $\Gamma_E$ lying above the diagonal $x = y$. We assume that $a$ and $c$ are horizontal, while $b$ is vertical to the $x$-axis. (See Fig. 8.) We shall replace the part of $\Gamma_E$ consisting of $a, b, c$ by another system of segments $a', b', c'$ as indicated in Fig. 8. Let us denote the new symmetric elementary corner set thus obtained by $E'$. (Of course, we make the same change below the diagonal, as indicated in Fig. 8,


¹ When we say that a segment (parallel to the $x$-axis or to the $y$-axis) lies above or below the diagonal $x = y$, one of the end points of the segment may lie on the diagonal $x = y$, while on the other hand, when we say that a segment lies entirely above or below the diagonal neither of the end points of the segment can lie on the diagonal $x = y$.





so as to make $E'$ symmetric.) The $y$-coordinate of $b'$ is so chosen that we have $A(\varphi_{E'}) = A(\varphi_E)$, and this condition is thus fulfilled if we have $a/b' = c'/b = \theta$, where $\theta$ is a real number satisfying $0 < \theta < 1$. We shall compare $V(\varphi_{E'})$ with $V(\varphi_E)$. A simple computation shows

(33) $V(\varphi_{E'}) - V(\varphi_E) = \theta(1 - \theta)(a + c)\, b\, (a + c - b).$

Hence, if $a + c > b$, then by replacing $a, b, c$ by $a', b', c'$ we obtain a new symmetric elementary corner set $E'$, such that $A(\varphi_{E'}) = A(\varphi_E)$, $V(\varphi_{E'}) \geq V(\varphi_E)$. If we interchange $a, b, c$ with $a', b', c'$, then we immediately see that the same thing is true even if $a$ and $c$ are vertical, while $b$ is horizontal to the $x$-axis.
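
Formula (33) can be checked on a simple configuration. The Python sketch below is an illustration under assumptions of our own: the three segments start at the point (0, 1), the segment $c$ ends on the diagonal (so that $b = 1 - a - c$), and the new segments satisfy $b' = a + c$, $a' + c' = b$. For a symmetric set $f = g$, so that $V$ is the integral of $f^2$; the names `integral_f_squared` and `compare` are hypothetical.

```python
def integral_f_squared(pieces):
    """Integral of f(t)^2 for a step function given as (width, height) pairs."""
    return sum(width * height * height for width, height in pieces)


def compare(a, c):
    """Return V(E), V(E') and the right-hand side of (33) for 0 < a, c, a + c < 1."""
    b = 1 - a - c             # vertical segment; (a + c, 1 - b) lies on the diagonal
    xi = a + c                # abscissa of the intersection with the diagonal
    theta = a / xi            # a / b' = c' / b = theta, with b' = a + c
    a_new = (1 - theta) * b   # length of a'
    c_new = theta * b         # length of c'
    # E: f = 1 on [0, a), f = xi on [a, xi), f = a on [xi, 1].
    v_old = integral_f_squared([(a, 1.0), (c, xi), (b, a)])
    # E': f = 1 - a' on [0, xi), f = xi on [xi, 1 - a'), f = 0 on [1 - a', 1].
    v_new = integral_f_squared([(xi, 1 - a_new), (c_new, xi), (a_new, 0.0)])
    predicted = theta * (1 - theta) * (a + c) * b * (a + c - b)
    return v_old, v_new, predicted


print(compare(0.3, 0.3))
print(compare(0.1, 0.2))
```

In the first call $a + c > b$ and the replacement increases $V$; in the second call $a + c < b$ and it decreases $V$, in both cases by the amount predicted by (33).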



Assume now that $\Gamma_E$ contains at least four segments above the diagonal $x = y$. Let $a, b, c, d$ be any four consecutive segments. We denote their respective lengths by $a, b, c, d$. It is then easy to see that at least one of the inequalities $a + c > b$, $b + d > c$ must hold. Hence, by replacing $a, b, c$ or $b, c, d$ by a suitable system $a', b', c'$ or $b', c', d'$, we can always obtain a new symmetric elementary corner set $E'$ for which $A(\varphi_{E'}) = A(\varphi_E)$, $V(\varphi_{E'}) \geq V(\varphi_E)$.

Further, it is to be noticed that the number of segments lying above the diagonal $x = y$ in the characteristic graph of the new set $E'$ is smaller than that of $E$ exactly by two.

Thus, by iterating the same process, we shall finally reach a symmetric elementary corner set $E^*$ whose characteristic graph $\Gamma_{E^*}$ consists of at most three segments lying above the diagonal $x = y$, and such that $A(\varphi_{E^*}) = A(\varphi_E)$, $V(\varphi_{E^*}) \geq V(\varphi_E)$. Consequently, in order to discuss $\lambda(\alpha)$ it suffices to consider the symmetric elementary sets $E$ of the forms given in Figs. 1, 2, 9 and 10. We shall discuss these cases separately.

(i) Case of Fig. 1. The condition $A(\varphi_E) = \alpha$ implies $a = 1 - (1 - \alpha)^{1/2}$, $b = (1 - \alpha)^{1/2}$. Consequently

(34) $V(\varphi_E) = 2\alpha - 1 + (1 - \alpha)^{3/2}.$

(ii) Case of Fig. 2. The condition $A(\varphi_E) = \alpha$ implies $a = 1 - \alpha^{1/2}$, $b = \alpha^{1/2}$. Consequently,

(35) $V(\varphi_E) = \alpha^{3/2}.$



(iii) Case of Fig. 9. A simple computation shows:

(36) $A(\varphi_E) = 2bc + b^2 = \alpha,$

(37) $V(\varphi_E) = b(b + c)^2 + b^2 c.$

Hence $c = (\alpha - b^2)/2b$. Putting this value in (37), we have

$V(\varphi_E) = \frac{1}{4b}(\alpha^2 + 4\alpha b^2 - b^4)$

$= \frac{1}{4b}\{2\alpha^2 + 2\alpha b^2 - (\alpha - b^2)^2\}$

s i Kl *)* ■ 


(38) 





(iv) Case of Fig. 10. A simple computation shows:

(39) $A(\varphi_E) = 1 - (2bc + b^2) = \alpha,$

(40) $V(\varphi_E) = a + c(a + c)^2 + a^2 b = 2\alpha - 1 + \{b(b + c)^2 + b^2 c\}.$

Consequently, by the result obtained above in the case of Fig. 9,

(41) $V(\varphi_E) \leq 2\alpha - 1 + (1 - \alpha)^{3/2}.$

Summing up, we have thus proved that

(42) $V(\varphi) \leq \max(\alpha^{3/2},\ 2\alpha - 1 + (1 - \alpha)^{3/2})$

for any $\varphi(x, y) \in \Phi$ with $A(\varphi) = \alpha$, $0 \leq \alpha \leq 1$, the equality holding for the symmetric elementary corner sets $E$ of the forms given in Figs. 1 and 2. This completes the proof of our theorem for $\lambda(\alpha)$.

IV. Discussion of $\mu(\alpha)$

Definition 5. Let $E$ be an elementary corner* set in $I_x \times I_y$, and let $\Gamma_E$ be its characteristic graph. $E$ is a special elementary corner* set if there is no segment in $\Gamma_E$ which lies entirely² above or below the diagonal $x = y$. For example, Fig. 11 shows a special elementary corner* set, while this is not the case in Fig. 12.

Lemma 5. For any elementary corner* set $E \subset I_x \times I_y$, there exists a special elementary corner* set $E' \subset I_x \times I_y$, such that $A(\varphi_{E'}) = A(\varphi_E)$, $V(\varphi_{E'}) = V(\varphi_E)$.

² See footnote 1 on page 750.




Proof. Let us assume that the given elementary corner* set $E$ is of the form as given in Fig. 12. We shall replace the part of $\Gamma_E$ consisting of $a, b, c, d$ by $a', b', c', d'$ as indicated in Fig. 12. (Clearly we have $a + c = b + d = a' + c' = b' + d'$.) Let us denote the new set thus obtained by $E'$. The condition $A(\varphi_{E'}) = A(\varphi_E)$ is fulfilled by taking $bc = b'd'$. Then a simple computation shows that $V(\varphi_{E'}) = V(\varphi_E)$.

Thus our lemma is proved if $E$ is an elementary corner* set of the form of Fig. 12. The analogous argument applies to the case when a similar situation happens above the diagonal $x = y$. Finally, if there are more than two (and hence $\geq 4$) consecutive segments or more than one pair of segments which lie entirely above or below the diagonal $x = y$, then we can iterate the same kind of operations, reducing the number of segments lying entirely above or below the diagonal $x = y$ exactly by two in each step, until we finally reach a special elementary corner* set $E^*$ satisfying $A(\varphi_{E^*}) = A(\varphi_E)$, $V(\varphi_{E^*}) = V(\varphi_E)$. The proof of Lemma 5 is completed.

Thus, in order to discuss $\mu(\alpha)$ it suffices to consider special elementary corner* sets only.

Let now $E$ be a special elementary corner* set in $I_x \times I_y$, as in Fig. 11. We denote the segments of its characteristic graph $\Gamma_E$ successively by $a_1, a_1', \cdots, a_n, a_n'$ (see Fig. 11). Those $a_i, a_i'$ which lie above the diagonal $x = y$ are denoted by $b_j, b_j'$, and those below the diagonal by $c_k, c_k'$. We have clearly, $a_i = a_i'$, $b_j = b_j'$, $c_k = c_k'$ and

(43) $\sum_i a_i = \sum_j b_j + \sum_k c_k = 1.$

X i k 

Then a simple computation shows

(44) $A(\varphi_E) = \frac{1}{2}\{1 + \sum_j b_j^2 - \sum_k c_k^2\} = \alpha,$

(45) $V(\varphi_E) = \sum_{i<j<k} a_i a_j a_k + \sum_j b_j^2$

$= \frac{1}{6}\{(\sum_i a_i)^3 - 3 \sum_i a_i^2 \sum_i a_i + 2 \sum_i a_i^3\} + \sum_j b_j^2$

$= \frac{1}{6}\{1 - 3 \sum_i a_i^2 + 2 \sum_i a_i^3\} + \sum_j b_j^2$

$= \frac{1}{6}\{1 + 3(\sum_j b_j^2 - \sum_k c_k^2) + 2 \sum_i a_i^3\}$

$= \frac{1}{6}\{1 + 3(2\alpha - 1) + 2 \sum_i a_i^3\}$

$= \alpha - \frac{1}{3} + \frac{1}{3}\{\sum_j b_j^3 + \sum_k c_k^3\}.$

Thus our problem is transformed into the following one: under the conditions (43) and

(46) $\sum_j b_j^2 - \sum_k c_k^2 = 2\alpha - 1$

to make

(47) $\sum_j b_j^3 + \sum_k c_k^3 = \omega = 3(V(\varphi_E) - \alpha) + 1$

as small as possible, where $b_j \geq 0$ and $c_k \geq 0$ and there is no assumption on the number of $b_j$ and $c_k$.

Let us consider the interval $0 \leq \alpha \leq \frac{1}{2}$. Then it is clear that we have only to consider the case when all $b_j = 0$, and our problem is further reduced to the following one: Under the conditions:

(48) $\sum_k c_k = 1,$

(49) $\sum_k c_k^2 = \beta \equiv 1 - 2\alpha > 0,$

to make

(50) $\sum_k c_k^3 = \omega = 3(V(\varphi_E) - \alpha) + 1$

as small as possible, where $c_k \geq 0$ and we have no assumption on the number $n$ of $c_k$.

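By Schwarz's inequality, we have $n\beta \geq 1$, and it is easy to see that for fixed $n$ with $n\beta \geq 1$, the minimum value $\omega_n$ of $\omega$ is attained by

(51) $c_1 = \cdots = c_{n-1} = \frac{1}{n}\left(1 + \left(\frac{n\beta - 1}{n - 1}\right)^{1/2}\right),$

(52) $c_n = \frac{1}{n}\left(1 - ((n\beta - 1)(n - 1))^{1/2}\right)$

and

(53) $\omega_n = \frac{1}{n^2}\left\{(3n\beta - 2) - \frac{(n - 2)(n\beta - 1)^{3/2}}{(n - 1)^{1/2}}\right\}.$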


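Hence, by (50) and (53), we finally have

(54) $V(\varphi_E) = \alpha - \frac{1}{3} + \frac{1}{3n^2}\left\{(3n(1 - 2\alpha) - 2) - \frac{(n - 2)(n(1 - 2\alpha) - 1)^{3/2}}{(n - 1)^{1/2}}\right\}$

$= \frac{n - 2}{3n^2}\left\{(3\alpha - 1)n + 1 - \frac{(n(1 - 2\alpha) - 1)^{3/2}}{(n - 1)^{1/2}}\right\},$

where $\varphi_E$ is the characteristic function of a special elementary corner* set $E$ of the form given in Fig. 3. It is easy to see that the expression (54) is a monotone increasing function of $n$ for each given $\alpha$ for $n \geq (1 - 2\alpha)^{-1}$. Hence the smallest possible integer with $n\beta = n(1 - 2\alpha) \geq 1$ gives the required value of $\mu(\alpha)$, or in other words, the equality (7) is true for $\frac{1}{2}\left(1 - \frac{1}{n-1}\right) \leq \alpha \leq \frac{1}{2}\left(1 - \frac{1}{n}\right)$.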

Thus we have proved the formula (7) for $n = 2, 3, \cdots$, i.e. for all $\alpha$ satisfying $0 \leq \alpha < \frac{1}{2}$. The formula (9) for $n = 2, 3, \cdots$, or for $\frac{1}{2} < \alpha \leq 1$ then follows from this and from Lemma 1. Finally, the formula (8) for $\alpha = \frac{1}{2}$ follows from (7), (9) and from the fact that $\mu(\alpha)$ is a monotone non-decreasing function of $\alpha$. This completes the discussion of $\mu(\alpha)$.
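
The extremal values (51), (52) and the resulting expressions (53), (54) can be verified numerically. The Python sketch below is an illustration only; it assumes $n\beta \geq 1$ and $(n - 1)\beta \leq 1$ (so that $c_n \geq 0$), and the names `minimizer`, `omega_n` and `v_closed` are ours.

```python
import math


def minimizer(n, beta):
    """The values c_1, ..., c_n of (51) and (52)."""
    s = math.sqrt((n * beta - 1) / (n - 1))
    big = (1 + s) / n                # c_1 = ... = c_{n-1}
    small = (1 - (n - 1) * s) / n    # c_n, since ((n*beta-1)(n-1))^(1/2) = (n-1)*s
    return [big] * (n - 1) + [small]


def omega_n(n, beta):
    """The minimum value (53) of the sum of cubes."""
    tail = (n - 2) * (n * beta - 1) ** 1.5 / math.sqrt(n - 1)
    return ((3 * n * beta - 2) - tail) / n ** 2


def v_closed(n, alpha):
    """The second expression in (54)."""
    beta = 1 - 2 * alpha
    root = (n * beta - 1) ** 1.5 / math.sqrt(n - 1)
    return (n - 2) / (3 * n * n) * ((3 * alpha - 1) * n + 1 - root)


n, alpha = 3, 0.3
beta = 1 - 2 * alpha
c = minimizer(n, beta)
print(sum(c), sum(x * x for x in c), sum(x ** 3 for x in c), omega_n(n, beta))
print(alpha - 1 / 3 + omega_n(n, beta) / 3, v_closed(n, alpha))
```

The first line should reproduce the conditions (48), (49) and the value (53); the second line compares the two expressions in (54).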


Institute for Advanced Study 



Annals of Mathematics
Vol. 43, No. 4, October, 1942


GROUP EXTENSIONS AND HOMOLOGY* 

By Samuel Eilenberg and Saunders MacLane 
(Received May 21, 1942) 

CONTENTS 

Page 

Introduction . 758 

Chapter I. Topological Groups and Homomorphisms . 760 

1. Topological spaces. 760 

2. Topological groups. 761 

3. The group of homomorphisms. 762 

4. Free groups and their factor groups. 763 

5. Closures and extendable homomorphisms. 765 

Chapter II. Group Extensions . 766 

6. Definition of extensions. 767 

7. Factor sets for extensions. 767 

8. The group of extensions. 770 

9. Group extensions and generators. 771 

10. The connection between homomorphisms and factor sets. 771 

11. Applications. 774 

12. Natural homomorphisms. 777 

Chapter III. Extensions of Special Groups . 778 

13. Characters. 778 

14. Modular traces. 780 

15. Extensions of compact groups. 782 

16. Two lemmas on homomorphisms. 784 

17. Extensions of integers. 785 

18. Tensor products. 787 

Chapter IV. Direct and Inverse Systems . 789 

19. Direct systems of groups. 789 

20. Inverse systems of groups. 789 

21. Inverse systems of homomorphisms. 791 

22. Inverse systems of group extensions. 791 

23. Contracted extensions. 793 

24. The group Ext*. 794 

25. Relation to tensor products. 797 

Chapter V. Abstract Complexes. 798 

26. Complexes. 799 

27. Homology and cohomology groups.800 

28. Topology in the homology groups.802 


* Presented to the American Mathematical Society, September 4 and December 31, 1941. Part of the results was published in a preliminary report [5] and also in an appendix to Lefschetz [7]. The numbers in brackets refer to the bibliography at the end of the paper.








































29. The Kronecker index. 803 

30. Construction of homomorphisms. 804 

31. Study of . 806 

32. Computation of the homology groups... 808 

33. Computation of the cohomology groups. 809 

34. The groups ffi . 811 

35. Universal coefficients. 812 

36. Closure finite complexes. 813 

Chapter VI. Topological Spaces. 814 

37. Chain transformations. 814 

38. Naturality. 815

39. Cech’s homology groups. 816 

40. Formulas for a general space. 818 

41. The case q = 0. 820

42. Fundamental complexes. 820 

43. Relations between a space and its fundamental complex. 821 

44. Formulas for a compact metric space. 823 

45. Regular cycles. 824 

Appendix A. Coefficient Groups with Operators. 825 

Appendix B. Solenoids. 829 

Bibliography. 831 


Introduction 

In 1937 the following problem was formulated by Borsuk and Eilenberg: Given a solenoid¹ $\Sigma$ in the three sphere $S^3$, how many homotopy classes of continuous mappings $f(S^3 - \Sigma) \subset S^2$ are there? In 1939 Eilenberg proved ([4], p. 251) that the homotopy classes in question are in a 1-1 correspondence with the elements of the one-dimensional homology group $H^1(K, I) = Z^1(K, I)/B^1(K, I)$, where $K$ is any representation of $S^3 - \Sigma$ as a complex, $Z^1(K, I)$ is the group of infinite 1-cycles in $K$ with the additive group $I$ of integers as coefficients and $B^1(K, I)$ is the subgroup of bounding cycles. This homology group is generally much “larger” than the conventional homology group $\bar{H}^1(K, I) = Z^1/\bar{B}^1$ where $\bar{B}^1(K, I)$ is the group of cycles that bound on every finite portion of $K$; with an appropriate topology in the group $Z^1$, $\bar{B}^1$ turns out to be exactly the closure of $B^1$.

At this point the investigation was taken up by Steenrod [10]. By using “regular cycles” he computed the groups $H^1(S^3 - \Sigma)$ for the various solenoids $\Sigma$. The groups are uncountable and of a rather complicated nature.²

This paper originated from an accidental observation that the groups obtained by Steenrod were identical with some groups that occur in the purely algebraic theory of extensions of groups. An abelian group $E$ is called an extension of the group $G$ by the group $H$ if $G \subset E$ and $H = E/G$. With a proper definition of equivalence and addition, the extensions of $G$ by $H$ themselves form an abelian group Ext $\{G, H\}$. It turns out that $H^1(S^3 - \Sigma, I)$ is isomorphic with Ext $\{I, \Sigma^*\}$ where $\Sigma^*$ is a properly chosen subgroup of the group of rational numbers.³

¹ For the definition see Appendix B below.

² A popular exposition of Steenrod's results can be found in his article in Lectures in Topology, Ann Arbor, University of Michigan Press, 1941, pp. 43-55.

The thesis of this paper is that the theory of group extensions forms a natural 
and powerful tool in the study of homologies in infinite complexes and topo¬ 
logical spaces. Even in the simple and familiar case of finite complexes the 
results obtained are finer than the existing ones. 

Our fundamental theorem concerns the homology groups of a star finite complex $K$. Let $H^q(G)$ denote the homology group of infinite cycles with coefficients in an arbitrary topological group $G$. We obtain an explicit expression for $H^q(G)$ in terms of $G$ and the cohomology groups $\mathcal{H}_q$ of finite cocycles with integral coefficients. ($\mathcal{H}_q$ is the factor group $\mathcal{Z}_q/\mathcal{B}_q$ of cocycles modulo coboundaries). This expression is

$H^q(G) = \text{Hom}\,\{\mathcal{H}_q, G\} \times \text{Hom}\,\{\mathcal{B}_{q+1}, G\}/\text{Hom}\,\{\mathcal{Z}_{q+1} \mid \mathcal{B}_{q+1}, G\}.$

Here Hom $\{H, G\}$ stands for the (topological) group of all homomorphisms of $H$ into $G$, while Hom $\{\mathcal{Z}_{q+1} \mid \mathcal{B}_{q+1}, G\}$ denotes the group of those homomorphisms of $\mathcal{B}_{q+1}$ into $G$ which can be extended to homomorphisms of $\mathcal{Z}_{q+1}$ into $G$. The factor group on the right in this expression appears to depend on the groups $\mathcal{B}_{q+1}$ and $\mathcal{Z}_{q+1}$, but actually depends only on the cohomology group $\mathcal{H}_{q+1} = \mathcal{Z}_{q+1}/\mathcal{B}_{q+1}$. In fact this factor group can best be interpreted as the group “Ext” of group extensions of $G$ by $\mathcal{H}_{q+1}$. The fundamental theorem then has the form

$H^q(G) = \text{Hom}\,\{\mathcal{H}_q, G\} \times \text{Ext}\,\{G, \mathcal{H}_{q+1}\}.$

The paper is self contained as far as possible, both in algebraic and topo¬ 
logical respects. The first four chapters below develop the requisite group- 
theoretical notions. Chapter I discusses the groups of homomorphisms in¬ 
volved in the above formula, while Chapter II introduces the group of group 
extensions, and proves the fundamental theorem relating this group to groups 
of homomorphisms. This fundamental theorem is essentially a formulation of 
the known fact that a group extension of G by H can be described either by 
generators of H (and hence by homomorphisms) or by certain “factor sets.” 
Chapter III analyzes the group Ext $\{G, H\}$ for some special cases of $G$. Chapter
IV introduces some additional groups, closely related to Ext, which arise as 
inverse limit groups in the treatment of homologies of topological spaces. 

The last two chapters analyze homology groups. Chapter V treats the case of a complex, and proves the fundamental theorem quoted above, as well as parallel theorems for some of the other homology groups of a complex. Chapter VI obtains analogous theorems for the Čech homology groups of a topological space.

Appendix A discusses the case when $G$ is a group with operators. Appendix B contains a computation of the group Ext $\{I, \Sigma^*\}$ mentioned above.

³ More precisely $\Sigma^*$ is the character group of $\Sigma$. The detailed treatment appears in Appendix B below.

Each chapter is preceded by a brief outline. The chapters are related as in 
the following diagram: 

I II III -> V 
\ \ 

IV->VI 

Almost all of V can be read directly after I and II, and a major portion after 
I alone. 

Chapters V and VI are strongly influenced by S. Lefschetz's recent book 
“Algebraic Topology” [7], that the authors had the privilege of reading in 
manuscript. 

Chapter I. Topological Groups and Homomorphisms 

After certain preliminary definitions, this chapter introduces the basic group Hom $\{R, G\}$ of homomorphisms. In the case when $R$ is a subgroup of a free group, we require two subgroups of “extendable” homomorphisms. The topology of these subgroups is investigated when the “coefficient group” $G$ is itself topological.


1. Topological spaces 

A set X is called a space if there is given a family of subsets of X, called open 
sets, such that 

(1.1) X and the void set are open , 

(1.2) the union of any number of open sets is open , 

(1.3) the intersection of two open sets is open . 

Complements of open sets are called closed. X is called a Hausdorff space if in 
addition 

(1.4) every two distinct points are contained respectively in two disjoint open sets. 
X is called a compact (= bicompact) space if 

(1.5) every covering of X by open sets contains a finite subcovering. 

A space X is discrete if every set in X is open. 
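
For a finite set the conditions (1.1) to (1.4) can be tested mechanically. The Python sketch below is an illustration, not part of the original text; it assumes a finite set of points, in which case closure under pairwise unions already gives (1.2), and the function names are ours.

```python
from itertools import combinations


def is_topology(points, open_sets):
    """Check (1.1), (1.2), (1.3) for a finite family of subsets of a finite set."""
    family = {frozenset(s) for s in open_sets}
    if frozenset() not in family or frozenset(points) not in family:
        return False
    for u, v in combinations(family, 2):
        if (u | v) not in family or (u & v) not in family:
            return False
    return True


def is_hausdorff(points, open_sets):
    """Check (1.4): distinct points lie in disjoint open sets."""
    family = [frozenset(s) for s in open_sets]
    for p, q in combinations(list(points), 2):
        separated = any(
            p in u and q in v and not (u & v) for u in family for v in family
        )
        if not separated:
            return False
    return True


points = [0, 1, 2]
chain = [set(), {0}, {0, 1}, {0, 1, 2}]
discrete = [set(c) for r in range(4) for c in combinations(points, r)]
print(is_topology(points, chain), is_hausdorff(points, chain))
print(is_topology(points, discrete), is_hausdorff(points, discrete))
```

The family `chain` is a space in the sense above but not a Hausdorff space, while the discrete family satisfies all four conditions.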

The intersection of an open set of a space X with a subset A of X will be 
called open in A. With this convention A becomes a space. 

Let $X$ and $Y$ be spaces and $x \to f(x) = y$ a mapping of $X$ into a subset of $Y$. The mapping $f$ is continuous if for every open set $U \subset Y$ the set $f^{-1}(U)$ is open (in $X$). The mapping $f$ is open if for every open set $U \subset X$ the set $f(U)$ is open (in $Y$). A well known result is

Lemma 1.1. If $f$ is a continuous mapping of a compact space $X$ into a Hausdorff space $Y$, then $f(X)$ is closed in $Y$.

A product space $\prod_\alpha X_\alpha$ of a given collection $\{X_\alpha\}$ of spaces $X_\alpha$ is defined as the space whose points are all collections $\{x_\alpha\}$, $x_\alpha \in X_\alpha$ and in which open sets are unions of sets of the form $\prod_\alpha U_\alpha$, where $U_\alpha$ is an open subset of $X_\alpha$ and $U_\alpha = X_\alpha$ except for a finite number of indices $\alpha$.⁴ It is known that $\prod X_\alpha$ is a Hausdorff or compact space if and only if for every $\alpha$ the space $X_\alpha$ is a Hausdorff or compact space.⁵

Let $A$ be a set of elements and $X$ be a space. We consider the set $X^A$ of all functions with arguments in $A$ and values in $X$. The set $X^A$ is clearly in a 1-1 correspondence with the product $\prod_\lambda X_\lambda$ where $\lambda \in A$ and $X_\lambda = X$. Hence we may consider $X^A$ as a space.

2. Topological groups 

Only abelian groups (written additively) will be considered. 

A group $G$ will be called a generalized topological group if $G$ is a space in which the group composition (as a mapping $G \times G \to G$) and the group inverse (as a mapping $G \to G$) are continuous.

If $G$, considered as a space, is a Hausdorff space, then $G$ will be called a topological group.⁶ Similarly, if $G$ is compact as a space we shall say that $G$ is a compact group.

A subgroup of a (generalized) topological group is a (generalized) topological 
group. A closed subgroup of a compact group is compact. 

Lemma 2.1. In a generalized topological group G the following properties are 
equivalent: 

(a) every point of G is a closed set , 

(b) the zero element of G is a closed set , 

(c) $G$ is a topological group.⁷

The factor group $H = G/G_1$ of a generalized topological group $G$ modulo a subgroup $G_1$ is the group of all cosets $g + G_1$ of $G_1$ in $G$. The correspondence $\varphi(G) = H$ carrying each $g \in G$ into its coset $\varphi g = g + G_1$ in $H$ is the “natural” mapping of $G$ on $H$. We introduce a topology in $H$ by calling a set $U \subset H$ open if and only if $\varphi^{-1}(U)$ is open in $G$. It can be shown that this topology is the only one under which $\varphi$ will be both open and continuous.

Lemma 2.2. If $G$ is a generalized topological group and $G_1$ is an arbitrary subgroup of $G$, then the factor group $H = G/G_1$ is a generalized topological group; it is a topological group if and only if $G_1$ is a closed subgroup of $G$. If $G$ is compact, then $G/G_1$ is compact.

Lemma 2.3. The closure $\bar{0}$ of the zero element of a generalized topological group is a closed subgroup of $G$. Its factor group $G/\bar{0}$ is the “largest” factor group of $G$ which is a topological group.

The preceding two statements show the utility of the study of generalized topological groups. Several times in the sequel we need to consider an isomorphism

(2.1) $G_1/H_1 \cong G_2/H_2$

where the $G_i$ are topological groups, while the $H_i$ are not closed, so that $G_i/H_i$ are only generalized topological groups. However, if we are able to prove that the isomorphism (2.1) is continuous in both directions in the “generalized” topology of the groups $G_i/H_i$, we obtain as a corollary the bicontinuous isomorphism of the topological groups $G_i/\bar{H}_i$.

⁴ If $\{\alpha\} = 1, 2, \cdots, n$ we also use the symbol $X_1 \times X_2 \times \cdots \times X_n$ for the product space.

⁵ See C. Chevalley and O. Frink, Bulletin Amer. Math. Soc. 47 (1941), pp. 612-614.

⁶ $G$ is then a topological group in the sense of Pontrjagin [8].

⁷ To prove that a) implies c) one first proves that each neighborhood of $g$ contains the closure of a neighborhood of $g$, as in Pontrjagin [8], p. 43, proposition F.

If $\{G_\alpha\}$ is a collection of generalized topological groups the direct product $\prod_\alpha G_\alpha$ is a generalized topological group, provided we define the sum $\{g_\alpha\} = \{g_\alpha'\} + \{g_\alpha''\}$ by setting $g_\alpha = g_\alpha' + g_\alpha''$ for every $\alpha$. Similarly, if $A$ is any set and $G$ is a generalized topological group, then the set $G^A$ of all mappings of $A$ into $G$ is a generalized topological group. It follows from the results quoted in §1 that $\prod_\alpha G_\alpha$ and $G^A$ are topological or compact groups if and only if the groups $G_\alpha$ and $G$ are all topological or compact, respectively.

3. The group of homomorphisms 

Let $G$ and $H$ be generalized topological groups. A homomorphism $\theta$ of $H$ into $G$ is a continuous function $\theta(h)$ defined for all $h \in H$ with values in $G$, such that $\theta(h_1 + h_2) = \theta(h_1) + \theta(h_2)$. For instance, the natural mapping of a group into one of its factor groups is a homomorphism. If $\theta_1$ and $\theta_2$ are two homomorphisms their sum $\theta_1 + \theta_2$, defined by

$(\theta_1 + \theta_2)(h) = \theta_1(h) + \theta_2(h)$, (all $h$ in $H$)

is also a homomorphism. Under this addition, the set of all homomorphisms $\theta$ of $H$ into $G$ constitutes a group, which we denote by Hom $\{H, G\}$:

(3.1) Hom $\{H, G\}$ = [all homomorphisms $\theta$ of $H$ into $G$].
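
For finite cyclic groups with the discrete topology, where every mapping is continuous, the group (3.1) can be listed explicitly. The Python sketch below is an illustration under that assumption: it takes $H = Z_m$ and $G = Z_n$ (integers modulo $m$ and $n$), and the names `homomorphisms` and `add` are ours.

```python
def homomorphisms(m, n):
    """All homomorphisms Z_m -> Z_n, each stored as (theta(0), ..., theta(m-1))."""
    homs = []
    for k in range(n):
        # theta is determined by theta(1) = k, which must satisfy m * k = 0 in Z_n.
        if (m * k) % n == 0:
            homs.append(tuple((k * h) % n for h in range(m)))
    return homs


def add(theta1, theta2, n):
    """The sum (theta1 + theta2)(h) = theta1(h) + theta2(h), computed in Z_n."""
    return tuple((x + y) % n for x, y in zip(theta1, theta2))


homs = homomorphisms(4, 6)
print(homs)
print([add(t1, t2, 6) in homs for t1 in homs for t2 in homs])
```

The sum of any two of the listed homomorphisms is again in the list, which is the closure property used in defining the group Hom $\{H, G\}$.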

To introduce a (generalized) topology in Hom $\{H, G\}$, take any compact subset $X$ of $H$ and any open subset $V$ of $G$ with $0 \in V$ and consider the set $U(X, V)$ of all $\theta$ with $\theta(X) \subset V$. In the usual sense ([8], p. 55) these sets $U(X, V)$ constitute a complete set of neighborhoods of 0 in Hom $\{H, G\}$, and are used to define the topology of Hom $\{H, G\}$.⁸

If $H$ is discrete, the compact subsets $X$ of $H$ are just the finite ones. In this case Hom $\{H, G\}$ is a subgroup of the group $G^H$ with the topology as defined in §2.

Lemma 3.1. If $G$ is a topological group and $H$ is discrete, then Hom $\{H, G\}$ is a closed subgroup of the group $G^H$ of all mappings of $H$ into $G$.

Proof. Let $\varphi_0 \in G^H$ be a mapping of $H$ into $G$ that is not a homomorphism. There are then elements $h_1, h_2, h_3$ in $H$ such that $h_1 + h_2 = h_3$ and $\varphi_0(h_1) + \varphi_0(h_2) \neq \varphi_0(h_3)$. Since $G$ is a Hausdorff space and the group composition is continuous there are in $G$ three open sets $U_1, U_2, U_3$ containing $\varphi_0(h_1)$, $\varphi_0(h_2)$, and $\varphi_0(h_3)$, respectively, such that⁹ $(U_1 + U_2) \cap U_3 = 0$. Consequently the open subset $U$ of $G^H$ consisting of the mappings $\varphi$ such that $\varphi(h_1) \in U_1$, $\varphi(h_2) \in U_2$, and $\varphi(h_3) \in U_3$ has no elements in common with Hom $\{H, G\}$. Hence Hom $\{H, G\}$ is closed.

⁸ This is the general definition stated by Weil [11], p. 99, and Lefschetz [7], Ch. II.

Corollary 3.2. If $H$ is discrete and $G$ is a topological (and compact) group, then Hom $\{H, G\}$ is a topological (and compact) group.

Note that the topology of Hom $\{H, G\}$ may not be discrete even though $H$ and $G$ both have discrete topologies. Observe also that if $H$ is discrete, an alteration in the topology of $G$ may alter the topology of Hom $\{H, G\}$ but not its algebraic structure. However, if $H$ carries a non-discrete topology, an alteration in the topology of either $H$ or $G$ may alter the algebraic structure of Hom $\{H, G\}$, in that continuous homomorphisms may cease to be continuous, or vice versa.

If $H$ is compact, we can take $H$ itself to be the compact set $X$ used in the definition of the topology in Hom $\{H, G\}$. Consequently, given any open set $V$ in $G$ containing 0, the homomorphisms $\theta$, such that $\theta(H) \subset V$, constitute an open set. Hence if $V$ can be picked so as not to contain any subgroups but 0, we see that Hom $\{H, G\}$ is discrete.

Subgroups and factor groups of $H$ will correspond respectively to factor groups
and subgroups of $\mathrm{Hom}\{H, G\}$, as stated in the following lemmas.

Lemma 3.3. If $H/H_1$ is a factor group of the discrete group $H$, then
$\mathrm{Hom}\{H/H_1, G\}$ is (bicontinuously) isomorphic to that subgroup of $\mathrm{Hom}\{H, G\}$
which consists of the homomorphisms $\theta$ mapping every element of $H_1$ into zero.

The proof is readily given by observing that each homomorphism $\theta$ with
$\theta(H_1) = 0$ maps each coset of $H_1$ into a single element of $G$, so induces a
homomorphism $\theta'$ of $H/H_1$. The continuity of the isomorphism $\theta \to \theta'$ can be
established, as always for isomorphisms between groups, by showing continuity
at $\theta = 0$ ([8], p. 63).

Lemma 3.4. If $L$ is a subgroup of $H$, then each homomorphism $\theta$ of $H$ into $G$
induces a homomorphism $\theta' = \theta \mid L$ of $L$ into $G$. The correspondence $\theta \to \theta'$
is a (continuous) homomorphism of $\mathrm{Hom}\{H, G\}$ into $\mathrm{Hom}\{L, G\}$. If $L$ is a
direct factor of $H$, this correspondence maps $\mathrm{Hom}\{H, G\}$ onto $\mathrm{Hom}\{L, G\}$.

4. Free groups and their factor groups 

The homology groups will be interpreted later as certain groups of
homomorphisms of “free” groups, which we now define. If the elements $z_\alpha$ of a
discrete group $F$ are such that every element of $F$ can be represented uniquely
as a finite sum $\sum n_\alpha z_\alpha$ with integral coefficients $n_\alpha$, $F$ is said to be a free abelian
group with generators (or basis elements) $\{z_\alpha\}$. The number of generators
may be infinite. A free group can be constructed with any assigned set of
symbols as basis elements.

$^{9}$ $U_1 + U_2$ is the set of all sums $g_1 + g_2$, with $g_i \in U_i$. The symbol $\cap$ stands for the
set-theoretic intersection.





Lemma 4.1. Every proper subgroup of a free group is free.

For the denumerable case, this is proved by Čech [3]; a general proof is given
in Lefschetz [7] (II, (10.1)).

Any discrete group $H$ can be represented as a homomorphic image of a free
group. Specifically, if we choose any set of elements $t_\alpha$ in $H$ which together
generate all of $H$, and if we then construct a free group $F$ with generators $z_\alpha$ in
1-1 correspondence $z_\alpha \leftrightarrow t_\alpha$ with the given $t$'s, the correspondence
$\sum n_\alpha z_\alpha \to \sum n_\alpha t_\alpha$ will map the free group $F$ homomorphically onto the given group $H$.
If the kernel of this homomorphism$^{10}$ is $R$, $H$ may be represented as the factor
group $H = F/R$. $R$ is essentially the group of “relations” on the generators
$t_\alpha$ of $H$.
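A small worked instance may make the representation $H = F/R$ concrete. The choice of $H$ below (a direct sum of two cyclic groups of order 2, with generators $t_1$, $t_2$) is an assumption made for this example and does not come from the text.

```latex
F = \{\, n_1 z_1 + n_2 z_2 : n_1, n_2 \in \mathbb{Z} \,\}, \qquad
n_1 z_1 + n_2 z_2 \;\longmapsto\; n_1 t_1 + n_2 t_2 ,
\\[4pt]
R = \{\, n_1 z_1 + n_2 z_2 : n_1 \equiv n_2 \equiv 0 \pmod{2} \,\}, \qquad
H \cong F/R .
```

Here $R$ is itself a free group, with generators $2z_1$ and $2z_2$, in agreement with Lemma 4.1; these two generators are exactly the relations $2t_1 = 0$ and $2t_2 = 0$ holding in $H$.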

Given $R \subset F$, each homomorphism $\phi$ of $F$ into $G$ induces a homomorphism
$\theta = \phi \mid R$ of the subgroup $R$ into $G$, and the homomorphisms so induced form
a subgroup of $\mathrm{Hom}\{R, G\}$, denoted as

(4.1) $\mathrm{Hom}\{F \mid R, G\}$ = [all $\theta = \phi \mid R$, for $\phi \in \mathrm{Hom}\{F, G\}$].

Alternatively, the elements of this subgroup can be described as those
homomorphisms $\theta$ of $R$ into $G$ which can be extended (in at least one way) to
homomorphisms of $F$ into $G$.

A similar, but lighter, restriction may be imposed as follows: Given
$\theta \in \mathrm{Hom}\{R, G\}$, require that for every subgroup $F_0 \supset R$ of $F$ for which $F_0/R$
is finite there exist an extension of $\theta$ to a homomorphism of $F_0$ into $G$. The
$\theta$'s meeting this requirement also constitute a subgroup,

(4.2) $\mathrm{Hom}_f\{R, G; F\}$ = [all $\theta \in \mathrm{Hom}\{F_0 \mid R, G\}$ for every finite $F_0/R$].

These two subgroups,

$\mathrm{Hom}\{F \mid R, G\} \subset \mathrm{Hom}_f\{R, G; F\} \subset \mathrm{Hom}\{R, G\}$,

are important because the corresponding factor groups in $\mathrm{Hom}\{R, G\}$ are
invariants of the group $H = F/R$, in that they do not depend on the particular
free group $F$ chosen to represent $H$. This fact may be stated as follows.

Theorem 4.2. If $H$ is isomorphic to two factor groups $F/R$ and $F'/R'$ of free
groups $F$ and $F'$, then

(4.3) $\mathrm{Hom}\{R, G\}/\mathrm{Hom}\{F \mid R, G\} \cong \mathrm{Hom}\{R', G\}/\mathrm{Hom}\{F' \mid R', G\}$,

the isomorphism being both algebraic and topological. The same result holds for
the factor groups

(4.4) $\mathrm{Hom}\{R, G\}/\mathrm{Hom}_f\{R, G; F\}$, $\mathrm{Hom}_f\{R, G; F\}/\mathrm{Hom}\{F \mid R, G\}$.

This theorem is a corollary of a result to be established in Chapter II, as
Theorem 10.1. It can also be proved directly, by appeal to the following
lemma, which we state without proof.

$^{10}$ The kernel of a homomorphism $\theta$ of a group $H$ is the set of all elements $h \in H$ with
$\theta(h) = 0$.





Lemma 4.3. Let $F/R = E/G$, where $F \supset R$ is a free group and $E \supset G$ is any
other group. There exists a homomorphism $\phi$ of $F$ into $E$ such that, in the given
identification of cosets of $G$ with cosets of $R$,

(4.5) $\phi(x) + G = x + R$, for all $x \in F$.

Any other $\phi^* \in \mathrm{Hom}\{F, E\}$ with this property (4.5) has the form $\phi^* = \phi + \theta$,
for some $\theta \in \mathrm{Hom}\{F, G\}$. Conversely, given $\phi$ with the property (4.5), any such
$\phi + \theta$ has the same property.

Although a given group $H$ can be represented in many ways as a factor group
$H = F/R$ of a free group, there is a “natural” such representation, in which $F$
is the additive group $F_H$ of the (integral) group ring of $H$. Specifically, given
$H$, we choose for each $h \in H$ a symbol $z_h$ and construct a free group $F_H$ generated
by the symbols $z_h$. The correspondence $z_h \to h$ induces a homomorphism
of $F_H$ on $H$. Let $R_H$ denote the kernel of this homomorphism. The factor
group (4.3) of the Theorem can then be described invariantly in terms of $H$ and
$G$ as the group

$\mathrm{Hom}\{R_H, G\}/\mathrm{Hom}\{F_H \mid R_H, G\}$.

The same remark applies to the factor groups of (4.4). It would be possible to
use the groups so described as substitutes for the group of group extensions to
be introduced in Chapter II.

5. Closures and extendable homomorphisms

If $G$ is topological, we wish to examine the closures of the groups
$\mathrm{Hom}\{F \mid R, G\}$ and $\mathrm{Hom}_f$ in the topological group $\mathrm{Hom}\{R, G\}$. A preliminary
is a characterization of the subgroup $\mathrm{Hom}_f$.

Lemma 5.1. A homomorphism $\theta$ of $\mathrm{Hom}\{R, G\}$ lies in $\mathrm{Hom}_f\{R, G; F\}$ if
and only if for each element $t$ in $F$ with a multiple $mt$ in $R$ there exists $h \in G$ with
$\theta(mt) = mh$.

Proof. Let $F_t$ be the subgroup of $F$ generated by $t$ and $R$. If $mt \in R$ for
$m \neq 0$, $F_t/R$ is finite and cyclic, so that $\theta \in \mathrm{Hom}_f$ is extendable to $F_t$. Hence
the condition stated on $\theta(mt)$ is necessary. Conversely, for any given group
$F_0 \subset F$ with $F_0/R$ finite we can write $F_0/R$ as a direct product of cyclic groups.
By applying the given condition on $\theta$ to each of these cyclic groups, we find an
extension of $\theta$ to $F_0$, as required.

Another characterization of $\mathrm{Hom}_f$ can be found; the proof is similar:

Lemma 5.2. A homomorphism $\theta$ of $\mathrm{Hom}\{R, G\}$ lies in $\mathrm{Hom}_f\{R, G; F\}$ if
and only if $\theta$ can be extended to a homomorphism (into $G$) of each subgroup $F_0$ of
$F$ which contains $R$ and for which the factor group $F_0/R$ has a finite number of
generators.

We now consider the topology on $\mathrm{Hom}\{R, G\}$.

Lemma 5.3. If $G$ and hence $\mathrm{Hom}\{R, G\}$ are generalized topological groups,
$\mathrm{Hom}_f\{R, G; F\}$ is contained in the closure of $\mathrm{Hom}\{F \mid R, G\}$, or

$\mathrm{Hom}\{F \mid R, G\} \subset \mathrm{Hom}_f\{R, G; F\} \subset \overline{\mathrm{Hom}\{F \mid R, G\}} \subset \mathrm{Hom}\{R, G\}$.

Proof. Let $\theta_0$ be in $\mathrm{Hom}_f\{R, G; F\}$, while $U$ is any open set of $\mathrm{Hom}\{R, G\}$
containing $\theta_0$. Since $F$ is discrete, the definition of the topology in $\mathrm{Hom}\{R, G\}$
implies that there is a finite set of elements $r_1, \dots, r_n$ of $R$ such that $U$ contains
all $\theta$ for which each $\theta(r_i) = \theta_0(r_i)$. The elements $r_i$ are all contained in a
subgroup $F_0$ of $F$ generated by a finite number of the given independent generators
of the free group $F$. Since $\theta_0 \in \mathrm{Hom}_f$, $\theta_0$ has an extension $\theta'$ to the group
generated by $F_0$ and $R$ (Lemma 5.2). Introduce a new homomorphism $\theta^*$ of $F$ by
setting $\theta^*(z_\alpha) = \theta'(z_\alpha)$ for each generator $z_\alpha$ of $F_0$, $\theta^*(z_\alpha) = 0$ otherwise. This
$\theta^*$ induces a homomorphism $\theta$ of $R$, which agrees with $\theta_0$ on the original elements
$r_1, \dots, r_n$ and which is by construction an element of $\mathrm{Hom}\{F \mid R, G\}$. In
other words, the arbitrary neighborhood $U$ of $\theta_0$ does contain a homomorphism
$\theta \in \mathrm{Hom}\{F \mid R, G\}$. This proves the lemma.

Lemma 5.4. If $G$ is a compact topological group, $\mathrm{Hom}\{F \mid R, G\}$ is a closed
subgroup of $\mathrm{Hom}\{R, G\}$, and hence $\mathrm{Hom}\{F \mid R, G\} = \mathrm{Hom}_f\{R, G; F\}$.

Proof. By Corollary 3.2, both the groups $\mathrm{Hom}\{R, G\}$ and $\mathrm{Hom}\{F, G\}$
are compact and topological. The second of these groups is mapped
homomorphically onto $\mathrm{Hom}\{F \mid R, G\}$ by the continuous correspondence $\phi \to \phi \mid R$
of Lemma 3.4. Therefore, by Lemma 1.1, the image $\mathrm{Hom}\{F \mid R, G\}$ is closed.

For any integer $m$, let $mG$ be the subgroup of all elements of the form $mg$,
with $g$ in $G$. A condition for the closure of $\mathrm{Hom}_f$ may be stated in terms of
these subgroups.

Lemma 5.5. If $G$ is a generalized topological group, then $\mathrm{Hom}_f\{R, G; F\}$ is
closed in $\mathrm{Hom}\{R, G\}$ whenever every subgroup $mG$ of $G$ is closed in $G$, for
$m = 2, 3, \dots$.$^{11}$

Proof. Let $\theta$ be a homomorphism in the closure of $\mathrm{Hom}_f\{R, G; F\}$. Consider
an arbitrary $t$ in $F$ such that $mt \in R$. By Lemma 5.1 and the given condition
on $G$ it will suffice to prove that $\theta(mt) \in mG$. Let $V$ be any open set
containing $0$ in $G$. By the definition of the topology in $\mathrm{Hom}\{R, G\}$, there
exists for $\theta$ in the closure of $\mathrm{Hom}_f$ an element $\theta'$ in $\mathrm{Hom}_f$ itself, such
that $\theta'(mt) - \theta(mt) \in V$. But $\theta'(mt)$ is in $mG$, so that the arbitrary open set
$V + \theta(mt)$ does contain an element of $mG$. This proves $\theta(mt)$ in $mG$, as
required.

An examination of this proof shows that the given condition on G can be 
somewhat weakened. It suffices to require that the subgroup mG be closed in G 
for every integer m which is the order of an element of F/R. The same remark 
will apply in various subsequent cases when this condition on G is used. 

Chapter II. Group Extensions 

This chapter introduces the basic group $\mathrm{Ext}\{G, H\}$ of all group extensions of
$G$ by $H$, and its subgroup $\mathrm{Ext}_f\{G, H\}$ of all extensions which are “finitely trivial”
(§8). Each individual group extension can be described either by a suitable
“factor set” (§7) or by a certain homomorphism. The equivalence of these two
representations is the fundamental theorem of this chapter (Theorem 10.1);
it gives an expression of $\mathrm{Ext}\{G, H\}$ as one of the factor-homomorphism groups
already considered in Chapter I. This fundamental theorem, which is implicit
in previous algebraic work on group extensions, is of independent algebraic
interest. The chapter closes with a proof that the representation of $\mathrm{Ext}\{G, H\}$
by homomorphisms is a “natural” one (§12). This conclusion is needed for the
subsequent limiting process, which is used in defining the Čech homology groups.

$^{11}$ If every subgroup $mG$ is closed in $G$, Steenrod [9] and Lefschetz [7] say that $G$ has the
“division closure property.”

6. Definition of extensions 

A group $E$ having $G$ as subgroup and $H = E/G$ as the corresponding factor
group is said to be an “extension” of $G$ by $H$. More explicitly, if the groups
$G$ and $H$ are given, a group extension of $G$ by $H$ is a pair $(E, \beta)$, where $E$ is a group
containing $G$ and $\beta$ is a homomorphism of $E$ onto $H$ under which exactly the
elements of $G$ are mapped into $0 \in H$.$^{12}$ Such a $\beta$ induces an isomorphism of
$E/G$ to $H$. For given $G$ and $H$, two extensions $(E_1, \beta_1)$ and $(E_2, \beta_2)$ are regarded
as equivalent if and only if there is an isomorphism $\omega$ of $E_1$ to $E_2$ which leaves
elements of $G$ and cosets of $H$ fixed. In other words, the isomorphism $\omega$ of
$E_1$ to $E_2$ must have $\omega g = g$ for $g \in G$ and $\beta_2 \omega x = \beta_1 x$ for $x \in E_1$. We regard
equivalent extensions as identical, and so study the equivalence classes of
extensions of $G$ by $H$. It will appear that these equivalence classes are themselves
the elements of a group.

For given $G$ and $H$, the direct product $G \times H$ has the “natural”
homomorphism $(g, h) \to h$ onto $H$, and so can be regarded as an extension of $G$ by $H$.
Any extension $(E, \beta)$ equivalent to this direct product (with its natural
homomorphism) is said to be a trivial extension of $G$ by $H$.
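A standard small example, supplied here as an illustration and not taken from the text, shows that non-trivial extensions exist. Take both groups cyclic of order 2 and let $E$ be cyclic of order 4:

```latex
E = \mathbb{Z}/4\mathbb{Z}, \qquad G = \{0, 2\} \subset E, \qquad
\beta : E \to H = \mathbb{Z}/2\mathbb{Z}, \quad \beta(x) = x \bmod 2 .
% \beta maps E onto H and its kernel is exactly G,
% so (E, \beta) is an extension of G by H.
% E contains an element of order 4, whereas every element of G \times H
% has order at most 2; hence E is not isomorphic to G \times H,
% and the extension (E, \beta) is not trivial.
```

Since an equivalence of extensions is in particular an isomorphism of the groups $E_1$ and $E_2$, the comparison of element orders is enough to separate this extension from the direct product.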

7. Factor sets for extensions 

A given extension $(E, \beta)$ of $G$ by $H$ can be described in terms of representatives
for elements of $H$. To each $h$ in $H$ select in $E$ a representative $u(h)$, such that
$\beta(u(h)) = h$. Every element of $E$ lies in some coset $h$, so has the form $g + u(h)$
for $g$ in $G$. The sum of any two representatives $u(h)$ and $u(k)$ will lie in the same
coset, modulo $G$, as does the representative of the sum $h + k$. Hence there is
an addition table of the form

(7.1) $u(h) + u(k) = u(h + k) + f(h, k)$,

where $f(h, k)$ lies in $G$ for each pair of elements $h$, $k$ in $H$. The commutative
and associative laws in the group $E$ imply two corresponding identities for $f$,

(7.2) $f(h, k) = f(k, h)$,

(7.3) $f(h, k) + f(h + k, l) = f(h, k + l) + f(k, l)$.

The sum of any two elements $g_1 + u(h)$ and $g_2 + u(k)$ of $E$ is determined by the
addition table (7.1) and the addition given within $G$ and $H$.

$^{12}$ Group extensions are discussed by Baer [2], Hall [6], Turing [11], Zassenhaus [15], and
elsewhere. Much of the discussion in the literature treats the more general case in which
$G$ but not $H$ is assumed to be abelian and in which $G$ is not necessarily in the center of $H$.
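The identities (7.2) and (7.3) can be checked mechanically on a small case. The sketch below is an illustration under stated assumptions: $E$ is taken to be the integers modulo $mn$, $G$ the multiples of $m$ in $E$, $H = Z_m$, and the representatives are $u(h) = h$; the resulting $f$ is the familiar "carry" function. The names `carry_factor_set` and `is_factor_set` are hypothetical helpers.

```python
def carry_factor_set(m, n):
    """Factor set of H = Z_m in G = {0, m, 2m, ...} inside E = Z_{mn}, using u(h) = h."""
    def f(h, k):
        # f(h, k) = u(h) + u(k) - u(h + k), computed in E = Z_{mn}; the value is 0 or m
        return (h + k - (h + k) % m) % (m * n)
    return f

def is_factor_set(f, m, modulus):
    """Check (7.2) and (7.3) for f on H = Z_m, adding values of f modulo `modulus`."""
    H = range(m)
    symmetric = all(f(h, k) == f(k, h) for h in H for k in H)            # (7.2)
    associative = all(                                                    # (7.3)
        (f(h, k) + f((h + k) % m, l)) % modulus
        == (f(h, (k + l) % m) + f(k, l)) % modulus
        for h in H for k in H for l in H
    )
    return symmetric and associative

f = carry_factor_set(2, 2)   # E = Z_4, G = {0, 2}, H = Z_2
print(f(1, 1), is_factor_set(f, 2, 4))
```

With $m = n = 2$ the only non-zero value is $f(1, 1) = 2$, and the script prints `2 True`.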

The extension $E$ does not uniquely determine the corresponding function $f$.
An arbitrary set of representatives $u'(h)$ for the elements of $H$ can be expressed
in terms of the given representatives as

$u'(h) = u(h) + g(h)$, each $g(h) \in G$;

they will have an addition table like that of (7.1) with a function $f'$ given by

(7.4) $f'(h, k) = f(h, k) + [g(h) + g(k) - g(h + k)]$.

Conversely, a factor set of $H$ in $G$ is any function $f(h, k)$, with values in $G$ for
$h$, $k$ in $H$, which satisfies the “commutative” and “associative” conditions
(7.2) and (7.3) for all $h$, $k$, and $l$ in $H$. A transformation set is any function of
$h$ and $k$ like the term in brackets in (7.4); thus for any function $g(h)$ defined for
each $h \in H$ and taking on values in $G$, the function

(7.5) $t(h, k) = g(h) + g(k) - g(h + k)$

is a transformation set. Such a set automatically satisfies the conditions (7.2)
and (7.3), hence is always a factor set. Two factor sets $f$ and $f'$ are said to be
associate if their difference is, as in (7.4), a transformation set. The
correspondence between group extensions and factor sets may now be formulated
as follows.

Theorem 7.1. For given groups $G$ and $H$, there is a many-one correspondence
$f \to (E, \beta)$ between the factor sets $f$ of $H$ in $G$ and the group extensions $(E, \beta)$ of $G$
by $H$, where $f \to (E, \beta)$ holds if and only if $f$ is the factor set which appears in one of
the possible “addition tables” (7.1) for $E$. Two factor sets $f$ and $f'$ of $H$ in $G$
determine equivalent group extensions of $G$ by $H$ if and only if they are associate.
In particular, the group extension determined by $f$ is trivial if and only if $f$ is a
transformation set.
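The claim that every transformation set (7.5) automatically satisfies (7.2) and (7.3) can be confirmed exhaustively in a small case. The following sketch assumes $H = Z_3$ and $G = Z_4$, a choice made only for this illustration; `transformation_set` and `satisfies_7_2_and_7_3` are hypothetical names.

```python
from itertools import product

def transformation_set(g, m, n):
    """t(h, k) = g(h) + g(k) - g(h + k) for H = Z_m and G = Z_n; g is a tuple of length m."""
    return lambda h, k: (g[h] + g[k] - g[(h + k) % m]) % n

def satisfies_7_2_and_7_3(f, m, n):
    """Check the commutative law (7.2) and the associative law (7.3) on H = Z_m."""
    H = range(m)
    commutative = all(f(h, k) == f(k, h) for h in H for k in H)
    associative = all(
        (f(h, k) + f((h + k) % m, l)) % n == (f(h, (k + l) % m) + f(k, l)) % n
        for h in H for k in H for l in H
    )
    return commutative and associative

m, n = 3, 4
# Run through every function g: Z_3 -> Z_4 (there are 4**3 = 64 of them).
all_ok = all(
    satisfies_7_2_and_7_3(transformation_set(g, m, n), m, n)
    for g in product(range(n), repeat=m)
)
print(all_ok)
```

The check passes for all 64 functions $g$, as the algebra predicts: both sides of (7.3) reduce to $g(h) + g(k) + g(l) - g(h + k + l)$.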

Proof. As a preliminary, observe that the associative relations (7.3) for
$f$ show (with $k = l = 0$, $h = k = 0$) that $f(0, 0) = f(h, 0) = f(0, l)$. Now,
given $f$, we construct $E_f$ as the group of all pairs $(g, h)$ with addition given by
the rule

$(g_1, h) + (g_2, k) = (g_1 + g_2 + f(h, k), h + k)$,

and the homomorphism $\beta_f$ defined by $\beta_f(g, h) = h$. Since $f(0, 0) = f(0, l)$,
each element $(g, 0)$ may be identified with the corresponding element $g + f(0, 0)$
in $G$; the pair $(E_f, \beta_f)$ is then indeed an extension of $G$ by $H$. As a
representative of $h$ in $E_f$, we may choose $u(h) = (0, h)$; the addition table (7.1) then
involves exactly the original factor set $f$. If $E$ is an arbitrary group extension
of $G$ by $H$ in which $f$ appears as the factor set of $E$, the correspondence
$g + u(h) \to (g, h)$ shows that the extension $E$ is in fact equivalent to the extension $E_f$ just
constructed. Therefore $f \to (E_f, \beta_f)$ is a many-one correspondence with the
defining property stated in the theorem.

If $f$ and $f'$ are associate, as in (7.4), the correspondence

$(g, h) \to (g - g(h), h)'$

shows that the corresponding extensions $E_f$ and $E_{f'}$ are equivalent. Conversely,
the argument leading to (7.4) shows in effect that $E_f$ is equivalent to $E_{f'}$ only
if $f$ is associate to $f'$.

We turn now to two special applications of transformation sets. In the first
place, the representative for the zero element of $H$ may always be chosen as
the zero in $E$. This means that $u'(0) = 0$, $u'(0) + u'(h) = u'(h)$, so that

(7.6) $f'(0, h) = f'(h, 0) = 0$ (all $h \in H$).

A factor set $f'$ with the property (7.6) may be called normalized; we have proved
that every factor set $f$ is associate to a normalized factor set.

Free groups may be characterized in terms of group extensions as follows:

Theorem 7.2. A group $H$ with more than one element is free if and only if
every extension of any group by $H$ is the trivial extension.

Proof. Suppose first that $H$ satisfies the condition that every extension of
every $G$ is trivial. Represent $H$ as $F/R$, where $F$ is free. Then $F$ is a trivial
extension of $R$ by $H$, hence is a direct sum of $R$ and $H$. Therefore $H$, as a
subgroup of the free group $F$, is itself free. The other half of the theorem is stated
in more detail in the following Lemma.

Lemma 7.3. Every factor set $f'$ of a free group $F$ in a group $G$ is a
transformation set, so that

(7.7) $f'(x, y) = \phi(x + y) - \phi(x) - \phi(y)$, $\phi(x) \in G$,

holds for all $x, y \in F$. If $F$ has generators $z_\alpha$, the function $\phi$ may be chosen so that
$\phi(0) = -f'(0, 0)$, $\phi(z_\alpha) = 0$ for each generator $z_\alpha$.

Proof. In the extension $E_{f'}$ of $G$ by $F$ we have an addition table

$u'(x) + u'(y) = u'(x + y) + f'(x, y)$ ($x, y \in F$).

In $E$ we introduce a new set of representatives $u(\sum e_\alpha z_\alpha) = \sum e_\alpha u'(z_\alpha)$ for the
elements $\sum e_\alpha z_\alpha$ of $F$. These are related to the original representatives by an
equation $u(z) = u'(z) + \phi(z)$, where $\phi(z)$ has values in $G$. Because $F$ is a free
group, $z \to u(z)$ as defined is a homomorphism of $F$ into $E$, so that
$u(x + y) = u(x) + u(y)$, and the factor set belonging to $u$ is identically zero. But the given
$f'$ is associate to this zero factor set, as in (7.4). Setting $f = 0$, $\phi = -g$ in
(7.4) gives (7.7), as desired. By construction, $u(z_\alpha) = u'(z_\alpha)$, so $\phi(z_\alpha) = 0$.
Also $u'(0) + u'(0) = u'(0) + f'(0, 0)$, so that $u'(0) = f'(0, 0)$, $u(0) = 0$, and
therefore $\phi(0) = -f'(0, 0)$. This completes the proof.





8. The group of extensions 

For fixed $H$ and $G$ the sum of two factor sets $f_1$ and $f_2$ is a third factor set,
defined as

$(f_1 + f_2)(h, k) = f_1(h, k) + f_2(h, k)$ ($h, k \in H$).

Under this addition, the factor sets and the transformation sets form groups,
denoted respectively by

(8.1) $\mathrm{Fact}\{G, H\}$ = group of all factor sets of $H$ in $G$,

(8.2) $\mathrm{Trans}\{G, H\}$ = group of all transformation sets of $H$ in $G$.

The factor sets belonging to a given group extension $E$ constitute a coset of the
subgroup $\mathrm{Trans}\{G, H\}$, as in (7.4). Hence the correspondence of factor sets
to extensions is a one-one correspondence between cosets of Fact/Trans and
equivalence classes of extensions. This correspondence carries the addition
of factor sets into an addition of group extensions. We are thus led to define
the group of group extensions of $G$ by $H$ as$^{13}$

(8.3) $\mathrm{Ext}\{G, H\} = \mathrm{Fact}\{G, H\}/\mathrm{Trans}\{G, H\}$.

If $H$ is discrete while $G$ is a (generalized) topological group, there will be a
corresponding induced topology on $\mathrm{Ext}\{G, H\}$. For each factor set $f$ is a
function on $H \times H$ with values in $G$, so that $\mathrm{Fact}\{G, H\}$ is a subgroup of the
generalized topological group $G^{H \times H}$ of all such functions. The subgroup “Trans”
and the factor group “Ext” also carry topologies. Much as in §3 one can prove
that if $H$ is discrete and $G$ topological, then $\mathrm{Fact}\{G, H\}$ is a closed subgroup
of $G^{H \times H}$. This proves

Lemma 8.1. If $H$ is discrete and $G$ is a topological (and compact) group, then
$\mathrm{Fact}\{G, H\}$ is a topological (and compact) group.

In general, however, $\mathrm{Trans}\{G, H\}$ will not be closed in $\mathrm{Fact}\{G, H\}$, even
when $G$ is topological. In such cases $\mathrm{Ext}\{G, H\}$ is necessarily a generalized
topological group.

If $(E, \beta)$ is an extension of $G$ by $H$, each subgroup $S \subset H$ determines a
corresponding subgroup $E_S \subset E$, consisting of all $e \in E$ with $\beta(e) \in S$. Since
$E_S \supset G$, we may thus say that $E$ “induces” an extension $(E_S, \beta)$ of $G$ by $S$.
We call an extension $E$ finitely trivial if $E_S$ is trivial for every finite subgroup
$S \subset H$.

Similarly, any factor set $f$ of $H$ in $G$ determines for each subgroup $S \subset H$
a factor set $f_S$ of $S$ in $G$, where $f_S(h, k) = f(h, k)$ for $h$, $k$ in $S$ (i.e., $f_S$ is obtained
by “cutting off” $f$ at $S$). The correspondence between factor sets and group
extensions readily gives

Lemma 8.2. A factor set $f$ of $H$ in $G$ determines a finitely trivial extension of
$G$ by $H$ if and only if, for every finite subgroup $S \subset H$, the factor set $f_S$ “cut off”
at $S$ is a transformation set of $S$ in $G$. Hence the finitely trivial extensions of $G$
by $H$ constitute a subgroup $\mathrm{Ext}_f\{G, H\}$ of $\mathrm{Ext}\{G, H\}$.

$^{13}$ It is possible to define the sum of two group extensions directly, without using the
factor sets (see Baer [2] p. 394); it also is possible to give an analogous definition of the
topology introduced below in $\mathrm{Ext}\{G, H\}$.

9. Group extensions and generators 

A group extension can be described not only by factor sets, but also by certain
homomorphisms related to the generators of the extending group $H$. For let
$(E, \beta)$ be a given extension of $G$ by $H$, and $H = F/R$ a representation of $H$ as
a factor group of a free group $F$. Let $F$ have the generators $z_\alpha$, as in §4; the
corresponding elements (or cosets) $t_\alpha$ of $H$ will then be a set of generators of $H$.
For each generator $t_\alpha$ choose a corresponding representative $u_\alpha$ in the given
group extension $E$, so that $\beta u_\alpha = t_\alpha$. Then $\beta(\sum e_\alpha u_\alpha) = \sum e_\alpha t_\alpha$, so that any
element $\sum e_\alpha t_\alpha \in H$ has a representative of the form $\sum e_\alpha u_\alpha$. This means that
each element of $E$ can be written in the form

$x = g + \sum e_\alpha u_\alpha$, $g \in G$, $e_\alpha$ integers.

From this representation one can at once determine how to add the elements
of $E$. However, this representation is not in general unique, for $(\sum e_\alpha u_\alpha) \in G$
is equivalent to $\sum e_\alpha t_\alpha = 0$, which in turn is equivalent to $(\sum e_\alpha z_\alpha) \in R$. Thus
to each $r = \sum e_\alpha z_\alpha$ in the group $R$ of “relations” there is assigned an element
$\theta(r) \in G$, defined as

$\theta(r) = \theta(\sum e_\alpha z_\alpha) = \sum e_\alpha u_\alpha$.

These assignments $\theta(r)$ completely determine the extension $E$.

The function $\theta$ hereby defined$^{14}$ is a homomorphism of $R$ into $G$. Conversely
every such homomorphism $\theta$ can be used to construct a corresponding group
extension of $G$ by $H = F/R$; it suffices to construct $E$ by reducing the direct
product $F \times G$ modulo the subgroup of all elements of the form $(r, \theta(r))$, for
$r \in R$. There is thus a correspondence between homomorphisms of $R$ into $G$
and extensions of $G$ by $H = F/R$.$^{15}$

10. The connection between homomorphisms and factor sets 

Given $G$ and $H = F/R$, an extension $E$ of $G$ by $H$ may be given either by a
factor set or by a homomorphism of $R$ into $G$. There must therefore be a
relation between factor sets and homomorphisms of this type. We now propose to
establish this relation directly, without using extensions explicitly. (Actually,
the correspondence which we obtain is identical with that obtained by going
from a homomorphism first to the corresponding group extension and then to
its factor set.)

Theorem 10.1. If $H = F/R$ is a factor group of a free group $F$, while $G$ is
any other group, then

(10.1) $\mathrm{Ext}\{G, H\} \cong \mathrm{Hom}\{R, G\}/\mathrm{Hom}\{F \mid R, G\}$.

Under the correspondence which gives this isomorphism

(10.2) $\mathrm{Ext}_f\{G, H\} \cong \mathrm{Hom}_f\{R, G; F\}/\mathrm{Hom}\{F \mid R, G\}$,

(10.3) $\mathrm{Ext}\{G, H\}/\mathrm{Ext}_f\{G, H\} \cong \mathrm{Hom}\{R, G\}/\mathrm{Hom}_f\{R, G; F\}$.

If $G$ is a generalized topological group while $F$ and $H$ are discrete, all these
isomorphisms are bicontinuous.

$^{14}$ Actually $\theta$ may be obtained by “cutting off” one of the homomorphisms $\phi$ as described
in Lemma 4.3.

$^{15}$ This correspondence has been stated by Baer ([2], p. 395) and used by Hall [6].

Proof. As a preliminary, observe that the representation $H = F/R$ means
that the free group $F$ is a group extension of $R$ by $H$. In this extension choose
a representative $u_0(h)$ in $F$ for each $h \in H$. $F$ is then described, as in (7.1),
by an addition table

(10.4) $u_0(h) + u_0(k) = u_0(h + k) + f_0(h, k)$,

where $f_0$ is a factor set of $H$ in $R$. This factor set will be fixed throughout the
proof.

Since $\mathrm{Ext}\{G, H\}$ is defined as Fact/Trans, the required isomorphism (10.1)
could be established by a suitable correspondence of homomorphisms to factor
sets. Let $\theta \in \mathrm{Hom}\{R, G\}$ be given, and define $f_\theta$ by

(10.5) $f_\theta(h, k) = \theta[f_0(h, k)]$ ($h, k \in H$).

The requisite commutative and associative laws (7.2) and (7.3) for $f_\theta$ follow from
those for $f_0$, and the correspondence $\theta \to f_\theta$ is a homomorphism of $\mathrm{Hom}\{R, G\}$
into $\mathrm{Fact}\{G, H\}$, and therefore into $\mathrm{Ext}\{G, H\}$.

Suppose next that $\theta$ can be extended to a homomorphism $\theta^*$ of $F$ into $G$.
This homomorphism applied to (10.4) gives

$\theta^*[f_0(h, k)] = \theta^*[u_0(h)] + \theta^*[u_0(k)] - \theta^*[u_0(h + k)]$.

If we set $g(h) = \theta^*[u_0(h)]$, the result asserts that $\theta^* f_0 = \theta f_0 = f_\theta$ is a
transformation set.

Conversely, suppose that $f_\theta$ is a transformation set, so that
$f_\theta(h, k) = g(h) + g(k) - g(h + k)$ for some function $g$. Now any element in $F$ can be
written, in only one way, in the form $r + u_0(h)$, with $r$ in $R$, $h$ in $H$. We define
$\theta^*(r + u_0(h))$ as $\theta(r) + g(h)$. Clearly $\theta^*$ is an extension of $\theta$; a straightforward
computation with (10.4) shows that $\theta^*$ is actually a homomorphism. In this
case, then, $\theta$ is extendable to $F$.
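The two paragraphs above say that $f_\theta$ is a transformation set exactly when $\theta$ extends to $F$. A minimal sketch of this equivalence in the simplest case follows. It assumes $H = Z_m$, $F$ the integers with generator $z = 1$, $R = mZ$, $G = Z_n$, and representatives $u_0(h) = h$ for $0 \le h < m$, so that $f_0(h, k)$ is $m$ times a carry digit; $\theta$ is determined by $g_0 = \theta(mz)$. The function names are hypothetical and the search is brute force.

```python
from itertools import product

def f_theta(m, n, g0):
    """f_theta(h, k) = theta[f_0(h, k)] as in (10.5), where theta(m) = g0 in G = Z_n."""
    def f(h, k):
        carry = (h + k - (h + k) % m) // m   # f_0(h, k) = carry * m lies in R = mZ
        return (carry * g0) % n
    return f

def is_transformation_set(f, m, n):
    """Search all g: Z_m -> Z_n for one with f(h, k) = g(h) + g(k) - g(h + k)."""
    for g in product(range(n), repeat=m):
        if all(f(h, k) == (g[h] + g[k] - g[(h + k) % m]) % n
               for h in range(m) for k in range(m)):
            return True
    return False

m, n = 2, 4
for g0 in range(n):
    # theta extends to F = Z exactly when g0 = m * a has a solution a in Z_n
    extendable = any((m * a) % n == g0 for a in range(n))
    print(g0, is_transformation_set(f_theta(m, n, g0), m, n), extendable)
```

In each printed row the two truth values agree; for $m = 2$, $n = 4$ they are true for $g_0 = 0, 2$ and false for $g_0 = 1, 3$.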

We know now that the correspondence $\theta \to f_\theta$ is an isomorphism of
$\mathrm{Hom}\{R, G\}/\mathrm{Hom}\{F \mid R, G\}$ into a subgroup of $\mathrm{Ext}\{G, H\}$. It remains to
prove that it is a homomorphism onto. At this juncture we use for the first
time the assumption that $F$ is a free group. Let $E$ be a given extension of $G$
by $H$, with a factor set $f$ which we can assume is normalized, as in (7.6). Let
$\beta_0$ be the given homomorphism of $F$ on $H$. Use $f$ to define a factor set $f'$ of $F$
in $G$ by the equation

(10.6) $f'(x, y) = f(\beta_0 x, \beta_0 y)$, $x, y \in F$.

Since $F$ is free, $f'$ is a transformation set, so we can find, as in Lemma 7.3, a
function $\phi(z)$ on $F$ to $G$ with

(10.7) $\phi(x + y) = \phi(x) + \phi(y) + f'(x, y)$.

In particular, if $x$ and $y$ lie in $R$, $\beta_0 x = \beta_0 y = 0$, and $f'(x, y) = f(0, 0) = 0$,
because $f$ is normalized. Thus $\phi$, restricted to $R$, is a homomorphism
$\theta = \phi \mid R$ of $R$ into $G$. Furthermore, if $\phi$ is applied to the addition table (10.4)
for $F$, the property (10.7) gives

$\phi[u_0(h) + u_0(k)] = \phi[u_0(h + k)] + \phi[f_0(h, k)]$,

where a term $f'(u_0(h + k), f_0(h, k))$, which would have entered by (10.7), is
zero because $f$ is normalized, $f_0(h, k) \in R$, and $\beta_0 f_0(h, k) = 0$. Now compute
$f(h, k)$ for $h$, $k$ in $H$. By (10.6),

$f(h, k) = f'(u_0(h), u_0(k))$

$= \phi[u_0(h) + u_0(k)] - \phi[u_0(h)] - \phi[u_0(k)]$

$= \phi[u_0(h + k)] - \phi[u_0(h)] - \phi[u_0(k)] + \phi[f_0(h, k)]$,

in virtue of the equation displayed just above. This equation asserts that $f$
is associate to the factor set $\phi f_0 = \theta f_0$. In other words, given the normalized
factor set $f$, we have constructed a homomorphism $\theta$ for which $f$ is essentially
$\theta f_0$. This completes the proof of (10.1).

It is desirable to find a more explicit expression for this dependence of $\theta$ on $f$.
A simple induction applied to (10.7) will show that, for $z_i$ in $F$,

$\phi\left(\sum_{i=1}^{n} z_i\right) = \sum_{i=1}^{n} \phi(z_i) + \sum_{k=1}^{n-1} f'\left(\sum_{i=1}^{k} z_i,\ z_{k+1}\right)$.

If $z_i$ is one of the generators $z_\alpha$ of $F$, then $\phi(z_i) = 0$, by Lemma 7.3. If $z_i = -z_\alpha$
is the negative of a generator, then by (10.7)

$\phi(0) = \phi(z_\alpha + (-z_\alpha)) = \phi(z_\alpha) + \phi(-z_\alpha) + f'(z_\alpha, -z_\alpha)$,

so that $\phi(-z_\alpha) = -f'(z_\alpha, -z_\alpha)$. Now any element of $F$ can be written as a
finite linear combination of generators and hence as a sum $\sum x_i$, where each
$x_i$ is either a generator or the negative of a generator $z_\alpha$, and where any given
generator may appear several times in this sum. In particular, for any element
$r = \sum x_i$ in the subgroup $R$, the previous formula for $\phi$ becomes a formula for
$\theta = \phi \mid R$,

(10.8) $\theta\left(\sum_{i=1}^{n} x_i\right) = -{\sum}' f(\beta_0 x_i, -\beta_0 x_i) + \sum_{k=1}^{n-1} f\left(\sum_{i=1}^{k} \beta_0 x_i,\ \beta_0 x_{k+1}\right)$,

where $\beta_0$ is the given homomorphism of $F$ into $H$, and where the sum ${\sum}'$ is
taken over those elements $x_i$ which are the negatives of generators. The
essential feature of this formula is the fact that it expresses $\theta(r)$ for $r \in R$ as a
sum of a finite number of values of the given factor set $f$ of $H$ in $G$.

Now consider the continuity of the correspondence $\theta \to f_\theta$ used to establish
(10.1). It suffices to establish the continuity at $0$. If $U$ is any open set,
containing zero, in $\mathrm{Hom}\{R, G\}/\mathrm{Hom}\{F \mid R, G\}$, there will be an open set $V$
containing $0$ in $G$ and a finite set of elements $r_1, \dots, r_s \in R$ such that $U$ contains
the cosets of all homomorphisms $\theta$ with $\theta(r_i) \in V$, $i = 1, \dots, s$.

For a given $f$, the expressions $\theta(r_i)$ of (10.8) for these elements $r_i$ will involve
but a finite number of elements of the factor set $f$. Because of the continuity
of addition in $G$, we can construct an open set $U'$ in $\mathrm{Fact}\{G, H\}$ such that,
for $f$ in $U'$, each $\theta(r_i)$ does in fact lie in the given $V$. This establishes the continuity of the
correspondence $f \to \theta$. The continuity of the inverse correspondence is obtained
by a similar argument on the definition (10.5) of this correspondence.

It remains only to consider the formulas (10.2) and (10.3) on finitely trivial
extensions. Let $\theta$ and its correspondent $f_\theta$ be given, and let $F_0 \supset R$ be any
subgroup of $F$ for which $F_0/R$ is finite. A previous argument, applied to $F_0$ instead
of $F$, shows that $\theta$ can be extended to a homomorphism of $F_0$ into $G$ if and
only if $f_\theta$, regarded as a factor set for $F_0/R$ in $G$, is a transformation set. But
the subgroup $\mathrm{Hom}_f\{R, G; F\}$ by definition consists of all those $\theta$ which are
extendable to every such $F_0$, while $\mathrm{Ext}_f$ by Lemma 8.2 is obtained from
those factor sets which are transformation sets on every such subgroup $F_0$.
Hence $\mathrm{Hom}_f\{R, G; F\}/\mathrm{Hom}\{F \mid R, G\}$ is the subgroup corresponding to $\mathrm{Ext}_f\{G, H\}$
under $\theta \to f_\theta$. This proves (10.2) and with it (10.3). The continuity of the
isomorphisms in this case follows from the continuity of the isomorphism (10.1).

For subsequent purposes we observe that the correspondence $\theta \to f_\theta$ obtained
in this proof is essentially independent of the choice of the fixed factor set $f_0$
for $H$ in $R$. Specifically, if $f_0$ is replaced by an associate factor set $f_0'$, $f_\theta$ will be
replaced also by an associate factor set, so that the corresponding element of
$\mathrm{Ext}\{G, H\}$ is not altered.


11. Applications 

The representation of Ext {G, H} as Horn {R, G}/Hom {F | R> G} gives an 
immediate proof of the invariance of the latter group, as stated in Theorem 
4.2 of Chapter I. There are a number of other simple corollaries. 

Corollary 11.1. For a direct product H × H',

(11.1) Ext {G, H × H'} ≅ Ext {G, H} × Ext {G, H'}.

If G is a generalized topological group, the isomorphism is bicontinuous.

Proof. If H = F/R and H' = F'/R', we may write H × H' =
(F × F')/(R × R'), where F × F', like F and F', is free. Each homomorphism
of R × R' into G determines homomorphisms θ and θ' of the subgroups R and
R' into G, and this correspondence yields a (bicontinuous) isomorphism

Hom {R × R', G} ≅ Hom {R, G} × Hom {R', G}.





Furthermore, under the same correspondence

Hom {(F × F') | (R × R'), G} ≅ Hom {F | R, G} × Hom {F' | R', G}.

These two relations yield a corresponding isomorphism between the respective
factor groups such as Hom {R, G}/Hom {F | R, G}. By the fundamental
theorem, the latter isomorphism is the one asserted in (11.1).

This conclusion can also be established without using homomorphisms, by a
direct argument like that of Lemma 7.2. (Choose new representatives in E
for elements of H × H' by setting u'(hh') = u(h)u(h').) Another simple
argument directly with the factor sets will give a companion “direct product”
representation,

(11.2) Ext {G × G', H} ≅ Ext {G, H} × Ext {G', H};

this isomorphism is also bicontinuous.

Corollary 11.2. If H is a cyclic group of order m, then

(11.3) Ext {G, H} ≅ G/mG, (mG = all mg, for g ∈ G).

This isomorphism is also bicontinuous.

This is a well known result, which can be derived directly from our main
theorem. The cyclic group H can be written as H = F/R, where F is an
infinite cyclic group with generator z, R the subgroup generated by mz. Then
any θ ∈ Hom {R, G} is uniquely determined by the image θ(mz) = h of the
generator mz. This correspondence θ → h (mod mG) gives the isomorphism
(11.3).

A similar representation can be found for any finite abelian group H, simply
by representing H as a direct product of cyclic groups of orders m_i, i = 1, ..., t.
By Corollary 11.1, Ext {G, H} is then isomorphic to the direct product of the
groups G/m_iG. A similar decomposition applies if the abelian group H has a
finite number of generators. The result may be stated as follows.

Corollary 11.3. If H has a finite number of generators, and T is the subgroup
of all elements of finite order in H, then Ext {G, H} ≅ Ext {G, T}, algebraically
and topologically. The latter group is a direct product of groups of the form G/mG.
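
Corollaries 11.2 and 11.3 can be checked numerically in a small case. The following is a minimal sketch, not part of the original argument: it assumes G is the cyclic group of integers modulo n (a choice made only for illustration), enumerates the cosets of mG, and multiplies the orders of the groups G/m_iG. The function names are hypothetical.

```python
def quotient_mod_m(n, m):
    """Cosets of mG in G = Z/n (written additively), as frozensets of residues."""
    # mG = all m*g for g in G
    m_g = frozenset((m * g) % n for g in range(n))
    # each coset g + mG, collected into a set so that repeats collapse
    return {frozenset((g + x) % n for x in m_g) for g in range(n)}


def ext_order(n, orders):
    """Order of the direct product of the groups G/m_i G, for G = Z/n.

    By Corollaries 11.1 and 11.2 this is the order of Ext {G, H} when H is a
    direct product of cyclic groups of the given orders m_i.
    """
    total = 1
    for m in orders:
        total *= len(quotient_mod_m(n, m))
    return total
```

For G cyclic of order n, each factor G/mG has order equal to the greatest common divisor of m and n, so `ext_order(12, [8, 9])` multiplies a factor of order 4 by a factor of order 3.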

Theorem 7.2 (extensions by a free group are trivial) has an analogue for
infinitely divisible groups. Recall that G is infinitely divisible if for each g ∈ G
and each integer m ≠ 0 the equation mx = g has a solution x ∈ G.

Corollary 11.4. A group G is infinitely divisible if and only if every extension
of G by any group is the trivial extension.

Proof. If G is not infinitely divisible, some G/mG ≠ 0, so that there will
be a non-trivial extension of G by a cyclic group, as in Corollary 11.2.
Conversely, suppose G is infinitely divisible. If R ⊂ F are groups, a transfinite
induction will show that every homomorphism of R into G can be extended to a
homomorphism into G of the larger group F. Therefore the subgroup
Hom {F | R, G} exhausts the group Hom {R, G}, and Ext {G, F/R} = 0.

Corollary 11.5. If T is the subgroup of all elements of finite order in H, then





(11.4) Ext {G, H}/Ext_f {G, H} ≅ Ext {G, T}/Ext_f {G, T}.

This isomorphism is bicontinuous (if G is a generalized topological group).

Proof. In the representation H = F/R, let F_T denote the set of all elements
of F of finite order modulo R. The group T then has the representation
T = F_T/R, while F_T, as a subgroup of a free group, is itself free. Now the
group Hom_f {R, G; F} by definition consists of all homomorphisms extendable
to subgroups of F finite over R; as these subgroups are all contained in F_T,
the group Hom_f is identical with Hom_f {R, G; F_T}. If both factor groups in
(11.4) are now represented by groups of homomorphisms, as in (10.3), the result
is immediate.

Observe that when T has only elements of finite order, the group Ext_f {G, T},
though it consists of extensions E of G by T trivial on every finite subgroup
of T, can contain non-trivial extensions. This is illustrated by the following
example. Let p be a prime, and G a group with generators g, h_1, h_2, ... and
relations p^i h_i = g, for i = 1, 2, .... In this group G the intersection of all the
subgroups p^i G is the group generated by g alone. Let T be the group of all
rational numbers of the form a/p^i, reduced modulo 1. Then all elements of T
have finite order, and T may be written as T = F/R, where F is a free group
with generators z_1, z_2, ..., and R the free subgroup generated by pz_1,
pz_2 − z_1, pz_3 − z_2, .... (The homomorphism F → T maps z_i into 1/p^i.)

To prove Ext_f {G, T} ≠ 0 it suffices to find a θ ∈ Hom_f {R, G; F} which
is not in Hom {F | R, G}. Such a θ is determined by setting θ(pz_1) = g,
θ(pz_{i+1} − z_i) = 0, i = 1, 2, .... The definition θ*(z_{n−j}) = p^j h_n will provide
an extension θ* of θ to the finite subgroup of F generated by z_1, ..., z_n.
However, suppose that θ had an extension φ to F. Then φ(pz_{i+1}) = φ(z_i), so that
φ(z_1) = p^n φ(z_{n+1}) for every n. This means that φ(z_1) is in every subgroup p^n G,
hence has the form eg for an integer e. But then g = θ(pz_1) = pφ(z_1) = epg
gives a contradiction. Therefore Ext_f {G, T} ≠ 0 in this case. However, if
G has no elements of finite order, one can prove easily that Ext_f {G, T} = 0,
using Lemma 5.1 (see §17 below).

For several types of topological groups G, §5 gives information on the
topology of the various relevant subgroups of Hom {R, G}. By the main theorem,
the conclusions of Lemmas 5.3, 5.4, and 5.5 can now be rewritten as conclusions
about the topology of Ext {G, H}, as follows.

Corollary 11.6. If H is discrete and G a generalized topological group, the
closure of the zero element in the generalized topological group Ext {G, H} contains
Ext_f {G, H}. If, in addition, every subgroup mG is closed in G, for m = 2, 3, ...,
then Ext_f {G, H} is closed in Ext {G, H}.

In particular, if H has no elements of finite order, then every extension of G
by H is trivial on (the non-existent) finite subgroups of H, consequently
Ext_f {G, H} = Ext {G, H} and the closure of 0 is the whole group Ext {G, H}.
This means that Ext {G, H} carries the “trivial” (generalized) topology in which
the only open sets are the whole group and the empty set.





Corollary 11.7. If H is discrete and G compact and topological, then
Ext_f {G, H} = 0 and Ext {G, H} is itself a compact topological group.

This conclusion is obtained from Lemma 8.1 and from Lemma 5.4.

12. Natural homomorphisms 

The basic homomorphism η(θ) = f_θ mapping elements θ of Hom {R, G} into
factor sets f_θ, as in Theorem 10.1, is a “natural” one. Specifically, this means
that the application of η “commutes” with the application of any homomorphism
T to the free group F and its subgroup R. To state this more precisely, we
need to consider first the homomorphisms which T induces on the groups
Hom {R, G} and Ext {G, H}.

Let F' be a free group with subgroup R', T a homomorphism z' → Tz' of F'
into the free group F such that T(R') ⊂ R. T induces a homomorphism of
H' = F'/R' into H = F/R. This induced homomorphism will be written with
the same letter T, so that T(g + R') = Tg + R, for any coset g + R'.

Now consider θ ∈ Hom {R, G}. Clearly the product θ' = θT is an element
of Hom {R', G}, and the correspondence θ → θ' is a homomorphism T*_h of
Hom {R, G} into Hom {R', G}. Furthermore θ ∈ Hom {F | R, G} implies
θT ∈ Hom {F' | R', G}, so that T*_h also induces a homomorphism T*_h,

(12.1) T*_h : Hom {R, G}/Hom {F | R, G} → Hom {R', G}/Hom {F' | R', G}.

Similarly, consider f ∈ Fact {G, H}. The function f' defined by

f'(h', k') = f(Th', Tk') (h', k' ∈ H')

is a factor set of H' in G, and the correspondence f → f' is a homomorphism
T*_e of Fact {G, H} into Fact {G, H'}. Furthermore, f ∈ Trans {G, H} implies
f' ∈ Trans {G, H'}, so that T*_e also induces a homomorphism T*_e for the
corresponding factor groups Ext = Fact/Trans,

(12.2) T*_e : Ext {G, H} → Ext {G, H'}.

By the (dual) homomorphisms induced on Ext or Hom by T we always mean
these homomorphisms T*_e and T*_h.

Theorem 12.1. Let T be a homomorphism of F' into F with T(R') ⊂ R,
where F ⊃ R and F' ⊃ R' are free groups, while η (or η') is the homomorphism of
Hom {R, G} onto Ext {G, F/R} established in the proof of Theorem 10.1. Then

(12.3) η'T*_h = T*_e η,

where T*_h, T*_e are the appropriate homomorphisms induced by T on Hom and Ext,
respectively.

Proof. The figure involved is

    Hom {R, G}   --η-->   Ext {G, F/R}
        | T*_h                | T*_e
        v                     v
    Hom {R', G}  --η'-->  Ext {G, F'/R'}





The correspondence η was constructed from a factor set f_0 for F as an extension
of R; similarly, η' is based on a factor set f'_0 for H' in R', such that

(12.4) u'_0(h') + u'_0(k') = u'_0(h' + k') + f'_0(h', k'),

where u'_0(h') is a representative of h' ∈ H' in F'. First we determine the
relation between f_0 and f'_0. The given homomorphism T carries F' into F, H'
into H and thus u'_0(h') into Tu'_0(h'), a representative in F of Th' in H. This
representative will differ from the given representative u_0(Th') by an element
of R, so that

Tu'_0(h') = u_0(Th') + ρ(h') (all h' in H'),

where each ρ(h') lies in R. Now the representatives Tu'_0(h') will add with a
factor set Tf'_0(h', k'), as may be seen by applying T to both sides of (12.4).
This factor set is associate (in the group TH') to the originally given factor
set f_0 of H ⊃ TH'; explicitly we have, by the argument leading to (7.4), that

Tf'_0(h', k') = f_0(Th', Tk') + [ρ(h') + ρ(k') − ρ(h' + k')].

Suppose now that θ ∈ Hom {R, G} is given. Application of η and then T*_e
will give, by the definitions of these correspondences, a factor set f', with

f'(h', k') = θ[f_0(Th', Tk')]

           = θT[f'_0(h', k')] + [θρ(h' + k') − θρ(h') − θρ(k')].

On the other hand, application of T*_h and then η' will give, again by the
appropriate definitions, a factor set f* with

f*(h', k') = θ'[f'_0(h', k')] = θT[f'_0(h', k')].

Since θρ(h') is an element in G for each h' ∈ H', these two equations show that
f* and f' are associate, hence that f' = T*_e ηθ and f* = η'T*_h θ do determine the
same element of Ext {G, H'}, as asserted in the theorem.

Chapter III. Extensions of Special Groups 

In this chapter we shall determine Ext {G, H} more explicitly for various
special groups G and H. We begin with a brief review of the theory of
characters, which will be used extensively in this chapter and also in Chapters V
and VI.

13. Characters¹⁶

Let G, H, and J be three generalized topological groups. G and H are said
to be paired to J if a continuous function¹⁷ φ(g, h) with values in J is given

¹⁶ The character theory was discovered by Pontrjagin (see [8]), generalized by van
Kampen (see Weil [12], Ch. VI and Lefschetz [7], Ch. II).

¹⁷ As a mapping G × H → J; for discussion of pairing, cf. [8], [14].





such that for any fixed g_0, φ(g_0, h) is a homomorphism of H into J and for
any fixed h_0, φ(g, h_0) is a homomorphism of G into J.

Each subset A ⊂ G determines a corresponding subset Annih A ⊂ H, called
the annihilator of A, such that h ∈ Annih A if and only if φ(g, h) = 0 for all
g ∈ A. Annihilators of subsets of H are defined similarly. It is clear that the
annihilators are subgroups.

Lemma 13.1. If G and H are paired to a topological group J, then for each
A ⊂ G, Annih A is a closed subgroup of H.

This is an immediate consequence of the continuity of φ(g, h) for fixed g.

G and H are said to be dually paired to J if they are so paired that 

Annih G = 0 and Annih H = 0. 

Lemma 13.2. If G and H are paired to J then G/Annih H and H/Annih G
are dually paired to J.

The most important group pairings arise when J = P is the additive group
of reals reduced modulo 1. A homomorphism of a group G into P will be called
a character and the group Hom {G, P} will be written as Char G. Since P has
no “arbitrarily small” subgroups, it follows from a remark in §3 that if G is
compact, Char G is discrete. Vice versa, by Corollary 3.2, if G is discrete,
Char G is compact and topological.

The basic lemma of the theory of characters is 

Lemma 13.3. Let G be a discrete or compact topological group and let g_0 ≠ 0
be an element of G. There is then a character θ ∈ Char G such that θ(g_0) ≠ 0.

In the case of discrete G the lemma follows easily from the proof of Corollary 
11.4, since P is infinitely divisible. In the compact case the proof is much less 
elementary and uses the theory of invariant integration in compact groups. 

The lemma can be equivalently formulated as follows: 

Lemma 13.4. Let G be a discrete or compact topological group. G and Char G
are dually paired to P with the multiplication

φ(g, θ) = θ(g), g ∈ G, θ ∈ Char G.

Now let G and H be paired to P with φ(g, h) as multiplication. Since, for a
fixed g, φ(g, h) is a character of H and, for fixed h, φ(g, h) is a character of G,
we obtain induced mappings

(13.1) G → Char H, H → Char G.

A basic result of the character theory is 

Theorem 13.5. Let the compact topological group G and the discrete group H
be paired to P. The pairing is dual if and only if the induced mappings (13.1)
are isomorphisms:

G ≅ Char H and H ≅ Char G.

The following two theorems are consequences of the previous results:

Theorem 13.6. If G is a discrete or a compact topological group, then





Char Char G ≅ G.

Theorem 13.7. If the compact topological group G and the discrete group H
are dually paired to P, then for every closed subgroup G_1 of G and every subgroup
H_1 of H we have

Annih [Annih G_1] = G_1, Annih [Annih H_1] = H_1.
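
The statements of this section can be illustrated on a finite cyclic group, which is both discrete and compact. The sketch below is only an illustration under stated assumptions: G is taken to be the integers modulo n, its characters are labelled by residues k modulo n, and the pairing is φ(g, k) = gk/n reduced modulo 1. The function names are hypothetical.

```python
from fractions import Fraction


def pairing(g, k, n):
    """phi(g, k) = g*k/n reduced modulo 1, for g in Z/n and a character label k."""
    return Fraction(g * k, n) % 1


def annihilator(subset, n):
    """All labels k in Z/n with phi(g, k) = 0 for every g in the given subset.

    The pairing is symmetric in g and k, so the same function computes
    annihilators of subsets of either group.
    """
    return {k for k in range(n) if all(pairing(g, k, n) == 0 for g in subset)}
```

With n = 12, the annihilator of the whole group is zero (the pairing is dual), and applying `annihilator` twice to the subgroup {0, 4, 8} returns that subgroup, as Theorem 13.7 asserts.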

14. Modular traces 

To study Ext {G, H} for compact G we need a certain modification of the
“trace” of an endomorphism of a free group. The simplest case of this
modification refers to a correspondence which is not a homomorphism, but is a
homomorphism, modulo m-folds of elements. It may be stated as follows.

Lemma 14.1. Let m be an integer, and let r → S(r) be a correspondence carrying
the free group R into a finite subset of itself in such manner that

(14.1) S(r_1 + r_2) ≡ S(r_1) + S(r_2) (mod mR),

for all r_1, r_2 ∈ R. Let the elements y_α be any independent basis for R, and write
S(y_α) = Σ_β c_{αβ} y_β, with integral coefficients c_{αβ}. Then the “trace”

(14.2) t_m(S) ≡ Σ_α c_{αα} (mod m)

is a well defined finite integer, modulo m, independent of the choice of the basis
y_α for R.

The proof is exactly parallel to the standard one (e.g. [1], p. 569) for an actual
homomorphism of R to itself, using the “modular” homomorphism condition
(14.1) at the appropriate junctures in place of the full homomorphism
condition. A similar analogue of a special case of the “additivity” of traces will give
the following conclusion.

Lemma 14.2. If in Lemma 14.1 the elements w_1, ..., w_t are any independent
elements of R such that S(R) lies in the group generated by w_1, ..., w_t, and if
S(w_i) = Σ_j d_{ij} w_j, then t_m(S) ≡ Σ_i d_{ii} (mod m).
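
The basis independence in Lemma 14.1 can be seen in a small computation. The following is a sketch under illustrative assumptions, not the proof cited above: R is taken to be free of rank 3, S is given by a hypothetical coefficient matrix `c`, the basis is changed by a unimodular matrix `u`, and an arbitrary multiple of m (the matrix `e`) is added to model the fact that (14.1) holds only modulo mR.

```python
def matmul(a, b):
    """Product of two square integer matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def trace_mod(c, m):
    """t_m(S): the sum of the diagonal coefficients c[a][a], reduced modulo m."""
    return sum(c[i][i] for i in range(len(c))) % m


m = 4
# Coefficients of S(y_a) = sum_b c[a][b] * y_b on a basis y_1, y_2, y_3.
c = [[2, 5, 1], [0, 7, 3], [4, 1, 6]]
# Unimodular change of basis y'_a = sum_b u[a][b] * y_b, with its inverse.
u = [[1, 2, 0], [1, 3, 0], [0, 0, 1]]
u_inv = [[3, -2, 0], [-1, 1, 0], [0, 0, 1]]
# Arbitrary error term: on the new basis the coefficients are only
# determined up to multiples of m.
e = [[1, 0, 2], [3, 1, 0], [0, 5, 2]]
conj = matmul(matmul(u, c), u_inv)
c_new = [[conj[i][j] + m * e[i][j] for j in range(3)] for i in range(3)]
```

Here `trace_mod(c_new, m)` agrees with `trace_mod(c, m)`: conjugation leaves the ordinary trace unchanged, and the error term contributes only a multiple of m.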

Now let R be a subgroup of the free group F, σ a homomorphism of R into a
finite subgroup of F/R. There will then be at least one integer m for which
mσ(R) = 0. Choose for each coset u of F/R a representative ρ(u) in F; then
ρ(u + v) ≡ ρ(u) + ρ(v) (mod R). For each r ∈ R, m(ρσr) is also an element
of R, and S(r) = m(ρσr), where

R --σ--> F/R --ρ--> F --m--> R,

is a correspondence of R to R with the modular homomorphism property (14.1).¹⁸
The trace of the original homomorphism σ is now defined as

(14.3) t(σ) ≡ t_m(S)/m = t_m(mρσ)/m (mod 1).

¹⁸ S could also be described in terms of m and σ as follows: S is the essentially unique
correspondence of R to a finite subset of mF ∩ R with property (14.1) and such that each
σ(r) is the coset modulo R of S(r)/m.





Theorem 14.3. If R ⊂ F, F a free group, and if σ is any homomorphism of R
into a finite subgroup of F/R, then the trace t(σ) defined by (14.3) is a unique real
number, modulo 1, independent of the choices of m and ρ made in its definition.
If σ_1 and σ_2 are two such homomorphisms of R to F/R,

(14.4) t(σ_1 + σ_2) ≡ t(σ_1) + t(σ_2) (mod 1).

In particular, t(0) ≡ 0 (mod 1). Furthermore, if T_0 is a fixed finite subgroup of
F/R, the correspondence σ → t(σ) is a continuous homomorphism of Hom {R, T_0}
into the reals modulo 1.

We are to prove the invariance of the definition of t. First, hold ρ fixed and
replace m by a proper multiple m' = km. Then S and t_m(S) are both
multiplied by k, hence t'(σ) = t_{km}(kS)/km = kt_m(S)/km = t(σ) is unaltered, mod 1.
Now hold m fixed and let ρ' be any second set of representatives ρ'(u) for cosets
u ∈ F/R. Then ρ'(u) ≡ ρ(u) (mod R), so S'(r) ≡ S(r) (mod mR), which
implies that t_m(S') ≡ t_m(S) (mod m). This shows that the trace is independent
of ρ and m.

The additive property (14.4) is readily established; it is only necessary to
choose a single integer m in such a way that both mσ_1R and mσ_2R are zero.
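
The definition (14.3) and the two properties just proved can be traced through in the simplest case. The sketch below is an illustration under assumptions not in the text: F is the infinite cyclic group of integers, R = nF, and σ is the homomorphism of R into F/R with σ(n) equal to k times the coset of 1. The representatives are ρ(i + R) = i for 0 ≤ i < n, and the function name is hypothetical.

```python
from fractions import Fraction


def trace(n, k, m):
    """t(sigma) for F = Z, R = nZ, sigma(n) = k mod n, with multiplier m.

    m must satisfy m * sigma(R) = 0; here any multiple of n will do.
    """
    assert m % n == 0
    rep = k % n   # rho(sigma(y)) for the single basis element y = n of R
    s = m * rep   # S(y) = m * rho(sigma(y)), an element of R = nZ
    c = s // n    # coefficient of S(y) on the basis y, so S(y) = c * y
    return Fraction(c % m, m) % 1   # t_m(S) / m, reduced modulo 1
```

In this case the trace is k/n modulo 1, whichever admissible m is used, and it is additive in k. It is nonzero exactly when σ(n) ≠ 0, which is the situation of Lemma 14.5.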

Before establishing the continuity of t(σ), we propose a more explicit
representation of the finiteness of t(σ). Let T_0 be a fixed finite subgroup of F/R,
and choose a direct summand F_0 of F with a finite number of generators such
that F_0/(F_0 ∩ R) contains T_0. We can choose simultaneously ([1], p. 566) a basis
z_1, ..., z_n for F_0 and a basis y_1, ..., y_s for F_0 ∩ R so that y_i = d_i z_i, for integers
d_i, i = 1, ..., s ≤ n. Furthermore, one can prove F_0 ∩ R a direct summand
of R; there is then a (not necessarily denumerable) basis for R of the form
y_1, ..., y_s, y_α, y_β, .... In particular, if σ(R) ⊂ T_0, we may choose ρ(0) = 0,
ρ(T_0) ⊂ F_0, hence S(R) = mρσ(R) ⊂ F_0 ∩ R. The equations for S and its
trace then take the form

(14.5) S(y_γ) = Σ_{i=1}^{s} c_{γi} y_i, t_m(S) ≡ Σ_{i=1}^{s} c_{ii} (mod m),

where γ = 1, 2, ..., s, α, β, ....

To prove t(σ) continuous it suffices to establish the continuity at σ = 0, and
hence to prove that t(σ) ≡ 0 for σ in a suitable neighborhood U of 0 in
Hom {R, T_0}. Let U be the open set in Hom {R, T_0} consisting of all σ with
σ(y_1) = ... = σ(y_s) = 0, where y_1, ..., y_s is the special basis constructed from F_0 above.
Then, because ρ(0) = 0, we have S(y_i) = 0, t_m(S) ≡ 0 (mod m), and therefore
t(σ) ≡ 0 (mod 1) for σ in U.

We next consider circumstances under which the traces will vanish. 

Lemma 14.4. If σ ∈ Hom {R, F/R} has an extension σ* which carries F
homomorphically into a finite subgroup T_0 of F/R, then t(σ) ≡ 0 (mod 1).

Proof. For T_0 we choose y_i = d_i z_i as above, and then select ρ with ρ(T_0) ⊂ F_0
and m with mT_0 = 0 and m ≡ 0 (mod d_i) for each i. Then, for suitable integers e_{ij},

ρσ*(z_i) = Σ_{j=1}^{n} e_{ij} z_j, i = 1, ..., n;





furthermore ρσ*(kz_i) ≡ kρσ*(z_i) (mod R_0), for any integer k. But S(y_i) =
mρσ(y_i) = mρσ*(d_i z_i) ≡ md_i ρσ*(z_i) (mod mR_0). Then computing t_m(S) by
(14.5) and using the fact that m ≡ 0 (mod d_j) for each j, we find that t_m(S) ≡
m Σ_i e_{ii} ≡ 0 (mod m), as asserted.

Conversely, we can find certain circumstances in which the trace will assuredly 
not vanish. 

Lemma 14.5. If z ∈ F has order n, modulo R, and if σ is a homomorphism of R
into the subgroup of F/R generated by the coset of z, then σ(nz) ≠ 0 implies t(σ) ≢ 0
(mod 1).

Proof. Let u denote the coset of z, modulo R. Choose the system of
representatives so that ρ(iu) = iz, for i = 0, ..., n − 1, and use n as the integer m
in the definition of the trace. Then S = mρσ carries R into the cyclic subgroup
generated by mz. Since σ(nz) = ku, where k ≢ 0 (mod m), S(nz) = knz,
and the trace, as computed by Lemma 14.2, is t_m(S) ≡ k ≢ 0 (mod m), as
asserted.


15. Extensions of compact groups 

The group of extensions of a compact topological group G can be expressed 
as an appropriate character group. 

Theorem 15.1. If G is compact and topological, H discrete, then Ext_f {G, H} =
0 and there is a (bicontinuous) isomorphism:

(15.1) Ext {G, H} ≅ Char Hom {G, H}.

If G_0 is the component of 0 in G and T the subgroup of all elements of finite order
in H, then also

Ext {G, H} ≅ Char Hom {G, T} ≅ Char Hom {G/G_0, T}.

The last conclusion follows at once from the first, for Hom {G, H} includes
only continuous homomorphisms φ of the compact group G; every such
homomorphism must map the connected subgroup G_0 into 0. Furthermore each φ
carries G into a finite subgroup of the discrete group H, hence into a subgroup
of T. Observe also that H is discrete, hence has no arbitrarily small subgroups;
therefore (cf. §3) Hom {G, H} is discrete, as should be the case for a character
group of the compact group Ext {G, H}.

It remains to prove (15.1). Represent H as F/R; then, according to the
fundamental theorem of Chapter II, (15.1) is equivalent to

(15.2) Hom {R, G}/Hom {F | R, G} ≅ Char Hom {G, F/R}.

According to Theorem 13.5 it will thus suffice to provide a suitable pairing of
the compact group Hom {R, G} and the discrete group Hom {G, F/R} to the
reals modulo 1. To this end, take any θ ∈ Hom {R, G} and φ ∈ Hom {G, F/R}.
As just above, φ(G) is a finite subgroup of F/R. Therefore σ = φθ is a
homomorphism of R into a finite subgroup of F/R, so that the trace introduced in the
previous section can be used to define





(15.3) t(θ, φ) ≡ t(φθ) (mod 1).

We propose to show that this is the requisite pairing.

In the first place, this product is additive, for

t(θ + θ', φ) ≡ t(θ, φ) + t(θ', φ) (mod 1),

t(θ, φ + φ') ≡ t(θ, φ) + t(θ, φ') (mod 1)

follow from the corresponding property (14.4) for σ = φθ. Secondly, if φ is
fixed, the product t(θ, φ) is continuous in θ. For when φ is fixed, σ = φθ maps R
into a fixed finite subgroup of F/R. Since θ → φθ = σ is continuous, and since
σ → t(σ) is continuous, by Theorem 14.3, the continuity of t(θ, φ) follows.

As to the annihilators under this pairing, we assert that

(15.4) Annih Hom {G, F/R} = Hom {F | R, G}.

For suppose first that θ ∈ Hom {F | R, G} and let θ* be an extension of θ to F.
Then σ* = φθ* is an extension of σ = φθ to F, and σ* still carries F into (the
same) finite subgroup of F/R. Therefore, by Lemma 14.4, t(θ, φ) ≡ t(σ) ≡ 0
(mod 1). Hence θ is in the annihilator in question.

Conversely, let θ be fixed, and suppose that t(θ, φ) ≡ 0 (mod 1) for every φ;
we must show that then θ ∈ Hom {F | R, G}. Since G is compact and topological,
it will suffice by Lemma 5.4 to prove that θ ∈ Hom_f {R, G; F}. If this were not
the case, there would be in F an element z of some order n, modulo R, such that
θ(nz) = g_0 is not an element of nG. But nG is a continuous image (under
g → ng) of the compact group G, hence (Lemma 1.1) is a closed subgroup of G;
therefore G/nG is compact and topological. By Lemma 13.3 there is then a
character χ of G/nG with χ(g'_0) ≠ 0, where g'_0 is the coset of g_0 modulo nG.
Since every coset of G/nG has as order some divisor of n, this character χ
carries G/nG into the group generated by the fraction 1/n, modulo 1. This
is a cyclic group of order n, and so can be replaced by the isomorphic cyclic
group of order n generated by the coset z' of z in F/R. The so-modified
character χ of G/nG then induces a continuous homomorphism φ of G into F/R,
where

φ(g_0) ≠ 0, φ(G) ⊂ [0, z', 2z', ..., (n − 1)z'].

For this particular φ, the homomorphism σ = φθ carries nz into φθ(nz) =
φ(g_0) ≠ 0. Lemma 14.5 of the previous section then shows that t(σ) =
t(θ, φ) ≢ 0 (mod 1), contrary to the assumption t(θ, φ) ≡ 0 for every φ.
Therefore θ does lie in Hom {F | R, G}, and (15.4) is proved.

Finally, we assert that, under the pairing t,

(15.5) Annih Hom {R, G} = 0.

For suppose instead that t(θ, φ) ≡ 0 (mod 1) for all θ and for some φ ≠ 0. Then
for some g_0 ∈ G, φ(g_0) = u ≠ 0. The element u of F/R is the coset of some
element w of F; as before, φ maps G into a finite subgroup of F/R, so that w





has a finite order m, modulo R. It is then possible to select in the free group F
an independent basis with a first element z_0 such that w = kz_0 for some integer k.
If z_0 has order n, modulo R, there is then a corresponding basis for R of elements
y_α, with y_0 = nz_0. Now construct θ ∈ Hom {R, G} by setting

θ(y_0) = g_0, θ(y_α) = 0, y_α ≠ y_0.

This particular homomorphism carries R into the subgroup of G generated by
g_0, so that the product σ = φθ carries R into the finite subgroup of F/R generated
by φ(g_0) = u. Since u is the coset of w = kz_0, this is contained in the
subgroup of F/R generated by the coset of z_0. Furthermore σ(nz_0) = u ≠ 0.
Lemma 14.5 again applies to show that t(σ) = t(θ, φ) ≢ 0 (mod 1), counter to
assumption.

Given the assertions (15.4) and (15.5) as to annihilators, it follows from
Lemma 13.2 that the groups Hom {R, G}/Hom {F | R, G} and Hom {G, F/R}
are dually paired. Formula (15.2) is then a consequence of Theorem 13.5.

16. Two lemmas on homomorphisms 

A generalized topological group G is said to have no arbitrarily small sub¬ 
groups if there is in G an open set V containing 0 but containing no subgroups 
other than the group consisting of 0 alone. 

Lemma 16.1. If the discrete group T has no elements of infinite order and the
generalized topological group G has no arbitrarily small subgroups, while G_0 is
the same group with the discrete topology, then Hom {T, G} and Hom {T, G_0}
have the same topology.

Proof. Hom {T, G} and Hom {T, G_0} are algebraically identical. The
hypotheses on T insure that every finite set of elements of T generates a finite
subgroup of T. A complete set of neighborhoods U of 0 in Hom {T, G} may
therefore be found thus: take a finite subgroup T_0 ⊂ T and an open set V_0 in G
containing 0, and let U consist of all homomorphisms θ with θ(T_0) ⊂ V_0. In
particular, if V_0 is contained in the special open set V of G which contains no
proper subgroups, the subgroup θ(T_0) is zero, so that U consists of all θ with
θ(T_0) = 0. The special sets U so described also form a complete set of
neighborhoods of 0 in Hom {T, G_0}. Therefore the two topologies on the group are
equivalent.

Lemma 16.2. Let F ⊃ R be a free (discrete) group, G' ⊃ G a discrete group, while
Hom {F, G'; R, G} denotes the set of all homomorphisms φ ∈ Hom {F, G'} with
φ(R) ⊂ G. Then

(16.1) Hom {F, G'; R, G}/Hom {F, G} ≅ Hom {F/R, G'/G}.

Proof. Any homomorphism of F/R into G'/G may be regarded as a
homomorphism of F into G'/G which carries R into zero (Lemma 3.3), so that (16.1)
becomes

(16.2) Hom {F, G'; R, G}/Hom {F, G} ≅ Hom {F, G'/G; R, 0}.





For each φ ∈ Hom {F, G'} let φ* be the corresponding homomorphism
reduced modulo G, so that for x ∈ F, φ*(x) is the coset of φ(x), modulo G. The
correspondence φ → φ* is a homomorphism mapping Hom {F, G'; R, G} into
Hom {F, G'/G; R, 0}. Furthermore φ* = 0 if and only if φ(F) ⊂ G,
or φ ∈ Hom {F, G}. Therefore φ → φ* provides an (algebraic) isomorphism of
the left hand group in (16.2) to a subgroup of the right hand group.

Conversely, select a fixed basis z_α for the free group F, and for each coset
b ∈ G'/G pick a fixed representative element ρ(b) in G'. For given σ ∈ Hom {F,
G'/G; R, 0}, define a corresponding homomorphism φ = φ(σ), for any x =
Σ_α k_α z_α ∈ F, as

φ(Σ_α k_α z_α) = Σ_α k_α ρ[σz_α].

This is a homomorphism of F into G'. By construction, ρ[σz_α] modulo G is
just σz_α, hence φ(x), modulo G, is σ(x), or φ* = σ. This implies that φ(R) ⊂ G,
and so that each σ is the correspondent of some φ in the homomorphism φ → φ*.

To show (16.2) bicontinuous, we first analyze the topology in the groups
involved. By the definition of the topology in a factor group, we have to
consider only open sets in Hom {F, G'; R, G} which are unions of cosets of
Hom {F, G}. If z_1, ..., z_n is any finite selection from the fixed set of generators
for F, the set U(z_1, ..., z_n) consisting of all φ with φ(z_1) ≡ ... ≡ φ(z_n) ≡ 0
(mod G) is such an open set, and contains φ_0 = 0. We assert that any open set
V containing 0 which is a union of cosets contains one of these sets U. For,
given V, there will be elements x_1, ..., x_m in F such that V contains all φ
with φx_i = 0. Select generators z_1, ..., z_n such that each x_i can be expressed
in terms of z_1, ..., z_n; then V contains all φ with φz_i = 0. Moreover, if
φz_i ≡ 0 (mod G), there is a homomorphism φ_1 of F into G with φz_i = φ_1z_i; since
φ − φ_1 ∈ V, since V is a union of cosets of Hom {F, G}, and since φ_1 ∈ Hom {F, G},
we conclude that φ ∈ V. Thus V ⊃ U(z_1, ..., z_n).

A similar but simpler argument for Hom {F, G'/G; R, 0} will show that every
open set containing zero in this group contains all σ with σz_1 = ... = σz_n = 0,
for a suitable set of the generators of F. The mapping σ → φ carries open sets
of this special type into the open sets U(z_1, ..., z_n) described above, and
conversely. This shows that the correspondence φ → φ* is continuous at 0, and
hence everywhere.

17. Extensions of integers 

Next we consider the case in which every element of H has finite order; we
then write T instead of H for this group. The group of extensions of the integers
by such a group T can be written as a group of characters. In case T is finite,
the result is a generalization of Corollary 11.2, for in this case Char T ≅ T.

Theorem 17.1. If T has only elements of finite order, and if I is the (additive)
group of integers,

(17.1) Ext_f {I, T} = 0,






(17.2) Ext {I, T} ≅ Char T.

The methods used to establish this result apply with equal force if I is replaced
by any discrete group G which has no elements of finite order. The group
Char T of homomorphisms of T into the group of reals modulo 1 must then be
replaced by a group of homomorphisms of T into another group suitably
constructed from G. In fact, any G with no elements of finite order can be embedded
in an essentially unique discrete group G_∞ with the following properties:¹⁹

(i) G_∞ has no elements of finite order,

(ii) G_∞/G has only elements of finite order,

(iii) G_∞ is infinitely divisible.

For any g ∈ G_∞ and any integer m ≠ 0 there is then a unique h = g/m in G_∞ with
mh = g. The (discrete) factor group G_∞/G is the analogue of the topological
group P' of rationals modulo 1. Specifically, if G = I, I_∞ is the group of
rational numbers, and I_∞/I is the group P', but with a discrete topology. Since
T has only elements of finite order, Char T is Hom {T, P'}. But P' clearly has
no arbitrarily small subgroups, so that the latter group, by Lemma 16.1, is
identical (algebraically and topologically) with Hom {T, I_∞/I}. The exact
generalization of Theorem 17.1 is thus

Theorem 17.2. If T has only elements of finite order, while G is discrete and
has no elements of finite order, and G_∞ is defined as above,

(17.3) Ext_f {G, T} = 0,

(17.4) Ext {G, T} ≅ Hom {T, G_∞/G}.

The isomorphism is bicontinuous if G and G_∞/G are both discrete.

Proof. If T is represented in the form T = F/R, for F free, the conclusions
of this theorem can be reformulated, according to the fundamental theorem
of Chapter II, as

(17.3a) Hom_f {R, G; F} = Hom {F | R, G},

(17.4a) Hom {R, G}/Hom {F | R, G} ≅ Hom {F/R, G_∞/G}.

Observe first that any homomorphism θ ∈ Hom {R, G} can be extended in a
unique way to a homomorphism θ* ∈ Hom {F, G_∞}. For, since every element of
T = F/R has finite order, every z ∈ F has a finite order modulo R. For each
such z pick a nonzero integer m such that mz ∈ R, and define

(17.5) θ*(z) = (1/m) θ(mz), z ∈ F, mz ∈ R.

This definition of θ* is independent of the choice of m, and does yield a
homomorphism of F into G_∞. Clearly it is the only such homomorphism extending
the given θ.
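
The extension formula (17.5) can be tried out in the simplest case. The sketch below assumes F = I (the integers), R = N·I and G = I, so that T = F/R is cyclic of order N and G_∞ is the group of rationals; the names `theta`, `theta_star` and the constants N, C are illustrative choices, not notation from the text.

```python
from fractions import Fraction

N = 6   # R = N*Z inside F = Z, so T = F/R is cyclic of order N (assumed example)
C = 5   # theta sends the generator N of R to C in G = Z (assumed example)

def theta(r: int) -> int:
    """A homomorphism R -> G = Z, defined only on multiples of N."""
    if r % N != 0:
        raise ValueError("theta is defined only on R = N*Z")
    return C * (r // N)

def theta_star(z: int, m: int = N) -> Fraction:
    """Formula (17.5): theta*(z) = (1/m) * theta(m*z), for nonzero m with m*z in R."""
    return Fraction(theta(m * z), m)

# Independence of the choice of m: several admissible m give one value.
values = {theta_star(4, m) for m in (N, 2 * N, 5 * N)}

# theta* is additive on F and agrees with theta on R.
additive = all(theta_star(a + b) == theta_star(a) + theta_star(b)
               for a in range(-8, 9) for b in range(-8, 9))
extends = all(theta_star(r) == theta(r) for r in range(-3 * N, 3 * N + 1, N))
```

In this example θ* takes values in the rationals, and θ*(z) is an integer for every z only when N divides C, which is the situation singled out in the next paragraph of the proof.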

Suppose now that θ ∈ Hom_f {R, G; F}. Each element z ∈ F then generates a
finite subgroup of F/R, so θ can be extended to a homomorphism mapping z
and R into G. This extension of θ must agree with the unique extension θ*.
This shows that θ*(z) ∈ G for each z, so that θ* is in fact a homomorphism of F
into G ⊂ G_∞, and θ ∈ Hom {F | R, G}. This proves (17.3a).

19 G_∞ could also be described as a tensor product; see §18.

As in §16, let Hom {F, G_∞; R, G} denote the group of all homomorphisms
φ ∈ Hom {F, G_∞} with φ(R) ⊂ G. This is a topological group, under the usual
specification (§1) that any open set in Hom {F, G_∞; R, G} is the intersection
of this group with an open set in the topological group Hom {F, G_∞}.

The correspondence φ → φ | R provides a bicontinuous isomorphism

(17.6) Hom {F, G_∞; R, G} ≅ Hom {R, G}.

For, by Lemma 3.4, φ → φ | R is a continuous homomorphism. It is an
isomorphism because each θ ∈ Hom {R, G} has a unique extension θ* =
φ ∈ Hom {F, G_∞; R, G}, by (17.5). This inverse correspondence is also
continuous; for if U is the open set consisting of all φ with φ(z_i) = g_i, for given
z_i ∈ F and g_i ∈ G_∞, i = 1, ..., n, there is an open set U_m in Hom {R, G}
consisting of all θ with θ(m z_i) = m g_i, where m is chosen so that each m z_i ∈ R
and each m g_i ∈ G. The correspondence θ → θ* of (17.5) carries U_m into U.
This proves (17.6).

The correspondence φ → φ | R maps the subgroup Hom {F, G} of Hom {F, G_∞;
R, G} onto Hom {F | R, G}. Hence (17.6) also yields an isomorphism

Hom {F, G_∞; R, G}/Hom {F, G} ≅ Hom {R, G}/Hom {F | R, G}.

On the other hand, Lemma 16.2 provides an isomorphism

Hom {F, G_∞; R, G}/Hom {F, G} ≅ Hom {F/R, G_∞/G}.

These two combine to give the required isomorphism (17.4a).

It should be remarked that the results of this section can also be obtained by
arguments directly on factor sets, without the interposition of the fundamental
theorem of Chapter II. Specifically, to prove Theorem 17.2, one could consider
an extension E of G by T, determined by a factor set f(s, t) for s, t ∈ T. If
t ∈ T has order m, let φ_E(t) = (1/m) Σ_i f(it, t) (mod G), where i =
0, 1, ..., m − 1. In this fashion E determines a homomorphism φ_E ∈ Hom {T,
G_∞/G}. Conversely, given such a homomorphism φ, one may select for each
φ(t) ∈ G_∞/G a representative element φ'(t) ∈ G_∞ and construct the corresponding
factor set as f(s, t) = φ'(s) + φ'(t) − φ'(s + t). These correspondences will
establish (17.4). The device of constructing φ_E by summation over the terms
of the factor set is an application of the so-called “Japanese homomorphism,”
as commonly used for (multiplicative) factor sets.
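
The summation device can be checked numerically in a small case. The sketch below assumes T = Z/M and G = I (so that G_∞/G is the rationals modulo 1), builds the factor set from a chosen homomorphism as in the converse construction above, and recovers the homomorphism by averaging; `rep`, `phi_E`, M and C are illustrative names and values only.

```python
from fractions import Fraction

M = 12   # T = Z/M (assumed example); G = Z, so G_inf/G is the rationals mod 1
C = 5    # phi(t) = C*t/M mod 1 is the chosen homomorphism T -> Q/Z (assumed)

def rep(t: int) -> Fraction:
    """Representative phi'(t) in [0, 1) of the coset phi(t)."""
    return Fraction((C * t) % M, M)

def f(s: int, t: int) -> Fraction:
    """Factor set f(s, t) = phi'(s) + phi'(t) - phi'(s + t)."""
    return rep(s) + rep(t) - rep((s + t) % M)

def order(t: int) -> int:
    """Order of t in Z/M."""
    return next(m for m in range(1, M + 1) if (m * t) % M == 0)

def phi_E(t: int) -> Fraction:
    """(1/m) * sum over i = 0..m-1 of f(i*t, t), reduced mod 1, m the order of t."""
    m = order(t)
    total = sum(f((i * t) % M, t) for i in range(m))
    return (total / m) % 1

# The factor set takes values in G = Z, and the averaging recovers phi.
integral = all(f(s, t).denominator == 1 for s in range(M) for t in range(M))
recovered = all(phi_E(t) == rep(t) for t in range(M))
```

The sum telescopes: the terms φ'(it) cancel in pairs, leaving m·φ'(t), which is why dividing by m returns the chosen representative.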

18. Tensor products 

Some of our formulas can be expressed more easily by means of the tensor 
products introduced by Whitney [13]. If A and B are given discrete abelian 
groups the tensor product A ∘ B is a set whose elements are finite formal sums
Σ a_i b_i of formal products a_i b_i, with each a_i ∈ A, b_i ∈ B. Two such elements are
added simply by combining the two formal sums into a single sum. Two such
elements are equal if and only if the second can be obtained from the first by a
finite number of replacements of the forms (a + a')b ↔ ab + a'b, a(b + b') ↔
ab + ab'. The tensor product A ∘ B so defined is a discrete abelian group, and
the multiplication a • b is a pairing of A and B to A ∘ B.

In the special case when B = G is a group containing no elements of finite
order, and A = R_0 is the additive group of rational numbers, any sum Σ a_i b_i
can, by the distributive law, be rewritten as a single term (r/s)b, where s is a
common denominator for the rational numbers a_i. This representation is
essentially unique. Therefore R_0 ∘ G is simply the group G_∞ used in §17 above,
and G_∞/G is (R_0 ∘ G)/G (for details, cf. Whitney [13], pp. 507-508).

The tensor product can equivalently be defined in terms of characters, in the
following fashion:

Theorem 18.1. If A and B are (discrete) abelian groups,

(18.1) A ∘ B ≅ Char Hom {B, Char A}.

Proof. This conclusion can also be written in the form

(18.2) Char (A ∘ B) ≅ Hom {B, Char A}.

Since the group of characters is the group of homomorphisms into the group P
of reals modulo 1, this conclusion is a special case (with C = P) of the following

Lemma 18.2. If A and B are discrete abelian groups, C any generalized
(topological) abelian group, then there is a bicontinuous isomorphism

(18.3) Hom {A ∘ B, C} ≅ Hom {B, Hom {A, C}}.

Proof. Let θ ∈ Hom {A ∘ B, C} be given. For each b ∈ B, let φ_b(a) =
θ(ab). Then φ_b ∈ Hom {A, C}. Let ω_θ(b) = φ_b. Then ω_θ ∈ Hom {B, Hom {A, C}},
and the correspondence θ → ω_θ is a homomorphism of Hom {A ∘ B, C} into
Hom {B, Hom {A, C}}. One verifies readily that it is an (algebraic)
isomorphism (ω_θ = 0 only if θ = 0). Furthermore, it is an isomorphism onto the
whole group Hom {B, Hom {A, C}}. For let any ω in the latter group be
given, with ω(b) = φ'_b ∈ Hom {A, C} for each b ∈ B. Then define

θ_ω(Σ a_i b_i) = Σ φ'_{b_i}(a_i), a_i ∈ A, b_i ∈ B.

One verifies that θ_ω is uniquely defined, under the identifications (a + a')b →
ab + a'b, a(b + b') → ab + ab' used in the definition of A ∘ B. Furthermore,
θ_ω ∈ Hom {A ∘ B, C}, and θ_ω → ω in the previously given correspondence.
Therefore θ → ω_θ, ω → θ_ω does yield the indicated isomorphism (18.3). The
continuity of the isomorphism in both directions is readily established from these
explicit formulas and the appropriate definitions of open sets in the given
topologies of the groups concerned.
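
For finite cyclic groups the two sides of (18.3) can be enumerated directly. The following sketch assumes A = Z/M, B = Z/N, C = Z/K with small illustrative values, and uses the fact that a homomorphism on A ∘ B amounts to a bilinear map on A × B; the helper names `curry` and `uncurry` are hypothetical labels for the correspondences θ → ω_θ and ω → θ_ω.

```python
from math import gcd

M, N, K = 4, 6, 8   # A = Z/M, B = Z/N, C = Z/K (assumed small examples)

# A homomorphism theta on A o B amounts to a bilinear map A x B -> C; on cyclic
# groups it has the form theta(a, b) = c*a*b with M*c = 0 and N*c = 0 in Z/K.
bilinear = [c for c in range(K) if (M * c) % K == 0 and (N * c) % K == 0]

def curry(c):
    """theta -> omega_theta, where omega_theta(b) = phi_b and phi_b(a) = theta(ab)."""
    return lambda b: (lambda a: (c * a * b) % K)

def uncurry(omega):
    """omega -> theta_omega, with theta_omega(ab) = omega(b)(a)."""
    return lambda a, b: omega(b)(a)

# Homomorphisms A -> C are a -> d*a with M*d = 0 in Z/K.
hom_A_C = [d for d in range(K) if (M * d) % K == 0]
# Homomorphisms B -> Hom{A, C} are b -> e*b with e in Hom{A, C} and N*e = 0 in Z/K.
hom_B_hom_A_C = [e for e in hom_A_C if (N * e) % K == 0]

# Passing from theta to omega_theta and back returns the original theta.
round_trip = all(
    uncurry(curry(c))(a, b) == (c * a * b) % K
    for c in bilinear for a in range(M) for b in range(N)
)
```

Both lists have gcd(gcd(M, N), K) entries here, as expected from A ∘ B being cyclic of order gcd(M, N) in this case.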





Chapter IV. Direct and Inverse Systems 

The Čech homology groups for a space are defined as limits of certain “direct”
and “inverse” systems of homology groups for finite coverings of the space
(Chap. VI). In view of our representation of homology groups in terms of
groups of homomorphisms and groups of group extensions we are led to consider
limits of groups of this sort. We shall show that the limit of a group of
homomorphisms is itself a group of homomorphisms (§21) and that the corresponding
proposition holds in certain special cases for groups of group extensions (§22). 
In the general case, however, we must introduce a new group to represent the 
limit of a group of group extensions. This group can also be introduced as a 
limit of tensor products (§25). 

19. Direct systems of groups 

A directed set J is a partially ordered set of elements α, β, γ, ... such that
for any two elements α and β there always exists an element γ with α < γ,
β < γ. For each index α in a directed set J let H_α be a (discrete) group, and
for each pair α < β, let φ_{βα} be a homomorphism of H_α into H_β. If φ_{γα} = φ_{γβ} φ_{βα}
whenever α < β < γ, the groups H_α are said to form a direct system with the
projections φ_{βα}. 20

Any direct system determines a unique (discrete) limit group H = Lim H_α
as follows. Every element h_α of one of the groups H_α is regarded as an element
h*_α of the limit H, and two elements h*_α, h*_β are equal if and only if there is an
index γ, α < γ, β < γ, with φ_{γα} h_α = φ_{γβ} h_β. Two elements h*_α and h*_β in H are
added by finding some γ with α < γ, β < γ; the sum is then the element h*_γ =
(φ_{γα} h_α + φ_{γβ} h_β)*. Under this addition and equality, the elements h* form a
group H = Lim H_α. Each of the given groups H_α has a homomorphism
φ_α(h_α) = h*_α into the limit group, and φ_β φ_{βα} = φ_α, for α < β.

In case each given projection is an isomorphism (of H_α into H_β), the limit
group can be regarded as a “union” of the given groups: each group H_α has an
isomorphic replica within H, and H is simply the union of these subgroups.

A subset J' of the set J of indices α is said to be cofinal in J if for each index
α there is in J' an α' with α < α'. The limit Lim H_α', taken over any such
cofinal subset, is isomorphic to the original limit H.
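
The equality and addition rules for a direct limit can be made concrete on a small system. The sketch below assumes every H_n is a copy of the integers, indexed by n = 0, 1, 2, ..., with the projection from H_n to H_m given by multiplication by P^(m−n); the limit is then the group of rationals whose denominator is a power of P. All names (`project`, `equal`, `add`, `as_rational`) are illustrative.

```python
from fractions import Fraction

P = 3  # assumed prime; every H_n is a copy of Z and phi_{m,n} multiplies by P**(m - n)

def project(m: int, n: int, h: int) -> int:
    """Projection phi_{m,n}: H_n -> H_m for n <= m."""
    if m < n:
        raise ValueError("projections only go from smaller to larger indices")
    return P ** (m - n) * h

def equal(x, y) -> bool:
    """(n, h) and (m, k) give the same limit element iff they agree in some later H_g."""
    (n, h), (m, k) = x, y
    g = max(n, m)  # the projections here are one-to-one, so the first common index suffices
    return project(g, n, h) == project(g, m, k)

def add(x, y):
    """Add two limit elements by pushing both into a common H_g."""
    (n, h), (m, k) = x, y
    g = max(n, m)
    return (g, project(g, n, h) + project(g, m, k))

def as_rational(x) -> Fraction:
    """Identify the class of (n, h) with h / P**n in the rationals."""
    n, h = x
    return Fraction(h, P ** n)

a, b = (1, 2), (3, 18)      # both represent 2/3
c = add((1, 1), (2, 1))     # 1/3 + 1/9 = 4/9
```

Since every projection in this system is one-to-one, this is an instance of the "union" case described above; for projections with kernels, `equal` would have to search over later indices rather than stop at the first common one.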

20. Inverse systems of groups 

For each index α in a directed set let A_α be a (generalized topological) group,
and for each α < β let ψ_{αβ} be a (continuous) homomorphism of A_β into A_α. If
ψ_{αβ} ψ_{βγ} = ψ_{αγ} whenever α < β < γ, the groups A_α are said to form an inverse
system relative to the projections ψ_{αβ}. Each inverse system determines a limit
group A = Lim A_α. An element of this limit group is a set {a_α} of elements
a_α ∈ A_α which “match” in the sense that ψ_{αβ} a_β = a_α for each α < β. The sum
of two such sets is {a_α} + {b_α} = {a_α + b_α}; since the ψ's are homomorphisms,
this sum is again an element of the group. This limit group A is a subgroup
of the direct product of the groups A_α. The topology of the direct product
Π A_α thus induces (§1) a topology in Lim A_α; an open set in the latter group
is the intersection with Lim A_α of an open set of Π A_α. This makes Lim A_α
a generalized topological group. As before, a cofinal subset of the indices gives
an isomorphic limit group.

20 Direct (and inverse) systems were discussed in Steenrod [9], Lefschetz [7], Chap. I
and II, and in Weil [12], Ch. I.
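
The matching condition for an inverse system can be illustrated on a finite stage of a familiar example. The sketch below assumes A_n = Z/P^n for n = 1, ..., DEPTH, with the projection from A_b to A_a (a ≤ b) given by reduction modulo P^a; only finitely many indices are modelled, and the names `psi`, `matches`, `add`, `thread_of` are illustrative.

```python
P = 5       # assumed prime
DEPTH = 4   # only the finite stage A_1, ..., A_DEPTH is modelled (assumption)

def psi(a: int, b: int, x: int) -> int:
    """Projection psi_{a,b}: A_b = Z/P**b -> A_a = Z/P**a for a <= b."""
    if a > b:
        raise ValueError("projections go from larger to smaller indices")
    return x % P ** a

def matches(thread) -> bool:
    """thread[i] lies in A_{i+1}; check psi_{a,b} thread_b = thread_a for all a < b."""
    return all(psi(a, b, thread[b - 1]) == thread[a - 1]
               for b in range(1, len(thread) + 1) for a in range(1, b))

def add(s, t):
    """Componentwise sum {a_alpha} + {b_alpha} = {a_alpha + b_alpha}."""
    return [(x + y) % P ** (i + 1) for i, (x, y) in enumerate(zip(s, t))]

def thread_of(n: int):
    """The matching set determined by an integer n."""
    return [n % P ** i for i in range(1, DEPTH + 1)]

s, t = thread_of(-1), thread_of(38)
```

Because each projection is a homomorphism, the componentwise sum of two matching sets matches again, which is the closure property used in the definition of the limit group.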

Let each B_α be a subgroup of the corresponding group A_α of an inverse system,
and assume, for α < β, that ψ_{αβ} B_β ⊂ B_α. Then the system B_α is an inverse
system under the same projections ψ_{αβ}, and the limit Lim B_α is, in natural
fashion, a subgroup of Lim A_α. On the other hand, ψ_{αβ} induces a homomorphism
ψ'_{αβ} of the (generalized topological) group A_β/B_β into A_α/B_α. Relative to these
projections, the factor groups themselves form an inverse system A_α/B_α. The
limit group of the latter system contains a homomorphic image of Lim A_α;
if each a_α in A_α determines a coset a'_α in A_α/B_α, the map {a_α} → {a'_α} is a
homomorphism of Lim A_α into Lim (A_α/B_α) in which exactly the elements of
Lim B_α are mapped on zero. Thus we have

(20.1) Lim A_α / Lim B_α ⊂ Lim (A_α/B_α).

For compact topological subgroups this is an isomorphism: 

Lemma 20.1. If the A_α form an inverse system relative to the ψ_{αβ}, and if each
B_α is a compact topological subgroup of A_α with ψ_{αβ} B_β ⊂ B_α, then

(20.2) Lim A_α / Lim B_α ≅ Lim (A_α/B_α).

Proof. Consider any c = {c_α} in Lim (A_α/B_α), where ψ'_{αβ} c_β = c_α for each
α < β. Each c_α ∈ A_α/B_α is a coset of the compact topological subgroup B_α,
hence itself is a compact Hausdorff subspace of the space A_α. Furthermore
ψ_{αβ} is a continuous mapping of the set c_β into c_α, for each α < β. Since ψ_{αγ} =
ψ_{αβ} ψ_{βγ}, the sets c_α form an inverse system of compact non-empty Hausdorff
spaces. Their limit space is therefore 21 non-vacuous. This means that there
is a set of elements a_α ∈ c_α with ψ_{αβ} a_β = a_α for α < β. The element {a_α} in the
group Lim A_α is therefore an element which maps onto the given element
{c_α} in the homomorphism {a_α} → {a'_α} used to establish (20.1). The
continuity of (20.2), in both directions, follows readily.

There is also an “isomorphism” theorem for inverse systems. 

Lemma 20.2. If the groups A_α form an inverse system relative to the projections
ψ_{αβ}, while C_α form an inverse system (with the same set of indices) relative to
projections φ_{αβ}, and if σ_α are (bicontinuous) isomorphisms of A_α to C_α, for every
α, such that the “naturality” condition σ_α ψ_{αβ} = φ_{αβ} σ_β holds, then the groups Lim A_α
and Lim C_α are bicontinuously isomorphic.

21 See Lefschetz [7], Theorem 39.1 or Steenrod [9], p. 666. Observe, however, that the
latter proof is incomplete, because of the gap in lines 10-11 on p. 666.

21. Inverse systems of homomorphisms 

Consider the group of all homomorphisms of H into G. As in Chap. II, §12,
each projection φ_{βα} of a direct system of groups H_α will induce a “dual”
homomorphism φ*_{αβ} of Hom {H_β, G} into Hom {H_α, G}. Furthermore φ*_{αβ} φ*_{βγ} =
φ*_{αγ} for all α < β < γ, so that the groups Hom {H_α, G} form an inverse system
relative to these dual projections.

Theorem 21.1. If the (discrete) groups H_α form a direct system, then

(21.1) Hom {Lim H_α, G} ≅ Lim Hom {H_α, G}.

Proof. Consider any element ω = {θ_α} in Lim Hom {H_α, G}. To define
a corresponding homomorphism θ_ω on H = Lim H_α, represent each element
h ∈ H as a projection h = φ_α h_α of some element h_α ∈ H_α, and set

(21.2) θ_ω(h) = θ_ω(φ_α h_α) = θ_α(h_α), h = φ_α h_α.

The “matching” requirement that θ_α = φ*_{αβ} θ_β for α < β readily shows that
θ_ω(h) has a unique value, independent of the representation h = φ_α h_α chosen.
Furthermore, θ_ω ∈ Hom {H, G}, and the correspondence ω → θ_ω is an
isomorphism.

Conversely, let any θ ∈ Hom {H, G} be given, and define

(21.3) θ_α(h_α) = θ(φ_α h_α), h_α ∈ H_α.

If α < β, φ*_{αβ} θ_β(h_α) = θ_β[φ_{βα} h_α] = θ[φ_β φ_{βα} h_α] = θ(φ_α h_α) = θ_α h_α, so φ*_{αβ} θ_β = θ_α,
and these θ's match. Therefore ω = {θ_α} is an element of the inverse limit
group Lim Hom {H_α, G}, and clearly θ_ω is the original homomorphism θ. The
correspondence ω → θ_ω therefore does establish the desired isomorphism (21.1).
The continuity in both directions follows directly from the formulas (21.2)
and (21.3) and the appropriate definition of neighborhoods of zero in the groups
concerned.


22. Inverse systems of group extensions 

Consider a direct system of discrete groups H_α. As in Chap. II, §12, each
projection φ_{βα} of H_α into H_β will induce a homomorphism φ*_{αβ} of Ext {G, H_β}
into Ext {G, H_α}. Furthermore φ*_{αβ} φ*_{βγ} = φ*_{αγ} for all α < β < γ, so that the
groups Ext {G, H_α} form an inverse system. Contrary to the situation in the
previous section, the limit group Lim Ext {G, H_α} may not be isomorphic to
Ext {G, Lim H_α}. An example to this effect will be given below. However,
there are two important cases when “Lim” and “Ext” are interchangeable.

Theorem 22.1. If G is compact and topological, while the (discrete) groups
H_α form a direct system, then

(22.1) Ext {G, Lim H_α} ≅ Lim Ext {G, H_α}.

This is proved by repeated applications of Lemma 20.1 to the representation

(22.2) Ext {G, H} = Fact {G, H}/Trans {G, H},

where H = Lim H_α. Recall that any f ∈ Trans {G, H} has the form

f(h, k) = g(h) + g(k) − g(h + k), h, k ∈ H.

Here g ∈ G^H is any mapping of H into G. Clearly f = 0 if and only
if g ∈ Hom {H, G}, so

(22.3) Trans {G, H} ≅ G^H / Hom {H, G}.

The correspondence g → f is clearly continuous; since the isomorphism (22.3)
is one-one and since the groups G^H and Hom {H, G} are compact, by Lemma
3.1, the bicontinuity of (22.3) follows. Furthermore, this isomorphism is a
“natural” one relative to homomorphisms, so that the isomorphism theorem
for inverse systems (Lemma 20.2) gives

Lim Trans {G, H_α} ≅ Lim [G^{H_α}/Hom {H_α, G}].

In this representation the groups G^{H_α} and Hom {H_α, G} with the “dual”
projections φ*_{αβ} form inverse systems with the respective limits G^H and
Hom {H, G}. Furthermore each group Hom {H_α, G} is compact and
topological, so Lemma 20.1 gives

(22.4) Lim Trans {G, H_α} ≅ Lim G^{H_α} / Lim Hom {H_α, G}

≅ G^H/Hom {H, G} ≅ Trans {G, H}.

On the other hand one may show exactly as in the proof of Theorem 21.1
on homomorphisms that there is a bicontinuous isomorphism

(22.5) Lim Fact {G, H_α} ≅ Fact {G, H}.

Furthermore, each of the groups Trans {G, H_α} is compact and topological, so
that Lemma 20.1 applies again to prove

Lim [Fact/Trans] ≅ Lim Fact / Lim Trans.

This, with (22.4) and (22.5), gives the desired conclusion. 22

Theorem 22.2. If G is discrete and has no elements of finite order, while T_α
is a direct system of discrete groups with only elements of finite order, then

(22.6) Ext {G, Lim T_α} ≅ Lim Ext {G, T_α}.

The proof appeals directly to the result found in Theorem 17.2 of Chapter III,
to the effect that

(22.7) Ext {G, T_α} ≅ Hom {T_α, G_∞/G}.

The groups Hom {T_α, G_∞/G} will form an inverse system under the dual
projections φ*_{αβ}; as in Theorem 21.1 we then have

Hom {Lim T_α, G_∞/G} ≅ Lim Hom {T_α, G_∞/G}.

But the group on the left is simply Ext {G, Lim T_α}, by another application
of Theorem 17.2. The desired result should then follow by taking (inverse)
limits on both sides in (22.7).

To carry out this argument, it is necessary to have the naturality condition
which gives the isomorphism theorem (Lemma 20.2) for inverse systems. This
naturality condition requires that the isomorphism (22.7) permute with the
projections of the inverse systems. This is just a statement of the fact that the
isomorphism (22.7) established in Theorem 17.2 is “natural” in the sense
envisaged in §12. The proof of this naturality is straightforward, so details will
be omitted.

22 Theorem 22.1 can also be proved by representing Ext by means of Char Hom {G, H}
as in Theorem 15.1. This argument, however, requires a tedious proof that the isomorphism
established in the latter theorem is “natural,” in the sense of §12.

Corollary 22.3. If the discrete group G has only a finite number of generators,
while T_α is a direct system of discrete groups with only elements of finite order, then

Ext {G, Lim T_α} ≅ Lim Ext {G, T_α}.

Proof. Write G as F × L where F is free, L is finite (and thus compact).
By (11.2) there is a “natural” isomorphism

Ext {G, Lim T_α} ≅ Ext {F, Lim T_α} × Ext {L, Lim T_α}.

The asserted result now follows by applying Theorem 22.2 to the first factor
on the right, and Theorem 22.1 to the second, using Lemma 20.2.

We now show by an example that “Ext” and “Lim” do not necessarily
commute. Let p be a fixed prime number, H the additive group of all rationals
with denominator a power of p, and H_n the subgroup consisting of all multiples
of 1/p^n. Then Lim H_n = H, since H is the union of the groups H_n.
Furthermore H_n is a free group, so Ext {I, H_n} = 0, where I is the group of integers.
On the other hand, Ext {I, Lim H_n} = Ext {I, H} is a group computed in
appendix B; it is decidedly not zero, in fact it is not even denumerable.

23. Contracted extensions 

Before further consideration of the inverse limits of groups of extensions,
we make a comparison of the group of extensions of a group G by a group H
with the group of extensions by a subgroup H_0 of H. The identity mapping I
of H_0 into H is a homomorphism, hence, as in §12, will give dual homomorphisms

(23.1) I*: Fact {G, H} → Fact {G, H_0},

(23.2) I*: Trans {G, H} → Trans {G, H_0}.

Specifically, I* is the operation of “cutting off” a factor set f ∈ Fact {G, H} to
give a factor set f_0 = I*f ∈ Fact {G, H_0}; f_0(h, k) is defined only for h, k ∈ H_0,
and always equals f(h, k). Clearly I* carries transformation sets into
transformation sets, as in (23.2). Thus I* also induces a dual homomorphism

(23.3) I*: Ext {G, H} → Ext {G, H_0}.

This homomorphism may be visualized as follows: given E such that G ⊂ E
and E/G = H, there is an E_0 ⊂ E such that G ⊂ E_0 and E_0/G = H_0. Then
I*(E) = E_0.

Lemma 23.1. If H_0 is a subgroup of the group H then for any group G the
homomorphism I* of (23.3) maps the group Ext {G, H} onto Ext {G, H_0}.

Proof. 23 Represent H as F/R, where F is free. There is then a subgroup
F_0 of F such that R ⊂ F_0 and F_0/R = H_0. By the fundamental theorem we
have isomorphisms

Ext {G, H} ≅ Hom {R, G}/Hom {F | R, G},

Ext {G, H_0} ≅ Hom {R, G}/Hom {F_0 | R, G},

where Hom {F | R, G} ⊂ Hom {F_0 | R, G}. According to the “naturality”
theorem of §12 the homomorphism I* between the groups on the left can be
represented on the right as that correspondence which carries each coset of
Hom {F | R, G} into the coset of Hom {F_0 | R, G} in which it is contained. This
makes it obvious that the homomorphism is a mapping “onto.”

Lemma 23.2. If H_0 ⊂ H, then the dual homomorphisms I* of factor and
transformation sets, as in (23.1) and (23.2), are mappings “onto.”

Proof. Any element in Trans {G, H_0} has the form

f(h, k) = g(h) + g(k) − g(h + k),

where g is an arbitrary function on H_0 to G. Let g* be an arbitrary extension
of g to H, and

f*(h, k) = g*(h) + g*(k) − g*(h + k).

Then f* is a transformation set with I*f* = f. This proves that (23.2) is a
mapping onto. Since (23.3) and (23.2) are mappings onto, the same holds
for (23.1).
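
The extension step in this proof is easy to reproduce on a finite example. The sketch below assumes H = Z/8 with subgroup H_0 = {0, 2, 4, 6} and G = I; the function values in `g` and the choice of extension `g_star` are arbitrary illustrative data, not taken from the text.

```python
import itertools

N = 8                            # H = Z/8 (assumed example)
H0 = [0, 2, 4, 6]                # subgroup H_0 of H
g = {0: 0, 2: 5, 4: -1, 6: 3}    # arbitrary function H_0 -> G = Z (assumed values)

def transformation_set(fn, elements):
    """f(h, k) = fn(h) + fn(k) - fn(h + k) for h, k in `elements` (addition mod N)."""
    return {(h, k): fn[h] + fn[k] - fn[(h + k) % N]
            for h, k in itertools.product(elements, repeat=2)}

# Extend g to all of H; the values off H_0 are arbitrary (here 7*h is an assumption).
g_star = {h: g.get(h, 7 * h) for h in range(N)}

f = transformation_set(g, H0)
f_star = transformation_set(g_star, range(N))

# "Cutting off" f* at H_0 means restricting it to pairs taken from H_0.
cut_off = {pair: value for pair, value in f_star.items()
           if pair[0] in H0 and pair[1] in H0}
```

Any other choice of values for `g_star` off H_0 gives a different f*, but the cut-off at H_0 is the same, since H_0 is closed under addition.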


24. The group Ext* 

Since limits do not always permute with groups of extensions, we now
introduce a new group which is the limit of an inverse system of groups of group
extensions.

Consider a discrete group T with only elements of finite order. The set
{S_α} of all finite subgroups of T is a direct system, if α < β means that S_α ⊂ S_β,
and that the projection I_{βα} of S_α into S_β is simply the identity. The direct
limit of {S_α} is the group T.

Let G be any generalized topological group. Since {S_α} is a direct system,
it follows from a previous section that the groups Ext {G, S_α} form an inverse
system with projections I*_{αβ}. We define our new group as the limit of this
system

(24.1) Ext* {G, T} = Lim Ext {G, S_α}.

23 The lemma can also be proved directly in terms of the group extensions E, E_0, using
a suitable transfinite induction.

The two theorems of §22 as to cases in which “Ext” and “Lim” commute
give at once

Corollary 24.1. If G is compact and topological, or is discrete without
elements of finite order, then

Ext* {G, T} ≅ Ext {G, T}.

In the definition of Ext* we used the approximation of T by its finite
subgroups S_α. However, any approximation by finite groups will give the same
result:

Theorem 24.2. If T_α is any direct system of finite groups, the corresponding
inverse system of groups Ext {G, T_α} has a limit

(24.2) Lim Ext {G, T_α} ≅ Ext* {G, Lim T_α}.

Proof. In case T_α is the system of all finite subgroups of the limit T =
Lim T_α, this equation is simply the definition of Ext*. In general, T = Lim T_α
is a group in which every element has finite order. Each T_α has a homomorphic
projection T'_α = φ_α T_α into the limit T, and T is simply the union of these
subgroups T'_α. The set of these subgroups T'_α is therefore cofinal in the set of
all finite subgroups of T. The inverse system of the groups Ext {G, T'_α},
relative to the “identity” projections I*_{αβ}, is cofinal in the inverse system used to
define Ext*, hence gives the same limit group,

(24.3) Ext* {G, T} ≅ Lim Ext {G, T'_α}.

An element f* in this limit group can be represented (but not uniquely) as
a set {f_α} of factor sets f_α ∈ Fact {G, T'_α} which “match” modulo transformation
sets. This means that for each β > α there is a transformation set
t_{αβ} ∈ Trans {G, T'_α} such that

f_α(h', k') = f_β(h', k') + t_{αβ}(h', k'), h', k' ∈ T'_α.

Now each homomorphism φ_α of T_α into T'_α determines, as in §12, a dual
homomorphism φ*_α of Fact {G, T'_α} into Fact {G, T_α}, defined so that e_α = φ*_α f_α is the
factor set given by the equations

(24.4) e_α(h, k) = f_α(φ_α h, φ_α k), h, k ∈ T_α.

If the f_α match, one readily proves that the corresponding e_α also match, modulo
transformation sets. If the representation of f* by {f_α} is changed by adding
to each f_α a transformation set, the e_α's are changed accordingly by
transformation sets. Therefore the correspondence

(24.5) f* = {f_α} → e* = {φ*_α f_α} = ω f*

carries each element f* in Lim Ext {G, T'_α} into a well defined element e* in
Lim Ext {G, T_α}. One verifies at once that this correspondence is a
homomorphism.





Now we use the assumption that each T_α is finite. If φ_α h_α = 0 for some
h_α ∈ T_α, the definition of equality in a direct system shows that φ_{βα} h_α = 0 for
some β > α. Since the whole group T_α is finite, we can select a single β =
β_0(α) > α which will do this for all h_α, so that

φ_α h_α = 0 implies φ_{βα} h_α = 0, β = β_0(α).

Since φ_β φ_{βα} = φ_α, φ_β is now an isomorphism of φ_{βα} T_α onto T'_α. Let φ_β^{-1} denote
the inverse correspondence.

Next we show that ω, as defined by (24.5), is an isomorphism. For suppose
ωf* = 0; every e_α is then a transformation set t_α. Using (24.4) and β =
β_0(α), we then have, for any h', k' ∈ T'_α,

f_α(h', k') ≡ f_β(h', k') = e_β(φ_β^{-1} h', φ_β^{-1} k') = t_β(φ_β^{-1} h', φ_β^{-1} k'),

where ≡ denotes equality modulo a transformation set.
This shows that f_α is a transformation set, hence that f* = {f_α} = 0
in Ext* {G, T}.

To construct a correspondence inverse to ω, let e* = {e_α} be a given element
in Lim Ext {G, T_α}, where each e_α ∈ Fact {G, T_α}. Define

(24.6) f_α(h', k') = e_β(φ_β^{-1} h', φ_β^{-1} k'), β = β_0(α)

for each h', k' ∈ T'_α. Since the e_α's are known to match, we may verify that
the replacement of β by any larger index γ in this definition will only alter f_α
by a transformation set. To show that f_α and f_γ match properly for α < γ,
one then chooses β > β_0(α), β > β_0(γ) in (24.6) and uses the given matching of
the e_α's (modulo transformation sets). Finally, one verifies easily that the
correspondence {e_α} → {f_α} of (24.6) is the inverse of the given correspondence
ω of (24.5). This establishes the isomorphism (24.2) required in the theorem.
The continuity, in both directions, follows from the formulae (24.5) and (24.6).

Theorem 24.3. If every element of T has finite order, the group Ext* {G, T}
contains an everywhere dense subgroup isomorphic to Ext {G, T}/Ext_f {G, T}.

This will be established by constructing a “natural” homomorphism of
Ext {G, T} into Ext* {G, T}. To this end, let E be any extension of G by T
determined by a factor set f. As in §23, f may be “cut off” to give a factor set
f_α for any given finite subgroup S_α ⊂ T. These factor sets match properly, so
{f_α} determines a definite element in the inverse limit group Ext* {G, T}.
Alteration of f by a transformation set alters each f_α by the correspondingly
“cut off” transformation set, hence does not alter the element {f_α} = f* of
Ext*. Therefore f → {f_α} is a well defined homomorphism of Ext into Ext*.
In case f lies in Ext_f {G, T}, each f_α is a transformation set, by the very
definition of Ext_f, so that {f_α} = 0. Conversely, if each f_α is a transformation set,
f ∈ Ext_f. We thus have a (bicontinuous) isomorphism of Ext/Ext_f onto a
subgroup of Ext*.

To show this subgroup everywhere dense in Ext* it will suffice, whatever the
topology in G, to show the following: Given an element f* = {f_α} in Ext* {G, T}
and a finite set J_0 of indices, there exists a factor set f' in Fact {G, T} such that
f'_α − f_α is a transformation set for every index α ∈ J_0, where f'_α denotes f' cut
off at S_α. To prove this, choose a finite subgroup S_γ which contains all the
groups S_α, for α ∈ J_0. By Lemma 23.2, the given factor set f_γ can be obtained by
“cutting off” a suitable factor set f', so that f'_γ − f_γ is the transformation set 0.
The matching condition for the f_α then shows that each difference f'_α − f_α is also
a transformation set, for α ∈ J_0. This proves the property stated above, and
with it, the theorem.

In many cases the subgroup considered in Theorem 24.3 is the whole group
Ext*. It follows from previous considerations that this is the case when G is
compact or when G is discrete and has no elements of finite order. Another
important case is that when T is countable:

Theorem 24.4. If T is countable then

(24.7) Ext* {G, T} ≅ Ext {G, T}/Ext_f {G, T}.

Proof. Since T is countable, the system of all finite subgroups of T used to
define Ext* {G, T} may be replaced by a cofinal sequence of finite subgroups
T_n with T_1 ⊂ T_2 ⊂ ... ⊂ T_n ⊂ ... ⊂ T, with the identity projections I_n:
T_n → T_{n+1}. Therefore Ext* {G, T} = Lim Ext {G, T_n}. An element
e* of this group can then be represented as a sequence {f_n} of factor sets
f_n ∈ Fact {G, T_n} which match, in the sense that, for some g_n,

(24.8) f_{n+1}(h, k) = f_n(h, k) + [g_n(h) + g_n(k) − g_n(h + k)]

for all h, k ∈ T_n. The transformation set shown in brackets may be extended
to all of T by extending g_n to a function g*_n on T, as in Lemma 23.2. We
introduce a new function s_n(h) = g*_1(h) + ... + g*_{n−1}(h), for all h ∈ T, and a new
family of factor sets

f'_n(h, k) = f_n(h, k) − [s_n(h) + s_n(k) − s_n(h + k)],

for h, k ∈ T_n. Since f'_n differs from f_n by a transformation set, the given
element e* of Ext* has both representations {f_n} and {f'_n}. But (24.8) also shows
that f'_{n+1}, cut off at T_n, is exactly f'_n. Therefore these factor sets match
exactly, and provide a composite factor set f of T in G. This factor set f is one
which corresponds to the given element e* of Ext* in the “natural”
homomorphism of Ext into Ext* as constructed in Theorem 24.3, so this
homomorphism maps Ext on all of Ext*, as asserted in (24.7).
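
The correction by the partial sums s_n can be traced on a small model. The sketch below assumes G = I and T_n = {k/2^n mod 1}, takes the carry of fractional addition as a base factor set, and perturbs it at each level by an arbitrary transformation set so that the given f_n match only up to such sets; the perturbation `u`, the extension by 0 in `g_star`, and all names are assumptions made for the demonstration.

```python
from fractions import Fraction

LEVELS = 4  # T_1 < T_2 < ... < T_4 with T_n = {k / 2**n mod 1} (assumed example)

def T(n):
    return [Fraction(k, 2 ** n) for k in range(2 ** n)]

def delta(fn, h, k):
    """Transformation set of fn: fn(h) + fn(k) - fn(h + k), addition taken mod 1."""
    return fn(h) + fn(k) - fn((h + k) % 1)

def carry(h, k):
    """A factor set of T in G = Z: the carry when adding representatives in [0, 1)."""
    return int(h + k >= 1)

def u(n):
    """Arbitrary perturbation T -> Z depending on n (an assumption for the demo)."""
    return lambda h: n * h.numerator + h.denominator

def f(n):
    """Given factor sets f_n on T_n; they match only modulo transformation sets."""
    return lambda h, k: carry(h, k) + delta(u(n), h, k)

def g_star(n):
    """g_n = u_{n+1} - u_n on T_n, extended by 0 outside T_n (an arbitrary extension)."""
    return lambda h: (u(n + 1)(h) - u(n)(h)) if (h * 2 ** n).denominator == 1 else 0

def s(n):
    """s_n = g*_1 + ... + g*_{n-1}."""
    return lambda h: sum(g_star(j)(h) for j in range(1, n))

def f_prime(n):
    """Corrected factor sets f'_n = f_n - (transformation set of s_n)."""
    return lambda h, k: f(n)(h, k) - delta(s(n), h, k)

pairs = [(n, h, k) for n in range(1, LEVELS) for h in T(n) for k in T(n)]
relation_24_8 = all(f(n + 1)(h, k) == f(n)(h, k) + delta(g_star(n), h, k)
                    for n, h, k in pairs)
inexact_before = any(f(n + 1)(h, k) != f(n)(h, k) for n, h, k in pairs)
exact_after = all(f_prime(n + 1)(h, k) == f_prime(n)(h, k) for n, h, k in pairs)
```

The exact matching after correction does not depend on how g_n is extended off T_n, because only values on T_n enter when f'_{n+1} is cut off at T_n.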

25. Relation to tensor products 

The group Ext* introduced in this chapter is closely related to the tensor
product. Since an early form ([5]) of our results was formulated in terms of
tensor products, we shall briefly state the connection. Let G be any group,
A a compact zero-dimensional group, {A_α} the family of all open and closed
subgroups of A. Then the groups A/A_α and G ∘ (A/A_α) both form inverse
systems. The modified tensor product G • A is defined as the limit of the groups
G ∘ (A/A_α).

24 This construction is an exact group theoretic analog of a similar matching process
for chains, as devised by Steenrod ([9], p. 692).

Now let the group T with all elements of finite order be represented in terms
of a free group F as T = F/R. Each finite subgroup S_α then has a
representation F_α/R, and the fundamental theorem of Chapter II asserts that

(25.1) Ext {G, S_α} ≅ Hom {R, G}/Hom {F_α | R, G}.

The groups on both sides here form inverse systems, relative to the identity as
projections. Furthermore, the isomorphism of (25.1) permutes with these
projections, so that the limits of the two inverse systems in (25.1) are also isomorphic.
In view of the definition of Ext*, this gives

(25.2) Ext* {G, T} ≅ Lim [Hom {R, G}/Hom {F_α | R, G}].

Now if I is the group of integers, any element σ = Σ g_i φ_i in the tensor product
G ∘ Hom {R, I} determines in natural fashion the homomorphism θ ∈ Hom {R, G}
with θ(r) = Σ φ_i(r) g_i. By a somewhat lengthy argument, this
correspondence σ → θ can be used to “factor out” the G in (25.2) to give

(25.3) Ext* {G, T} ≅ Lim G ∘ [Hom {R, I}/Hom {F_α | R, I}].

The group in brackets here is Ext {I, F_α/R}, by the fundamental theorem on
group extensions. According to Theorem 17.1 it can be expressed as Char S_α.
Therefore (25.3) is 25

(25.4) Ext* {G, T} ≅ Lim (G ∘ Char S_α).

But the group Char S_α can, by the theory of characters (Lemma 13.2, Theorem
13.5), be rewritten as a factor group Char T/Annih S_α, where the subgroups
of the form Annih S_α in Char T are exactly the open and closed subgroups in
the zero-dimensional group Char T. Thus (25.4) may be restated in terms of
the modified tensor product, as

(25.5) Ext* {G, T} ≅ G • Char T.

The use of the “modified” tensor product is therefore equivalent to the use of
the group Ext*.

Chapter V. Abstract Complexes 

Turning now to the topological applications, we will establish the fundamental 
theorem on the decomposition of the homology groups of an infinite complex in 
terms of the integral cohomology groups of the complex. This theorem will be 
obtained in several closely related forms (Theorems 32.1, 32.2 and 34.2) for 
three different types of homology groups. The largest (or “longest”) homology 
group is that consisting of infinite cycles, with coefficients in G, reduced modulo 

“ This argument requires an application of the isomorphism theorem for inverse sys¬ 
tems, and hence rests on the fact that the isomorphism of Theorem 17.1 is “natural” in 
the sense of §12. 



GROUP EXTENSIONS AND HOMOLOGY 


799 


the subgroup of actual boundaries. Since the latter subgroup is not in general 
closed, this homology group will be only a generalized topological group. This 
suggests the introduction of the shorter “weak” homology group, which con¬ 
sists of cycles modulo “weak” boundaries; i.e. those cycles which can be re¬ 
garded as boundaries in any finite portion of the complex. The fundamental 
theorem for this type of homology group uses the group Ext/ which has been 
already analyzed. Finally, the group of cycles modulo the closure of the group 
of boundaries gives (following Lefschetz) a homology group w r hich is always 
topological; for this we derive a corresponding form of the fundamental theorem. 
Furthermore, the standard duality between homology and cohomology groups 
enables us to deduce a corresponding theorem (Theorem 33.1) for the coho¬ 
mology groups with coefficients in an arbitrary discrete group G. 

The fundamental theorem expresses a homology group by means of a group 
of homomorphisms and a group of group extensions; the latter can also be repre¬ 
sented by groups of homomorphisms, as in the basic theorem of Chapter II. 
The requisite connection between cycles of the homology group and homo¬ 
morphisms is provided by the Kronecker index (§29). 

26. Complexes 

The complexes considered here will be abstract cell complexes^{26} satisfying a
star finiteness condition. More precisely, we consider a collection $K$ of abstract
elements $\sigma^q$ called cells. With each cell there is associated an integer $q$ called
the dimension of $\sigma^q$. (There is no restriction requiring the dimension to be
non-negative.) To any two cells $\sigma_i^{q+1}$, $\sigma_j^q$ there corresponds an integer
$[\sigma_i^{q+1} : \sigma_j^q]$, called the incidence number. $K$ will be called a star finite
complex provided the incidence numbers satisfy the following two conditions:

(26.1) Given $\sigma_j^q$, $[\sigma_i^{q+1} : \sigma_j^q] \neq 0$ only for a finite number of indices $i$;

(26.2) Given $\sigma_i^{q+1}$ and $\sigma_k^{q-1}$, $\sum_j [\sigma_i^{q+1} : \sigma_j^q][\sigma_j^q : \sigma_k^{q-1}] = 0$.

Condition (26.1) is the star finiteness condition. It ensures that the summation
in (26.2) is finite.

If we consider the “incidence” matrices of integers

$A^q = \| [\sigma_i^{q+1} : \sigma_j^q] \|$

we can rewrite the two conditions as follows:

(26.1') $A^q$ is column finite;

(26.2') $A^q A^{q-1} = 0$.

Actually we could have defined a complex as a collection of matrices $\{A^q\}$,
$q = 0, \pm 1, \pm 2, \cdots$, such that (26.1') and (26.2') hold; we must assume then
that the columns of $A^q$ have the same set of labels as do the rows of $A^{q-1}$, in
order to form the product $A^q A^{q-1}$. A $q$-cell will be then either a column of $A^q$
or the corresponding row of $A^{q-1}$.

26 Essentially like those introduced by A. W. Tucker, for the case of finite complexes.
Homology and cohomology are treated as in Whitney [14].
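The matrix form of the two conditions can be checked by hand on a small finite complex. The following is a minimal sketch, assuming a hypothetical example that is not taken from the text: a full triangle with one 2-cell `f`, three 1-cells and three 0-cells, with orientations chosen for illustration. Rows of each matrix are indexed by the higher-dimensional cells, as in the definition of $A^q$.

```python
# Hypothetical example (not from the paper): a full triangle.
# A0[i][j] = [edge_i : vertex_j], A1[i][j] = [face_i : edge_j].

A0 = [  # rows: edges e01, e02, e12; columns: vertices v0, v1, v2
    [-1, 1, 0],
    [-1, 0, 1],
    [0, -1, 1],
]
A1 = [  # row: the face f; columns: edges e01, e02, e12
    [1, -1, 1],
]


def mat_mul(X, Y):
    """Product of two finite integer matrices given as lists of rows."""
    return [
        [sum(x * Y[k][j] for k, x in enumerate(row)) for j in range(len(Y[0]))]
        for row in X
    ]


def is_zero(M):
    """True when every entry of the matrix is 0."""
    return all(entry == 0 for row in M for entry in row)


# This is the product A^1 A^0 appearing in condition (26.2').
product = mat_mul(A1, A0)
```

For a finite complex the column finiteness condition (26.1') holds trivially, so only the vanishing of the product needs to be verified.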

A subset $L$ of the cells of $K$ is called an open subcomplex if $L$ contains with
each $q$-cell all incident $(q+1)$-cells; that is, if $\sigma_j^q \in L$ and
$[\sigma_i^{q+1} : \sigma_j^q] \neq 0$ imply $\sigma_i^{q+1} \in L$. The incidence matrix $A_L^q$ of $L$
is then the submatrix obtained from $A^q$ by deleting all rows and all columns
belonging to cells not in $L$. Conditions (26.1) and (26.2) automatically hold in $L$,
the latter because of the requirement that $L$ be “open.”

A subset $L \subset K$ is a closed subcomplex if $L$ contains with each $q$-cell all
incident $(q-1)$-cells; that is, if $\sigma_i^q \in L$ and $[\sigma_i^q : \sigma_j^{q-1}] \neq 0$ imply
$\sigma_j^{q-1} \in L$. The incidence matrix of $L$ is obtained as before, and the conditions
(26.1) and (26.2) again hold in $L$. Whenever $L$ is a closed subcomplex, its
complement $K - L$ is an open one, and vice versa.

A subset $L$ of $K$ will be called $q$-finite if $L$ contains only a finite number of
$q$-cells. Because $K$ is star-finite, every $(q-1)$-cell is contained in a $q$-finite
open subcomplex of $K$.
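The definitions of open and closed subcomplexes translate directly into set conditions on the nonzero incidence numbers. The sketch below assumes the same hypothetical triangle complex as an illustration; the cell names and the dictionary `INCIDENCE` are inventions for this example only.

```python
# Hypothetical triangle complex: keys are (higher cell, lower cell) pairs
# with nonzero incidence number.
INCIDENCE = {
    ("e01", "v0"): -1, ("e01", "v1"): 1,
    ("e02", "v0"): -1, ("e02", "v2"): 1,
    ("e12", "v1"): -1, ("e12", "v2"): 1,
    ("f", "e01"): 1, ("f", "e02"): -1, ("f", "e12"): 1,
}
K = {"v0", "v1", "v2", "e01", "e02", "e12", "f"}


def is_open(L):
    """L contains, with each cell, every incident cell one dimension higher."""
    return all(big in L for (big, small) in INCIDENCE if small in L)


def is_closed(L):
    """L contains, with each cell, every incident cell one dimension lower."""
    return all(small in L for (big, small) in INCIDENCE if big in L)
```

On this example a single vertex is closed but not open, and its complement is open, in agreement with the remark that complements of closed subcomplexes are open.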

27. Homology and cohomology groups 

Let $G$ be an abelian group. A $q$-dimensional chain $c^q$ in $K$ with coefficients
in $G$ is a function which associates to every $q$-cell $\sigma_i^q$ in $K$ an element $g_i$ of $G$.
We write $c^q$ as a formal infinite sum

$c^q = \sum_i g_i \sigma_i^q$.

The sum of two chains $\sum g_i \sigma_i^q$ and $\sum h_i \sigma_i^q$ is the chain
$\sum (g_i + h_i)\sigma_i^q$, and the chains form a group denoted by $C^q(K, G)$. If
$g_i \neq 0$ for only a finite number of indices $i$ then the chain $c^q$ is finite. The
finite chains form a subgroup $\mathcal{C}_q(K, G)$ of $C^q$.

The coboundary $\delta c^q$ of a finite chain $c^q = \sum g_j \sigma_j^q$ is defined as

$\delta c^q = \sum_i \Bigl( \sum_j [\sigma_i^{q+1} : \sigma_j^q]\, g_j \Bigr) \sigma_i^{q+1}$.

Because of (26.1) $\delta c^q$ is a finite $(q+1)$-chain, while, because of (26.2), $\delta\delta c^q = 0$.
The operation $\delta$ is a homomorphic mapping of $\mathcal{C}_q$ into $\mathcal{C}_{q+1}$. The kernel of this
homomorphism is a subgroup $\mathcal{Z}_q(K, G)$ of $\mathcal{C}_q$. The chains of $\mathcal{Z}_q$ are called
(finite) cocycles:

$\mathcal{Z}_q(K, G) = $ [all finite $q$-chains $c^q$ with $\delta c^q = 0$].

A coboundary is a $q$-chain of the form $\delta d^{q-1}$ for some $d^{q-1} \in \mathcal{C}_{q-1}$; these
coboundaries form a subgroup

$\mathcal{B}_q(K, G) = $ [all finite chains $\delta d^{q-1}$].

From the relation $\delta\delta = 0$ it follows that $\mathcal{B}_q \subset \mathcal{Z}_q$. The corresponding factor
group

$\mathcal{H}_q(K, G) = \mathcal{Z}_q(K, G)/\mathcal{B}_q(K, G)$

is called the $q$th cohomology group of finite cocycles of $K$ with coefficients in $G$.
We also define the co-torsion group $\mathcal{T}_q(K, G)$ as the subgroup of all elements of
finite order in $\mathcal{H}_q(K, G)$.

For a chain $c^q = \sum g_i \sigma_i^q$ of $C^q(K, G)$ we also define the boundary

$\partial c^q = \sum_j \Bigl( \sum_i [\sigma_i^q : \sigma_j^{q-1}]\, g_i \Bigr) \sigma_j^{q-1}$.

It again follows from (26.1) that $\partial c^q$ is a well defined chain of $C^{q-1}(K, G)$ and
from (26.2) that $\partial\partial c^q = 0$. The operation $\partial$ is a homomorphic mapping of $C^q$
into $C^{q-1}$. The kernel of this homomorphism is a subgroup $Z^q(K, G)$ of $C^q$.
The chains of $Z^q$ are called cycles:

$Z^q(K, G) = $ [all chains $c^q$ with $\partial c^q = 0$].

The chains of the form $\partial d^{q+1}$ where $d^{q+1} \in C^{q+1}$ are the boundaries. They form
a subgroup

$B^q(K, G) = $ [all chains $c^q = \partial d^{q+1}$].

Because $\partial\partial = 0$ it follows that $B^q \subset Z^q$. The group

$H^q(K, G) = Z^q(K, G)/B^q(K, G)$

is called the $q$th homology group of $K$ with coefficients in $G$.

Let $L$ be a (closed or open) subcomplex of $K$. Each chain $c^q$ in $K$, considered
as a function on the $q$-cells, defines a corresponding chain $c_L^q$ in $L$. If
$c^q = \sum g_i \sigma_i^q$, then $c_L^q$ is the sum found by deleting all terms $g_i \sigma_i^q$ for which
$\sigma_i^q$ is not in $L$. If $L$ is open, then $\partial_L(c_L^q) = (\partial c^q)_L$, so that one can establish the
following facts.

Lemma 27.1. $c^q \in Z^q(K, G)$ if and only if $c_L^q \in Z^q(L, G)$ for every $q$-finite open
subcomplex $L$ of $K$.

Lemma 27.2. If $c^q \in B^q(K, G)$ then $c_L^q \in B^q(L, G)$, provided $L$ is an open
subcomplex of $K$.
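The identity $\partial_L(c_L^q) = (\partial c^q)_L$ for open $L$, on which both lemmas rest, can be illustrated numerically. The sketch below is an assumption-laden toy: a hypothetical hollow triangle (three edges, three vertices) with integer coefficients, where restriction commutes with the boundary for an open subcomplex and fails to do so for a closed one that is not open.

```python
# Hypothetical hollow triangle: (edge, vertex) -> incidence number.
INCIDENCE = {
    ("e01", "v0"): -1, ("e01", "v1"): 1,
    ("e02", "v0"): -1, ("e02", "v2"): 1,
    ("e12", "v1"): -1, ("e12", "v2"): 1,
}
VERTICES = ["v0", "v1", "v2"]


def restrict(chain, L):
    """Delete all terms of the chain on cells that are not in L."""
    return {cell: g for cell, g in chain.items() if cell in L}


def boundary(chain, vertices):
    """Boundary of a 1-chain, computed on the given list of 0-cells."""
    return {
        v: sum(INCIDENCE.get((e, v), 0) * g for e, g in chain.items())
        for v in vertices
    }


def commutes(c, L):
    """Compare the boundary taken inside L with the restricted boundary."""
    verts_in_L = [v for v in VERTICES if v in L]
    left = boundary(restrict(c, L), verts_in_L)   # boundary in L of c_L
    right = restrict(boundary(c, VERTICES), L)    # restriction of boundary of c
    return left == right


c = {"e01": 1, "e02": 1, "e12": 1}
open_L = {"v1", "v2", "e01", "e02", "e12"}   # complement of the vertex v0
closed_L = {"v0", "v1", "e01"}               # closed, but not open
```

Here `commutes(c, open_L)` holds while `commutes(c, closed_L)` does not, because the closed subcomplex omits edges incident to its vertices.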

A statement analogous to Lemma 27.1 concerning $B^q$ is not generally true.
In this connection we define the group $B_w^q(K, G)$ of the weak boundaries as
follows: $c^q \in B_w^q(K, G)$ provided $c_L^q \in B^q(L, G)$ for every $q$-finite open subcomplex
$L$ of $K$. For each such open subcomplex $L$ we can construct a subcomplex $L'$
consisting of all $q$-cells of $L$, all those $(q+1)$-cells of $L$ which lie on coboundaries
of $q$-cells of $L$, and all $(q+i)$-cells of $K$, for $i > 1$. This subcomplex $L'$ is
open, is both $q$- and $(q+1)$-finite, and has $B^q(L, G) = B^q(L', G)$. Hence we
conclude that $c^q \in B_w^q(K, G)$ if and only if $c_L^q \in B^q(L, G)$ for every open
subcomplex $L$ of $K$ which is both $q$- and $(q+1)$-finite. Clearly $B^q = B_w^q$ when $K$
itself is $q$-finite.

It follows from Lemmas 27.1 and 27.2 that

$B^q(K, G) \subset B_w^q(K, G) \subset Z^q(K, G)$.





The factor group

$H_w^q(K, G) = Z^q(K, G)/B_w^q(K, G)$

will be called the weak $q$th homology group of $K$ with coefficients in $G$. Clearly
$H^q = H_w^q$ when $K$ is $q$-finite.

Lemma 27.3. $c^q \in B_w^q(K, G)$ if and only if for each finite subset $M$ of $K$ there
is a chain $c_1^q$ in $K - M$ such that $c^q - c_1^q \in B^q(K, G)$.

Proof. Suppose that $c^q \in B_w^q$. Given the finite set $M$ there is a $q$-finite open
subcomplex $L$ containing $M$. Since $c_L^q \in B^q(L, G)$ there is a $d^{q+1}$ in $L$ such that
$(\partial d^{q+1})_L = c_L^q$. Set $c_1^q = c^q - \partial d^{q+1}$. Clearly $c^q - c_1^q \in B^q$ and
$(c_1^q)_L = c_L^q - (\partial d^{q+1})_L = 0$, hence $c_1^q \subset K - L \subset K - M$.

Suppose now that $c^q$ satisfies the condition of Lemma 27.3. Given a $q$-finite
open subcomplex $L$ of $K$ there is a $c_1^q$ in $K - L$ such that $c^q - c_1^q \in B^q(K, G)$.
There is then a $d^{q+1}$ such that $\partial d^{q+1} = c^q - c_1^q$. Since $L$ is open we have

$\partial_L(d_L^{q+1}) = (\partial d^{q+1})_L = c_L^q - (c_1^q)_L = c_L^q$;

therefore $c_L^q \in B^q(L, G)$.

28. Topology in the homology groups 

The group of $q$-chains $C^q(K, G)$ is isomorphic with $\prod_i G_i$, where $G_i = G$ and
the set of indices $i$ is in a 1-1 correspondence with the set of $q$-cells $\sigma_i^q$. Hence,
if $G$ is a generalized topological group, we can consider $C^q(K, G)$ as a generalized
topological group, under the direct product topology, as defined in §1. If $G$ is
topological or compact, then $C^q(K, G)$ is also topological or compact, as the
case may be.

The boundary operator $\partial$, regarded as a homomorphism of $C^q$ into $C^{q-1}$, is
continuous. Since $Z^q$ is the group mapped into 0 by $\partial$, we obtain

Lemma 28.1. If $G$ is topological then $Z^q(K, G)$ is a closed subgroup of $C^q(K, G)$.

From Lemma 27.3 we deduce

Lemma 28.2. $B^q(K, G) \subset B_w^q(K, G) \subset \bar{B}^q(K, G)$.

The homology groups $H^q = Z^q/B^q$ and $H_w^q = Z^q/B_w^q$ as factor groups of
generalized topological groups are generalized topological groups; this is the way
they will be considered in the rest of this paper. Even in the case when $G$ and
consequently $Z^q$ is topological the groups $H^q$ and $H_w^q$ may be only generalized
topological groups, for $B^q$ and $B_w^q$ need not be closed subgroups of $Z^q$.

If $G$ is compact and topological, then $Z^q(K, G)$ and $C^{q+1}(K, G)$ are compact;
since $B^q(K, G)$ is a continuous image of $C^{q+1}$ (under the operation $\partial$), $B^q(K, G)$
is compact and therefore closed (see Lemma 1.1). Consequently we obtain

Lemma 28.3. If $G$ is compact and topological, then $B^q(K, G) = B_w^q(K, G) =
\bar{B}^q(K, G)$, $H^q(K, G) = H_w^q(K, G)$ and the groups are all compact and topological.

Despite the fact that $\mathcal{C}_q$ is a subgroup of the generalized topological group $C^q$
we consider $\mathcal{C}_q$ discrete and consequently the cohomology groups $\mathcal{H}_q(K, G)$ are
taken discrete.





29. The Kronecker index 

Let $G$ be a generalized topological group, $H$ a discrete group and assume that
a product $\phi(g, h) \in J$ is given pairing $G$ and $H$ to a group $J$ (see §13).

Given two chains

$c^q = \sum g_i \sigma_i^q \in C^q(K, G)$, $d^q = \sum h_i \sigma_i^q \in \mathcal{C}_q(K, H)$,

we define the Kronecker index as

$c^q \cdot d^q = \sum_i \phi(g_i, h_i) \in J$;

the summation is finite since $d^q$ is a finite chain. We verify at once that in
this way the groups $C^q(K, G)$ and $\mathcal{C}_q(K, H)$ are paired to $J$.

Given $c^{q+1} \in C^{q+1}(K, G)$ and $d^q \in \mathcal{C}_q(K, H)$ we have

(29.1) $(\partial c^{q+1}) \cdot d^q = c^{q+1} \cdot (\delta d^q)$.

This is a restatement of the associative law for matrix multiplication, since
the operator $\partial$ is essentially a postmultiplication by the incidence matrix, while
the coboundary operator $\delta$ is a premultiplication by the same matrix.
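The matrix reading of (29.1) can be made concrete. In the following minimal sketch the incidence matrix, the chains, and the choice of pairing (integers paired with integers by multiplication) are all assumptions chosen for illustration; the boundary is computed as a row vector times the matrix and the coboundary as the matrix times a column vector.

```python
# Hypothetical incidence matrix A^1: one 2-cell (row), three 1-cells (columns).
A1 = [[1, -1, 1]]


def boundary(c, A):
    """c has one coefficient per row of A; returns the row vector c A."""
    return [sum(c[i] * A[i][j] for i in range(len(A))) for j in range(len(A[0]))]


def coboundary(d, A):
    """d has one coefficient per column of A; returns the column vector A d."""
    return [sum(A[i][j] * d[j] for j in range(len(d))) for i in range(len(A))]


def kronecker(c, d):
    """Kronecker index for the pairing phi(g, h) = g * h of integers."""
    return sum(g * h for g, h in zip(c, d))


c2 = [5]          # a 2-chain
d1 = [2, 3, 7]    # a finite 1-chain

lhs = kronecker(boundary(c2, A1), d1)      # (boundary of c2) . d1
rhs = kronecker(c2, coboundary(d1, A1))    # c2 . (coboundary of d1)
```

Both sides evaluate the same triple product $c\,A\,d$, bracketed in the two possible ways.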

We now examine the annihilators relative to the Kronecker index.

(29.2) $\mathcal{Z}_q(K, H) \subset \operatorname{Annih} B_w^q(K, G) \subset \operatorname{Annih} B^q(K, G)$.

(29.3) $Z^q(K, G) \subset \operatorname{Annih} \mathcal{B}_q(K, H)$.

Proof. Let $z^q \in B_w^q$ and $w^q \in \mathcal{Z}_q$. Since $w^q$ is finite there is a finite subset
$M$ of $K$ such that $w^q \subset M$. In view of Lemma 27.3 there is a cycle $z_1^q \subset K - M$
and a chain $c^{q+1} \in C^{q+1}(K, G)$ such that $\partial c^{q+1} = z^q - z_1^q$. Consequently

$z^q \cdot w^q = (z^q - z_1^q) \cdot w^q = (\partial c^{q+1}) \cdot w^q = c^{q+1} \cdot \delta w^q = c^{q+1} \cdot 0 = 0$.

Therefore $\mathcal{Z}_q \subset \operatorname{Annih} B_w^q$. The proof of (29.3) is analogous.

It follows from (29.2) and (29.3) that

(29.4) $H^q(K, G)$ and $\mathcal{H}_q(K, H)$ are paired to $J$,

(29.5) $H_w^q(K, G)$ and $\mathcal{H}_q(K, H)$ are paired to $J$.

Lemma 29.1. If $G$ and $H$ are dually paired to $J$ then, relative to the Kronecker
index,

(29.6) $C^q(K, G)$ and $\mathcal{C}_q(K, H)$ are dually paired to $J$,

(29.7) $\mathcal{Z}_q(K, H) = \operatorname{Annih} B_w^q(K, G) = \operatorname{Annih} B^q(K, G)$,

(29.8) $Z^q(K, G) = \operatorname{Annih} \mathcal{B}_q(K, H)$.

Proof. Given $c^q = \sum g_i \sigma_i^q \neq 0$ in $C^q$, we have $g_{i_0} \neq 0$ for some $i_0$. Select
$h \in H$ so that $\phi(g_{i_0}, h) \neq 0$. Consider the chain $d^q = h\sigma_{i_0}^q$. Then
$c^q \cdot d^q = \phi(g_{i_0}, h) \neq 0$. This proves that $\operatorname{Annih} \mathcal{C}_q(K, H) = 0$. Similarly we prove
that $\operatorname{Annih} C^q(K, G) = 0$. This establishes (29.6).





Let $d^q \in \operatorname{Annih} B^q(K, G)$. Hence $c^{q+1} \cdot (\delta d^q) = (\partial c^{q+1}) \cdot d^q = 0$ for every $c^{q+1}$,
and therefore $\delta d^q = 0$, in view of (29.6). This shows that $\operatorname{Annih} B^q \subset \mathcal{Z}_q$,
which, together with (29.2), gives (29.7).

The proof of (29.8) is analogous to the previous one.

We remark that even when the pairing of the coefficient groups $G$ and $H$ is
dual, the pairing (29.4) or (29.5) of the homology and cohomology groups need
not be dual, as observed by Whitney ([14], p. 42).

We shall be especially interested in the pairing of $G$ with the group $I$ of
integers to $G$ by means of the product $\phi(g, m) = mg$. This pairing has the
property that $\operatorname{Annih} I = 0$. This is half of the definition of a dual pairing; the other
half ($\operatorname{Annih} G = 0$) may fail in case the order of every element in $G$ divides a
fixed integer $m$. Nevertheless the argument for Lemma 29.1 shows in this case
that

(29.6') $\operatorname{Annih} \mathcal{C}_q(K, I) = 0$,

(29.8') $Z^q(K, G) = \operatorname{Annih} \mathcal{B}_q(K, I)$.

We now introduce a subgroup of the group of cycles by the following definition:

(29.9) $A^q(K, G) = \operatorname{Annih} \mathcal{Z}_q(K, I)$;

in other words, $c^q \in A^q(K, G)$ if and only if $c^q \cdot w^q = 0$ for every finite integral
cocycle $w^q$. The position of this group $A^q$ may be described as follows:

(29.10) $B_w^q(K, G) \subset A^q(K, G) \subset Z^q(K, G)$.

By (29.2) we have $\mathcal{Z}_q \subset \operatorname{Annih} B_w^q$; consequently $B_w^q \subset \operatorname{Annih} \mathcal{Z}_q = A^q$.
Since $\mathcal{B}_q \subset \mathcal{Z}_q$, we have $A^q = \operatorname{Annih} \mathcal{Z}_q \subset \operatorname{Annih} \mathcal{B}_q = Z^q$ by (29.8').

Lemma 29.2. If $G$ is a topological group, $A^q(K, G)$ is closed.

This follows immediately from the continuity of the Kronecker index.

In case $G$ is topological, the various subgroups of cycles of $C^q(K, G)$ are
therefore related as follows:

$B^q \subset B_w^q \subset \bar{B}^q \subset A^q = \bar{A}^q \subset Z^q = \bar{Z}^q \subset C^q$.


30. Construction of homomorphisms 

The essential device of this chapter is that of using the Kronecker index to
generate homomorphisms. For a given chain $c^q \in C^q(K, G)$ define $\theta_{c^q}$ by

(30.1) $\theta_{c^q}(d^q) = c^q \cdot d^q$, $d^q \in \mathcal{C}_q(K, I)$.

Lemma 30.1. The correspondence $c^q \to \theta_{c^q}$ establishes an isomorphism
$C^q(K, G) \cong \operatorname{Hom}\{\mathcal{C}_q(K, I), G\}$.

Proof. It is clear that $\theta_{c^q} \in \operatorname{Hom}\{\mathcal{C}_q, G\}$, and that the correspondence
$c^q \to \theta_{c^q}$ preserves sums. Also, if $\theta_{c^q} = 0$ then $c^q \cdot d^q = 0$ for all $d^q \in \mathcal{C}_q$ and
consequently $c^q = 0$. Conversely, given $\theta \in \operatorname{Hom}\{\mathcal{C}_q, G\}$, define

(30.2) $c^q = \sum_i \theta(\sigma_i^q)\, \sigma_i^q$.





Clearly $c^q \in C^q(K, G)$, while, for any given $d^q = \sum m_i \sigma_i^q \in \mathcal{C}_q$, we have

$\theta_{c^q}(d^q) = c^q \cdot d^q = \sum_i m_i\, \theta(\sigma_i^q) = \theta(d^q)$.

This establishes the algebraic part of the Lemma.

We now recall that

$C^q(K, G) = \prod_i G_i$

where $G_i = G$ and the subscripts $i$ are in a 1-1 correspondence with the $q$-cells $\sigma_i^q$.
On the other hand, since the $\{\sigma_i^q\}$ constitute a set of generators for $\mathcal{C}_q(K, I)$,
we have

$\operatorname{Hom}\{\mathcal{C}_q(K, I), G\} \cong \prod_i G_i$.

Both these isomorphisms are bicontinuous, hence the combined isomorphism,
which is precisely the isomorphism $c^q \leftrightarrow \theta_{c^q}$, is also bicontinuous.
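The two directions of the correspondence in Lemma 30.1, formulas (30.1) and (30.2), can be sketched in a few lines. This is only an illustration under stated assumptions: the coefficient group is taken to be the integers mod 6, the complex has finitely many hypothetical $q$-cells `s1`, `s2`, `s3`, and chains are stored as dictionaries from cell names to coefficients.

```python
N = 6  # hypothetical coefficient group G = integers mod 6


def theta_from_chain(c):
    """Formula (30.1): the homomorphism d -> c . d on finite integral chains."""
    def theta(d):
        return sum(c.get(cell, 0) * m for cell, m in d.items()) % N
    return theta


def chain_from_theta(theta, cells):
    """Formula (30.2): the chain whose coefficient on each cell is theta(cell)."""
    return {cell: theta({cell: 1}) for cell in cells}


cells = ["s1", "s2", "s3"]
c = {"s1": 2, "s2": 5, "s3": 0}

theta = theta_from_chain(c)
recovered = chain_from_theta(theta, cells)   # should reproduce c
value = theta({"s1": 3, "s2": 1})            # 3*2 + 1*5 reduced mod 6
```

Evaluating the homomorphism on the generators recovers the chain, which is the content of the converse half of the proof.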

Lemma 30.2. $\mathcal{Z}_q(K, I)$ is a direct factor of $\mathcal{C}_q(K, I)$.

Proof. The coboundary operator $\delta$ maps $\mathcal{C}_q$ onto $\mathcal{B}_{q+1}$ and the kernel is $\mathcal{Z}_q$.
Hence $\mathcal{C}_q$ is a group extension of $\mathcal{Z}_q$ by $\mathcal{B}_{q+1}$. As a subgroup of the free group
$\mathcal{C}_{q+1}$ the group $\mathcal{B}_{q+1}$ is free (Lemma 4.1) and therefore the group extension is
trivial (Theorem 7.2). Hence $\mathcal{C}_q$ is the direct product of $\mathcal{Z}_q$ and a subgroup
isomorphic with $\mathcal{B}_{q+1}$.

Theorem 30.3. $A^q(K, G)$ is a direct factor of $Z^q(K, G)$ and of $C^q(K, G)$.

Proof. Since $A^q \subset Z^q$ it will be sufficient to show that $A^q$ is a direct factor
of $C^q$. In the group $\operatorname{Hom}\{\mathcal{C}_q(K, I), G\}$ consider the subgroup $A$ of those
homomorphisms that annihilate $\mathcal{Z}_q$. Since $\mathcal{Z}_q$ is a direct factor of $\mathcal{C}_q$, $A$ is a direct
factor of $\operatorname{Hom}\{\mathcal{C}_q, G\}$. However, under the isomorphism $\theta_{c^q} \to c^q$ of Lemma
30.1 the group $A$ is mapped onto $A^q(K, G) = \operatorname{Annih} \mathcal{Z}_q$, hence the conclusion.
This proof also shows (Lemma 3.3) that

(30.3) $A^q(K, G) \cong \operatorname{Hom}\{\mathcal{C}_q(K, I)/\mathcal{Z}_q(K, I), G\}$.

Theorem 30.3 leads to the following direct product decompositions of the
homology groups:

(30.4) $H^q(K, G) \cong (Z^q/A^q) \times (A^q/B^q)$,

(30.5) $H_w^q(K, G) \cong (Z^q/A^q) \times (A^q/B_w^q)$.

We proceed with the study of the first factor, $Z^q/A^q$.

Theorem 30.4. The correspondence $c^q \to \theta_{c^q}$ establishes an isomorphism

$Z^q(K, G)/A^q(K, G) \cong \operatorname{Hom}\{\mathcal{H}_q(K, I), G\}$.

Proof. Since $Z^q = \operatorname{Annih} \mathcal{B}_q$, by (29.8'), it follows that under the
isomorphism $c^q \to \theta_{c^q}$ the group $Z^q$ is mapped onto the subgroup of $\operatorname{Hom}\{\mathcal{C}_q, G\}$
consisting of those homomorphisms annihilating $\mathcal{B}_q$. By Lemma 3.3 the latter
subgroup can be identified with $\operatorname{Hom}\{\mathcal{C}_q/\mathcal{B}_q, G\}$, so $Z^q \cong \operatorname{Hom}\{\mathcal{C}_q/\mathcal{B}_q, G\}$.
On the other hand, $\mathcal{Z}_q/\mathcal{B}_q$ is a direct factor of $\mathcal{C}_q/\mathcal{B}_q$, so that Lemma 3.4 shows
that $\operatorname{Hom}\{\mathcal{Z}_q/\mathcal{B}_q, G\}$ is a factor group of $\operatorname{Hom}\{\mathcal{C}_q/\mathcal{B}_q, G\}$, corresponding
to the subgroup consisting of homomorphisms annihilating $\mathcal{Z}_q/\mathcal{B}_q$. This
subgroup in turn corresponds to the subgroup $A^q$ of $Z^q$, hence

$Z^q/A^q \cong \operatorname{Hom}\{\mathcal{Z}_q/\mathcal{B}_q, G\}$.

This is the desired conclusion.


31. Study of $A^q$

The correspondence $c^q \to \theta_{c^q}$ of Lemma 30.1 maps the group $A^q$ of annihilators
of cocycles onto the group of those homomorphisms of $\mathcal{C}_q$ into $G$ which carry $\mathcal{Z}_q$
into zero. As observed in Lemma 3.3, the latter group is isomorphic to
$\operatorname{Hom}\{\mathcal{C}_q/\mathcal{Z}_q, G\}$. Since $\mathcal{C}_q/\mathcal{Z}_q \cong \mathcal{B}_{q+1}$, this gives the isomorphism

(31.1) $A^q(K, G) \cong \operatorname{Hom}\{\mathcal{B}_{q+1}(K, I), G\}$.

An examination of this construction shows that the homomorphism
corresponding to a given $z^q \in A^q$ is determined as follows. For each $d^{q+1} \in \mathcal{B}_{q+1}$ choose
a $d^q \in \mathcal{C}_q(K, I)$ for which $\delta d^q = d^{q+1}$, and define^{27}

$\phi_{z^q}(d^{q+1}) = z^q \cdot d^q$.

Because $z^q$ is in $A^q$, this result is independent of the choice of $d^q$ for given $d^{q+1}$.
Furthermore $\phi_{z^q}$ is a homomorphism of $\mathcal{B}_{q+1}$ into $G$, and it is obtained from
$\theta_{z^q}$ by the process indicated above, for one has

$\phi_{z^q}(\delta d^q) = \theta_{z^q}(d^q)$.

27 Notice the analogy with the definition of the so-called “linking coefficient” (cf.
Lefschetz [7], Ch. III).

We therefore have the following result.

Lemma 31.1. The correspondence $z^q \to \phi_{z^q}$ establishes the (bicontinuous)
isomorphism (31.1).

The properties of this isomorphism can be collected in the following

Theorem 31.2. The isomorphism $z^q \to \phi_{z^q}$ induces the isomorphisms

$A^q(K, G)/B^q(K, G) \cong \operatorname{Hom}\{\mathcal{B}_{q+1}, G\}/\operatorname{Hom}\{\mathcal{Z}_{q+1} \mid \mathcal{B}_{q+1}, G\}$,

$B_w^q(K, G)/B^q(K, G) \cong \operatorname{Hom}_f\{\mathcal{B}_{q+1}, G; \mathcal{Z}_{q+1}\}/\operatorname{Hom}\{\mathcal{Z}_{q+1} \mid \mathcal{B}_{q+1}, G\}$,

$A^q(K, G)/B_w^q(K, G) \cong \operatorname{Hom}\{\mathcal{B}_{q+1}, G\}/\operatorname{Hom}_f\{\mathcal{B}_{q+1}, G; \mathcal{Z}_{q+1}\}$,

where $\mathcal{B}_{q+1} = \mathcal{B}_{q+1}(K, I)$ and $\mathcal{Z}_{q+1} = \mathcal{Z}_{q+1}(K, I)$.

Proof. We shall show that the groups $B^q(K, G)$ and $B_w^q(K, G)$ are mapped
onto $\operatorname{Hom}\{\mathcal{Z}_{q+1} \mid \mathcal{B}_{q+1}, G\}$ and $\operatorname{Hom}_f\{\mathcal{B}_{q+1}, G; \mathcal{Z}_{q+1}\}$, respectively.

Assume that $z^q \in B^q(K, G)$; then $\partial z^{q+1} = z^q$ for some $z^{q+1} \in C^{q+1}(K, G)$. Define

$\phi^*(d^{q+1}) = z^{q+1} \cdot d^{q+1}$; $d^{q+1} \in \mathcal{C}_{q+1}$.

Clearly $\phi^* \in \operatorname{Hom}\{\mathcal{C}_{q+1}, G\}$. If $d^{q+1} = \delta d^q$ then

$\phi^*(d^{q+1}) = z^{q+1} \cdot d^{q+1} = z^{q+1} \cdot \delta d^q = \partial z^{q+1} \cdot d^q = z^q \cdot d^q = \phi_{z^q}(d^{q+1})$.

Hence $\phi^*$ is an extension of $\phi_{z^q}$ to $\mathcal{C}_{q+1}$ and in particular also to $\mathcal{Z}_{q+1}$.

Suppose conversely that $\phi_{z^q}$ can be extended to $\mathcal{Z}_{q+1}$. Since $\mathcal{Z}_{q+1}$ is a direct
factor of $\mathcal{C}_{q+1}$ (Lemma 30.2) we may then find an extension $\phi^*$ of $\phi_{z^q}$ to $\mathcal{C}_{q+1}$.
Define

$z^{q+1} = \sum_i \phi^*(\sigma_i^{q+1})\, \sigma_i^{q+1}$.

Clearly $z^{q+1} \in C^{q+1}(K, G)$ and $z^{q+1} \cdot \sigma_i^{q+1} = \phi^*(\sigma_i^{q+1})$ and hence
$z^{q+1} \cdot d^{q+1} = \phi^*(d^{q+1})$ for all $d^{q+1} \in \mathcal{C}_{q+1}$. Consequently

$\partial z^{q+1} \cdot \sigma_i^q = z^{q+1} \cdot \delta\sigma_i^q = \phi^*(\delta\sigma_i^q) = \phi_{z^q}(\delta\sigma_i^q) = z^q \cdot \sigma_i^q$.

Since this holds for every $\sigma_i^q$ we have $\partial z^{q+1} = z^q \in B^q(K, G)$.

Suppose $z^q \in B_w^q(K, G)$. In view of Lemma 5.1 it is sufficient to prove that
if the cocycle $m d^{q+1} \in \mathcal{B}_{q+1}(K)$ then $\phi_{z^q}(m d^{q+1})$ is divisible by $m$. Let
$\delta d^q = m d^{q+1}$ and let $M$ be a finite subset of $K$ such that $d^q \subset M$. In view of Lemma
27.3 there is a chain $z_1^q \subset K - M$ such that $z^q - z_1^q = z_2^q \in B^q(K, G)$. It follows
that $z_2^q \in A^q$ and so that $z_1^q \in A^q$, hence $\phi_{z_1^q}$ and $\phi_{z_2^q}$ are defined and
$\phi_{z^q} = \phi_{z_1^q} + \phi_{z_2^q}$. Since $z_2^q \in B^q(K, G)$, then, as we just proved, $\phi_{z_2^q}$ can be
extended to $\mathcal{Z}_{q+1}$ and therefore $\phi_{z_2^q}(m d^{q+1})$ must be divisible by $m$. Since
$d^q \subset M$ and $z_1^q \subset K - M$ we have $\phi_{z_1^q}(m d^{q+1}) = z_1^q \cdot d^q = 0$. Hence
$\phi_{z^q}(m d^{q+1})$ is divisible by $m$.

Suppose conversely that $\phi_{z^q}$ can be extended to every subgroup of $\mathcal{Z}_{q+1}(K, I)$
of finite order over $\mathcal{B}_{q+1}(K, I)$. Then, as in Lemma 5.2, $\phi_{z^q}$ can also be
extended to every subgroup $\mathcal{D}$ of $\mathcal{Z}_{q+1}(K, I)$ such that $\mathcal{D}/\mathcal{B}_{q+1}$ has a finite number
of generators. Now let $L$ be any open subcomplex of $K$ which is both $q$- and
$(q+1)$-finite; there is then an extension of $\phi_{z^q}$ to the group generated by
$\mathcal{B}_{q+1}(K, I)$ and $\mathcal{Z}_{q+1}(L, I)$. But in the complex $L$ the homomorphism $\phi_{y^q}$
induced by $y^q = z_L^q$ agrees on $\mathcal{B}_{q+1}(L, I)$ with the homomorphism $\phi_{z^q}$. Therefore
$\phi_{y^q} \in \operatorname{Hom}\{\mathcal{B}_{q+1}(L, I), G\}$ has an extension to $\mathcal{Z}_{q+1}(L, I)$. In view of what
we proved before, we therefore have $y^q = z_L^q \in B^q(L, G)$. Since this holds for
each $L$ considered, $z^q \in B_w^q(K, G)$. This concludes the proof of Theorem 31.2.

In this theorem the factor homomorphism groups on the right can be
reinterpreted as groups of group extensions, in accord with the results of Chapter II.

Theorem 31.3. The isomorphism $z^q \leftrightarrow \phi_{z^q}$ combined with the isomorphisms
establishing relations between group extensions and homomorphisms leads to the
following isomorphisms:

(31.2) $A^q(K, G)/B^q(K, G) \cong \operatorname{Ext}\{G, \mathcal{H}_{q+1}\}$,

(31.3) $B_w^q(K, G)/B^q(K, G) \cong \operatorname{Ext}_f\{G, \mathcal{H}_{q+1}\}$,

(31.4) $A^q(K, G)/B_w^q(K, G) \cong \operatorname{Ext}\{G, \mathcal{T}_{q+1}\}/\operatorname{Ext}_f\{G, \mathcal{T}_{q+1}\}$,

where $\mathcal{H}_{q+1} = \mathcal{H}_{q+1}(K, I)$ and $\mathcal{T}_{q+1} = \mathcal{T}_{q+1}(K, I)$ is the corresponding co-torsion
group.

The isomorphisms established so far have all been bicontinuous.





32. Computation of the homology groups 

As we have shown in §29, the Kronecker index establishes a pairing of the
group $H^q(K, G)$ or $H_w^q(K, G)$ with the group $\mathcal{H}_q(K, I)$, the values of the products
being in the group $G$. Accordingly we define the following subhomology groups:

(32.1) $Q^q(K, G) = \operatorname{Annih} \mathcal{H}_q(K, I)$ in $H^q(K, G)$,

(32.2) $Q_w^q(K, G) = \operatorname{Annih} \mathcal{H}_q(K, I)$ in $H_w^q(K, G)$.

We verify at once that $Q^q = A^q/B^q$ and $Q_w^q = A^q/B_w^q$. Consequently the
results of the last two sections furnish the following two basic theorems:

Theorem 32.1. For a star finite complex $K$ the homology group $H^q(K, G)$ of
infinite cycles with coefficients in a generalized topological group $G$ can be expressed
in terms of the integral cohomology groups $\mathcal{H}_q = \mathcal{H}_q(K, I)$ and $\mathcal{H}_{q+1} = \mathcal{H}_{q+1}(K, I)$
of finite cocycles. The explicit relation is

(32.3) $H^q(K, G) \cong \operatorname{Hom}\{\mathcal{H}_q, G\} \times \operatorname{Ext}\{G, \mathcal{H}_{q+1}\}$.

More explicitly, $H^q$ has a subgroup $Q^q$, defined by (32.1), where

(32.4) $Q^q(K, G)$ is a direct factor of $H^q(K, G)$,

(32.5) $Q^q(K, G) \cong \operatorname{Ext}\{G, \mathcal{H}_{q+1}\}$,

(32.6) $H^q(K, G)/Q^q(K, G) \cong \operatorname{Hom}\{\mathcal{H}_q, G\}$.

Theorem 32.2. For a star finite complex $K$ the weak homology group $H_w^q(K, G)$
of infinite cycles with coefficients in a generalized topological group $G$ can be
expressed in terms of the integral cohomology group $\mathcal{H}_q = \mathcal{H}_q(K, I)$ and the integral
co-torsion group $\mathcal{T}_{q+1} = \mathcal{T}_{q+1}(K, I)$ of finite cocycles. The explicit relation is

(32.7) $H_w^q(K, G) \cong \operatorname{Hom}\{\mathcal{H}_q, G\} \times (\operatorname{Ext}\{G, \mathcal{T}_{q+1}\}/\operatorname{Ext}_f\{G, \mathcal{T}_{q+1}\})$.

More explicitly, $H_w^q$ has a subgroup $Q_w^q$, defined by (32.2), where

(32.8) $Q_w^q(K, G)$ is a direct factor of $H_w^q(K, G)$,

(32.9) $Q_w^q(K, G) \cong \operatorname{Ext}\{G, \mathcal{T}_{q+1}\}/\operatorname{Ext}_f\{G, \mathcal{T}_{q+1}\}$,

(32.10) $H_w^q(K, G)/Q_w^q(K, G) \cong \operatorname{Hom}\{\mathcal{H}_q, G\}$.

Both factors in (32.3) and (32.7) are generalized topological groups and the
isomorphisms are bicontinuous.

If $G$ is topological then by Corollary 3.2 the group $\operatorname{Hom}\{\mathcal{H}_q, G\}$ is topological.
If we also assume that $mG$ is a closed subgroup of $G$ for $m = 2, 3, \cdots$ then
Corollary 11.6 shows that $\operatorname{Ext}_f\{G, \mathcal{T}_{q+1}\}$ is a closed subgroup of $\operatorname{Ext}\{G, \mathcal{T}_{q+1}\}$.
Consequently we obtain

Theorem 32.3. (Steenrod [9]). If $G$ is a topological group and $mG$ is a closed
subgroup of $G$ for $m = 2, 3, \cdots$ then $H_w^q(K, G)$ is topological.

The expressions for $Q^q$ and $Q_w^q$ can be simplified if additional information
concerning the group $G$ is available. If $G$ is infinitely divisible then, by Corollary
11.4, $\operatorname{Ext}\{G, H\} = 0$ for all $H$ and therefore

Corollary 32.4. If $G$ is infinitely divisible then $Q^q(K, G) = Q_w^q(K, G) = 0$ and
$H^q(K, G) = H_w^q(K, G) \cong \operatorname{Hom}\{\mathcal{H}_q, G\}$.

From Theorem 17.2 we deduce

Corollary 32.5. If $G$ has no elements of finite order then

$Q_w^q(K, G) \cong \operatorname{Ext}\{G, \mathcal{T}_{q+1}\}$.

If, in addition, $G$ is discrete then

$Q_w^q(K, G) \cong \operatorname{Hom}\{\mathcal{T}_{q+1}, G_\infty/G\}$.

In particular, if $G = I$ then, by Theorem 17.1, $Q_w^q(K, I) \cong \operatorname{Char} \mathcal{T}_{q+1}$ and
therefore

(32.11) $H_w^q(K, I) \cong \operatorname{Hom}\{\mathcal{H}_q, I\} \times \operatorname{Char} \mathcal{T}_{q+1}$.

Theorem 32.6. If $G$ is compact and topological then $H^q(K, G) = H_w^q(K, G)$
is compact and topological and

(32.12) $Q^q(K, G) = Q_w^q(K, G) \cong \operatorname{Ext}\{G, \mathcal{T}_{q+1}\} \cong \operatorname{Char} \operatorname{Hom}\{G, \mathcal{T}_{q+1}\}$.

This is a consequence of Corollary 11.7 and Theorem 15.1. Since $G$ is
compact, $\mathcal{T}_{q+1}$ discrete, and only continuous homomorphisms are taken in
$\operatorname{Hom}\{G, \mathcal{T}_{q+1}\}$, it follows that in the formula (32.12) for $Q^q(K, G)$ we may replace $G$
by $G/G_0$ where $G_0$ is the component of 0 in $G$.

Corollary 32.7. If $\mathcal{H}_{q+1}(K, I)$ has a finite number of generators then
$B^q(K, G) = B_w^q(K, G)$ and

(32.13) $H^q(K, G) = H_w^q(K, G) \cong \operatorname{Hom}\{\mathcal{H}_q, G\} \times \operatorname{Ext}\{G, \mathcal{T}_{q+1}\}$.

In fact, since $\operatorname{Ext}_f\{G, \mathcal{H}_{q+1}\} = 0$ (Corollary 11.3) it follows from (31.3) that
$B^q = B_w^q$. Since also $\operatorname{Ext}_f\{G, \mathcal{T}_{q+1}\} = 0$, formula (32.13) follows from
Theorem 32.2.

In particular, Corollary 32.7 applies if $K$ is a finite complex (cf. Alexandroff-
Hopf [1], Ch. V and Steenrod [9], p. 675).
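For a finite complex the decomposition of Theorem 32.1 can be checked by brute force in a very small case. The sketch below is a hypothetical example, not one treated in the text: $K$ has a single $q$-cell $\sigma$ and a single $(q+1)$-cell $\tau$ with incidence number $[\tau : \sigma] = m \neq 0$, and $G$ is the group of integers mod $n$. Then every $q$-chain is a cycle, the boundaries are the multiples $mg\,\sigma$, the integral cohomology group in dimension $q$ vanishes, and the one in dimension $q+1$ is cyclic of order $m$. The comparison uses the standard fact that the group of extensions of $G$ by a cyclic group of order $m$ is $G/mG$; that fact is assumed here rather than derived from the results quoted above.

```python
from math import gcd


def homology_order(m, n):
    """Order of Z^q / B^q for the two-cell complex with coefficients mod n."""
    G = range(n)
    cycles = set(G)                          # no (q-1)-cells: every chain is a cycle
    boundaries = {(m * g) % n for g in G}    # boundary of g*tau is (m*g)*sigma
    return len(cycles) // len(boundaries)


def predicted_order(m, n):
    """Order of Hom{0, G} x Ext{G, Z/m}, assuming Ext{G, Z/m} = G/mG."""
    return gcd(m, n)
```

For instance, with $m = 2$ and $n = 4$ both functions give order 2, while for $m = 2$ and $n = 3$ both give the trivial group.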

33. Computation of the cohomology groups 

We start out with a brief review of the duality between homology and
cohomology. Let $G$ be a discrete group and $\hat{G} = \operatorname{Char} G$ compact and topological.
Since $\hat{G}$ and $G$ are dually paired to the group $P$ of reals mod 1 (see §13) the
Kronecker index $c^q \cdot d^q \in P$ is defined as in §29 for $c^q \in C^q(K, \hat{G})$ and $d^q \in \mathcal{C}_q(K, G)$.
Since the pairing of $\hat{G}$ and $G$ is dual (Theorem 13.5) we have by Lemma 29.1

(33.1) $C^q(K, \hat{G})$ and $\mathcal{C}_q(K, G)$ are dually paired to $P$,

(33.2) $\mathcal{Z}_q(K, G) = \operatorname{Annih} B^q(K, \hat{G})$; $Z^q(K, \hat{G}) = \operatorname{Annih} \mathcal{B}_q(K, G)$.

These formulas, Theorem 13.7, Lemma 13.2 and Theorem 13.5 imply that the
Kronecker index defines a dual pairing of $\mathcal{H}_q(K, G)$ and $H^q(K, \hat{G})$ to $P$ and that

(33.3) $\mathcal{H}_q(K, G) \cong \operatorname{Char} H^q(K, \operatorname{Char} G)$.





Using this result and the formulas established in the previous section for
$H^q(K, \operatorname{Char} G)$ we could write down a formula expressing $\mathcal{H}_q(K, G)$. For
convenience we first define a subcohomology group

(33.4) $\mathcal{P}_q(K, G) = \operatorname{Annih} Q^q(K, \operatorname{Char} G)$ in $\mathcal{H}_q(K, G)$,

in order to get a more detailed form for our result.

Theorem 33.1. For a star finite complex $K$ the cohomology group $\mathcal{H}_q(K, G)$
of finite cocycles with coefficients in a discrete group $G$ can be expressed in terms of
the cohomology group $\mathcal{H}_q = \mathcal{H}_q(K, I)$ and the integral co-torsion group
$\mathcal{T}_{q+1} = \mathcal{T}_{q+1}(K, I)$. The explicit relation is

(33.5) $\mathcal{H}_q(K, G) \cong (G \circ \mathcal{H}_q) \times \operatorname{Hom}\{\operatorname{Char} G, \mathcal{T}_{q+1}\}$.

More explicitly, $\mathcal{H}_q(K, G)$ has a subgroup $\mathcal{P}_q(K, G)$, defined by (33.4), where

(33.6) $\mathcal{P}_q(K, G)$ is a direct factor of $\mathcal{H}_q(K, G)$,

(33.7) $\mathcal{P}_q(K, G) \cong G \circ \mathcal{H}_q$,

(33.8) $\mathcal{H}_q(K, G)/\mathcal{P}_q(K, G) \cong \operatorname{Hom}\{\operatorname{Char} G, \mathcal{T}_{q+1}\}$.

Proof. Since $Q^q$ is a direct factor of $H^q$ it follows from the character theory
that $\mathcal{P}_q = \operatorname{Annih} Q^q$ is a direct factor of $\mathcal{H}_q(K, G) = \operatorname{Char} H^q$. It also follows
that

$\mathcal{P}_q \cong \operatorname{Char}(H^q/Q^q)$, $\mathcal{H}_q(K, G)/\mathcal{P}_q(K, G) \cong \operatorname{Char} Q^q$.

The first formula and (32.6) imply

$\mathcal{P}_q(K, G) \cong \operatorname{Char} \operatorname{Hom}\{\mathcal{H}_q, \operatorname{Char} G\}$,

which in view of Theorem 18.1 gives (33.7). The second formula combined
with (32.12) proves (33.8).

If $G$ has no elements of finite order, then $\operatorname{Char} G$ is connected and therefore
$\operatorname{Hom}\{\operatorname{Char} G, \mathcal{T}_{q+1}\} = 0$. From (33.7) and (33.8) we therefore obtain

Corollary 33.2. If $G$ has no elements of finite order then

$\mathcal{H}_q(K, G) = \mathcal{P}_q(K, G) \cong G \circ \mathcal{H}_q(K, I)$.

We now proceed to give an intrinsic characterization of the subgroup $\mathcal{P}_q$ of
$\mathcal{H}_q(K, G)$. A cocycle $w^q \in \mathcal{Z}_q(K, G)$ will be called pure if it is a linear
combination of integral cocycles, as

$w^q = \sum_{i=1}^{k} g_i w_i^q$, $g_i \in G$, $w_i^q \in \mathcal{Z}_q(K, I)$.

Lemma 33.3. The group $\mathcal{P}_q(K, G)$ is the subgroup of $\mathcal{H}_q(K, G)$ determined by
the pure cocycles.

Proof. Let $\mathcal{S}$ be the subgroup of $\mathcal{Z}_q(K, G)$ consisting of all the pure cocycles.
It may be shown that $\mathcal{B}_q(K, G) \subset \mathcal{S}$. In order to prove that $\mathcal{S}/\mathcal{B}_q(K, G) =
\mathcal{P}_q(K, G)$ we must prove that $\mathcal{S}/\mathcal{B}_q(K, G) = \operatorname{Annih} Q^q(K, \hat{G})$ where $\hat{G} = \operatorname{Char} G$.





This is equivalent to proving that $Q^q(K, \hat{G}) = \operatorname{Annih}(\mathcal{S}/\mathcal{B}_q(K, G))$, which
reduces to the formula

$A^q(K, \hat{G}) = \operatorname{Annih} \mathcal{S}$,

that we now propose to establish.

Let $z^q \in A^q(K, \hat{G})$ and let $w^q \in \mathcal{S}$. Since $w^q = \sum g_i w_i^q$, where $w_i^q \in \mathcal{Z}_q(K, I)$,
and since $z^q \cdot w_i^q = 0$ by the definition of $A^q$, it follows that $z^q \cdot w^q = 0$.

Suppose now that $c^q$ lies in $C^q(K, \hat{G})$ but not in $A^q(K, \hat{G})$. There is then a
$w^q \in \mathcal{Z}_q(K, I)$ such that $c^q \cdot w^q = \hat{g} \neq 0$ where $\hat{g} \in \hat{G}$. Pick $g \in G$ so that $\hat{g}(g) \neq 0$
and define $w_1^q = g w^q$. Clearly $w_1^q$ is a pure cocycle and $c^q \cdot w_1^q = \hat{g}(g) \neq 0$,
hence $c^q$ is not in $\operatorname{Annih} \mathcal{S}$. This concludes the proof of the Lemma.

Using the description of $\mathcal{P}_q(K, G)$ given in the Lemma we could easily establish
the isomorphism $\mathcal{P}_q \cong G \circ \mathcal{H}_q(K, I)$ directly, using the definition of the tensor
product. This was the procedure adopted by Čech [3], who essentially has
proved all the results of this section. Our main improvement is that our
isomorphisms are given explicitly and invariantly, while Čech used generators and
relations throughout.


34. The groups $H_t^q$

The fact that the groups $H^q$ and $H_w^q$ may not be topological groups even
though the coefficient group $G$ is chosen to be topological induced Lefschetz and
others to introduce the following group, for a topological coefficient group $G$,

$H_t^q(K, G) = Z^q(K, G)/\bar{B}^q(K, G)$

as a standard homology group for $K$.

The relation of this group to the groups previously considered is immediate:

(34.1) $H_t^q \cong H^q/\bar{0} \cong H_w^q/\bar{0}$.

Theorem 32.3 can now be reformulated as follows.

Theorem 34.1 (Steenrod [9]). If $G$ is topological and $mG$ is closed for
$m = 2, 3, \cdots$ then $H_w^q(K, G) = H_t^q(K, G)$.

Since $G$ is a topological group, $A^q(K, G)$ is a closed subgroup of $Z^q(K, G)$
(Lemma 29.2) and consequently $\bar{B}^q \subset A^q$. It follows that the Kronecker
index can be defined for elements of $H_t^q(K, G)$ and $\mathcal{H}_q(K, I)$. We define a
subhomology group

(34.2) $Q_t^q(K, G) = \operatorname{Annih} \mathcal{H}_q(K, I)$ in $H_t^q(K, G)$.

Theorem 34.2. For a star finite complex $K$ the topological homology group
$H_t^q(K, G)$ of infinite cycles with coefficients in a topological group $G$ can be
expressed in terms of the integral cohomology group $\mathcal{H}_q = \mathcal{H}_q(K, I)$ and the integral
co-torsion group $\mathcal{T}_{q+1} = \mathcal{T}_{q+1}(K, I)$ of finite cocycles. The explicit relation is

(34.3) $H_t^q(K, G) \cong \operatorname{Hom}\{\mathcal{H}_q, G\} \times (\operatorname{Ext}\{G, \mathcal{T}_{q+1}\}/\bar{0})$.





More explicitly, $H_t^q$ has a subgroup $Q_t^q$, defined by (34.2), where

(34.4) $Q_t^q(K, G)$ is a direct factor of $H_t^q(K, G)$,

(34.5) $Q_t^q(K, G) \cong \operatorname{Ext}\{G, \mathcal{T}_{q+1}\}/\bar{0}$,

(34.6) $H_t^q(K, G)/Q_t^q(K, G) \cong \operatorname{Hom}\{\mathcal{H}_q, G\}$.

Proof. From the direct product decomposition (30.5) we obtain

$H_t^q \cong (Z^q/A^q) \times [(A^q/B_w^q)/\bar{0}]$.

Consequently $Q_t^q = Q_w^q/\bar{0}$ is a direct factor. Since $Q_w^q \cong \operatorname{Ext}\{G, \mathcal{T}_{q+1}\}/
\operatorname{Ext}_f\{G, \mathcal{T}_{q+1}\}$ and since, by Corollary 11.6, $\overline{\operatorname{Ext}_f\{G, \mathcal{T}_{q+1}\}} = \bar{0}$, we obtain
(34.5). Formula (34.6) follows from Theorem 30.4.

It might be interesting to notice that, while the groups $H^q(K, G)$ and $H_w^q(K, G)$
were algebraically independent of the choice of the topology in $G$, the group
$H_t^q(K, G)$ depends both algebraically and topologically upon the topology chosen
in $G$.


35. Universal coefficients 

The results of the previous three sections can be summarized in the following 
fashion. 

Universal Coefficient Theorem. In a star finite complex K the integral 
cohomology groups of finite cocycles determine all the homology and cohomology 
groups that were defined for a star finite complex, specifically: 

The groups $G$, $\mathcal{H}_q(K, I)$ and $\mathcal{H}_{q+1}(K, I)$ determine the generalized topological
homology group $H^q(K, G)$ of infinite cycles with coefficients in a generalized
topological group $G$.

The groups $G$, $\mathcal{H}_q(K, I)$ and $\mathcal{T}_{q+1}(K, I)$ determine:

(a) the generalized topological weak homology group $H^q_w(K, G)$ of infinite cycles
with coefficients in a generalized topological group $G$;

(b) the topological homology group $H^q_t(K, G)$ of infinite cycles with coefficients
in a topological group $G$;

(c) the discrete cohomology group $\mathcal{H}_q(K, G)$ of finite cocycles with coefficients in a
discrete group $G$.

This shows that the group $I$ of integers is a universal coefficient group for the
homology theory of the complex $K$. Since the group $P$ of reals mod 1 is the
group of characters of $I$ we have in view of (33.3) the fact that
$\mathcal{H}_q(K, I) \cong \mathrm{Char}\ H^q(K, P)$; therefore all the groups can be expressed in terms of $H^q(K, P)$
and $H^{q+1}(K, P)$, so that $P$ is also universal.

Given a closed subcomplex L of K one often has to consider the relative 
groups of K mod L. However, the complexes used here are so general that 
K — L is also a complex and the usual groups of K mod L coincide with the 
groups of K — L as we have defined them. Consequently all our formulas 
remain valid in the relative theory. 





36. Closure finite complexes 

Closure finite complexes are obtained by replacing condition (26.1) in the 
definition of a complex by the following 

(36.1) Given $\sigma^q_i$, $[\sigma^q_i : \sigma^{q-1}_k] \neq 0$ for only a finite number of indices $k$.

Simplicial complexes are all closure finite. 

In a closure finite complex we consider finite cycles and infinite cocycles and
obtain the discrete homology groups $\mathcal{H}^q(K, G)$ and the topologized cohomology
groups $H_q(K, G)$, $H^w_q(K, G)$ and $H^t_q(K, G)$. All our development can be
repeated with the modification of interchanging homology and cohomology groups
and replacing $q + 1$ by $q - 1$. For instance formula (32.3) will take the form:

$H_q(K, G) \cong \mathrm{Hom}\{\mathcal{H}^q(K, I), G\} \times \mathrm{Ext}\{G, \mathcal{H}^{q-1}(K, I)\}$.

Instead of repeating the arguments for closure finite complexes we can use
the previous results for star finite complexes and apply them to closure finite
complexes by means of the concept of the dual complex. If the complex $K$
is described by the incidence matrices $A^q$, the dual complex $K^*$ will be defined
by the transposed matrices


B q = (A* 9 )' 

The dual of a star finite complex is closure finite and vice versa. Also
$(K^*)^* = K$. Moreover by passing from a complex to its dual, the boundary
operation becomes the coboundary, and vice versa. Hence the homology and
cohomology groups are interchanged, and our formulas apply.

A locally finite (i.e. both closure and star finite) complex carries therefore 
two homology theories, namely, the theory of a star finite complex and the 
theory of a closure finite one. In the case of a manifold the Poincaré duality
establishes a relation between the two theories. In general the theories are 
unrelated and in any specific problem we only use one at a time. We will quote 
two examples to this effect. 

A) In the following chapter we define for every compact metric space a 
complex called the fundamental complex. This complex is locally finite, but 
its closure finite theory is trivial, while its star finite theory is extremely useful 
for the study of the underlying space. 

B) Let us consider two infinite polyhedra represented as two locally finite
complexes $K$ and $K'$. Given a continuous mapping $f$ of $K$ into $K'$ it is well
known that $f$ induces homomorphisms: 1°) of the groups of finite cycles of $K$
into the corresponding groups of $K'$, 2°) of the groups of infinite cocycles of $K'$
into the corresponding groups of $K$. This explains why in problems connected
with continuous mappings (like Hopf's mapping theorem and its generalizations;
see [4]) we use only finite cycles and infinite cocycles, or in other words we use
only the closure finite theory of $K$ and $K'$.





Chapter VI. Topological Spaces 

Here we formulate our results for the homology groups of a space. In the 
case of a compact metric space, Steenrod has shown that the homology groups 
can all be expressed as corresponding homology groups of the fundamental 
complex of the space, so that the results of Chapter V apply directly (§44). 
For a general space, the Čech homology groups are obtained as (direct or
inverse) limits, so that the decomposition of the homology group is obtained as a
limit of the known decompositions for the homology groups of finite complexes, 
and here the techniques developed in Chapter IV apply. The results obtained 
for a general space are not as complete as those for complexes, partly because 
the limit of a set of direct sums apparently need not be a direct sum, and partly 
because “Lim” and “Ext” do not permute, so that the group Ext* discussed 
in Chap. IV is requisite. We also discuss (§45) Steenrod’s homology groups of 
“regular” cycles. 


37. Chain transformations 

Let $K = \{\sigma^q_i\}$ and $K' = \{\tau^q_j\}$ be two star finite complexes. Suppose also
that for every integer $q$ there is given a matrix of integers,

$B^q = \|b^q_{ij}\|$

with rows indexed by the $q$-cells of $K$, columns by the $q$-cells of $K'$, and with
only a finite number of non-zero entries in each column.

Given a $q$-chain $c^q = \sum_i g_i \sigma^q_i \in C^q(K, G)$ in $K$, define

$Tc^q = \sum_j \Bigl(\sum_i g_i b^q_{ij}\Bigr) \tau^q_j$.

The column finiteness condition implies that the summation $\sum_i g_i b^q_{ij}$ is finite
and therefore that $Tc^q$ is a well defined element of $C^q(K', G)$. We thus obtain
homomorphisms (one for each $q$ and $G$)

$T: C^q(K, G) \to C^q(K', G)$.

Given a finite $q$-chain $d^q = \sum_j g_j \tau^q_j \in \mathcal{C}_q(K', G)$ in $K'$, define

$T^*d^q = \sum_i \Bigl(\sum_j g_j b^q_{ij}\Bigr) \sigma^q_i$.

This time the column finiteness of $B^q$ implies that $T^*d^q$ is finite; hence we
obtain homomorphisms

$T^*: \mathcal{C}_q(K', G) \to \mathcal{C}_q(K, G)$.

$T^*$ is called the dual of $T$.

It can be verified at once that if $c^q$ is a chain in $K$ and $d^q$ is a finite chain in
$K'$ then

(37.1) $(Tc^q) \cdot d^q = c^q \cdot (T^*d^q)$,

whenever the coefficients are such that the Kronecker index has a meaning (§29).
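As a sanity check on (37.1), the following sketch takes integer coefficients and finite chains, so that every sum is finite and the Kronecker index is an ordinary dot product. The matrix `B` and the chains `c`, `d` are made-up illustrative data, not taken from the text, and the function names are hypothetical.

```python
def apply_T(c, B):
    """Tc: the coefficient of tau_j is sum_i c[i] * B[i][j]."""
    return [sum(c[i] * B[i][j] for i in range(len(B)))
            for j in range(len(B[0]))]


def apply_T_dual(d, B):
    """T*d: the coefficient of sigma_i is sum_j B[i][j] * d[j]."""
    return [sum(B[i][j] * d[j] for j in range(len(B[0])))
            for i in range(len(B))]


def kronecker(c, d):
    """Kronecker index of a chain and a cochain on the same cells."""
    return sum(x * y for x, y in zip(c, d))


# Rows of B are indexed by the cells of K, columns by the cells of K'.
B = [[1, 0],
     [2, -1],
     [0, 3]]
c = [4, -2, 5]   # a chain in K (three cells)
d = [7, 1]       # a finite chain in K' (two cells)

lhs = kronecker(apply_T(c, B), d)       # (Tc) . d
rhs = kronecker(c, apply_T_dual(d, B))  # c . (T*d)
```

Both sides are the same double sum over `i` and `j`, taken in a different order, which is why the identity holds for any choice of `B`, `c` and `d`.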





$T$ is called a chain transformation of $K$ into $K'$ if $\partial Tc^q = T(\partial c^q)$ for every
$q$-chain; that is, if

(37.2) $\partial T = T\partial$.

It can be shown that this condition is equivalent to the requirement that

(37.3) $\delta T^* = T^*\delta$.

It follows that a chain transformation $T$ maps the groups $Z^q$, $A^q$, $B^q_w$ and $B^q$
of $K$ homomorphically into the corresponding groups of $K'$. Similarly $T^*$ maps
the groups of $K'$ into the corresponding groups of $K$. In particular a chain
transformation induces homomorphisms of the homology groups

(37.4) $T: H^q(K, G) \to H^q(K', G)$,

(37.5) $T^*: \mathcal{H}_q(K', G) \to \mathcal{H}_q(K, G)$,

and of the corresponding subgroups defined by (32.1) and (33.4)

(37.6) $T: Q^q(K, G) \to Q^q(K', G)$,

(37.7) $T^*: \mathcal{P}_q(K', G) \to \mathcal{P}_q(K, G)$.
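Condition (37.2) can be checked mechanically on a small example. The sketch below assumes a path with three vertices mapped simplicially onto a single edge, the second edge being collapsed. Chains are treated as row vectors, so on 1-chains the condition reads `T1 * D_K_prime == D_K * T0`. The two complexes and all names are hypothetical illustrations.

```python
def matmul(a, b):
    """Product of two integer matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


# K: vertices v0, v1, v2 and edges e0 = (v0, v1), e1 = (v1, v2).
# Rows index edges, columns index vertices: the boundary of e0 is v1 - v0.
D_K = [[-1, 1, 0],
       [0, -1, 1]]

# K': vertices w0, w1 and one edge f = (w0, w1).
D_K_prime = [[-1, 1]]

# Simplicial map v0 -> w0, v1 -> w1, v2 -> w1.
# Rows index cells of K, columns index cells of K'.
T0 = [[1, 0],
      [0, 1],
      [0, 1]]
T1 = [[1],   # e0 is mapped onto f
      [0]]   # e1 is collapsed to the vertex w1, hence to the zero chain

boundary_after_T = matmul(T1, D_K_prime)  # boundary of T applied to e0, e1
T_after_boundary = matmul(D_K, T0)        # T applied to boundary of e0, e1
is_chain_transformation = boundary_after_T == T_after_boundary
```

Changing a single entry of `T1` (for instance sending `e1` onto `f` as well) breaks the equality, which is one way to see that (37.2) is a genuine restriction on the matrices.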

38. Naturality 

We are now in a position to give a precise meaning to the fact that the
isomorphisms established in Chapter V are all "natural."

Theorem 38.1. If $T$ is a chain transformation of a complex $K$ into $K'$, then
$T$ permutes with the isomorphisms established in Theorems 30.4 and 31.2, provided
the application of $T$ in any group is taken to mean the application of the
appropriate transformation induced by $T$ on that group.

Proof. If the homomorphism established in Theorem 30.4 be denoted by
$\mu$ (or by $\mu'$, for $K'$), then we have the homomorphisms

$$\begin{array}{ccc}
Z^q(K) & \xrightarrow{\ \mu\ } & \mathrm{Hom}\{\mathcal{H}_q, G\} \\
\downarrow T & & \downarrow T^{**} \\
Z^q(K') & \xrightarrow{\ \mu'\ } & \mathrm{Hom}\{\mathcal{H}'_q, G\}
\end{array}$$

where $T^{**}$ is the homomorphism of $\mathrm{Hom}\{\mathcal{H}_q(K, I), G\}$ into $\mathrm{Hom}\{\mathcal{H}_q(K', I), G\}$,
induced as in §12 by the dual chain transformation $T^*$. The theorem then
asserts that

$\mu' T = T^{**}\mu$.

To show this, take $c^q \in Z^q(K, G)$. The corresponding homomorphism $\theta = \mu c^q$
is then defined, for each cocycle $d^q$ in $\mathcal{Z}_q(K)$, by $\theta(d^q) = c^q \cdot d^q$ (cf. §30). Then
$\theta' = T^{**}\theta$ is, according to the definition of $T^{**}$, simply $\theta'(d'^q) = \theta(T^*d'^q)$. Hence,
for any cocycle $d'^q$,

$\theta'(d'^q) = \theta(T^*d'^q) = c^q \cdot (T^*d'^q) = (Tc^q) \cdot d'^q$.





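In the other direction, $Tc^q$ maps under $\mu'$ into the homomorphism $\phi'$ defined
for $d'^q \in \mathcal{Z}_q(K')$ by

$\phi'(d'^q) = (Tc^q) \cdot d'^q$.

The formulas show that $\phi' = \mu' Tc^q$ and $\theta' = T^{**}\mu c^q$ are in fact identical, as
required by Theorem 38.1.

To treat Theorem 31.2, let $\tau$ (or $\tau'$) denote the homomorphism of $A^q(K, G)$
onto $\mathrm{Hom}\{\mathcal{B}_{q+1}(K, I), G\}$ given in that theorem, while $\eta$ is the map of the
latter group onto $\mathrm{Ext}\{G, \mathcal{H}_{q+1}\}$. The figure is

$$\begin{array}{ccccc}
A^q & \xrightarrow{\ \tau\ } & \mathrm{Hom}\{\mathcal{B}_{q+1}, G\} & \xrightarrow{\ \eta\ } & \mathrm{Ext}\{G, \mathcal{H}_{q+1}\} \\
\downarrow T & & \downarrow T^{**} & & \downarrow T^{**} \\
A'^q & \xrightarrow{\ \tau'\ } & \mathrm{Hom}\{\mathcal{B}'_{q+1}, G\} & \xrightarrow{\ \eta'\ } & \mathrm{Ext}\{G, \mathcal{H}'_{q+1}\}
\end{array}$$

where $T^{**}$, $T^{**}$ are again the induced homomorphisms (on the Hom groups and on
the Ext groups respectively). If $z^q \in A^q(K, G)$ is
given, $\phi = \tau z^q$ is defined on each coboundary $\delta d^q$ as $\phi(\delta d^q) = z^q \cdot d^q$, while
$\phi' = T^{**}\phi$ is defined in turn as

$\phi'(\delta d'^q) = \phi(T^*\delta d'^q) = \phi(\delta T^*d'^q) = z^q \cdot (T^*d'^q)$.

On the other hand, $\chi = \tau'(Tz^q)$ is defined on a coboundary $\delta d'^q$ of $K'$ as

$\chi(\delta d'^q) = (Tz^q) \cdot d'^q = z^q \cdot (T^*d'^q)$.

The results are identical, so $T^{**}\tau = \tau' T$. Now the "naturality" theorem for
group extensions showed that $T^{**}$ permutes with $\eta$, as in $T^{**}\eta = \eta' T^{**}$.
Combination of these results gives

$(\eta'\tau')T = T^{**}(\eta\tau)$.

This is the required commutativity condition, for $\eta\tau$ is the isomorphism
envisaged in Theorem 31.3.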


39. Čech's homology groups

We now briefly outline Čech's method of defining the homology and
cohomology groups for a space $X$. Let $U_\alpha$ be a finite open covering of $X$ and $N_\alpha$
the nerve of $U_\alpha$. If $U_\beta$ is a refinement of $U_\alpha$ we write $\alpha < \beta$. For $\alpha < \beta$ we
have a chain transformation $T_{\alpha\beta}: N_\beta \to N_\alpha$ defined as follows: for each open
set of the covering $U_\beta$ select a set of $U_\alpha$ containing it; this maps the vertices of
$N_\beta$ into the vertices of $N_\alpha$ and leads to a simplicial mapping $T_{\alpha\beta}$. This chain
transformation is not defined uniquely, but the induced homomorphisms

$T_{\alpha\beta}: H^q(N_\beta, G) \to H^q(N_\alpha, G)$,

$T^*_{\beta\alpha}: \mathcal{H}_q(N_\alpha, G) \to \mathcal{H}_q(N_\beta, G)$





are unique. Using the directed system of all the finite open coverings of $X$
we define 28

(39.1) $\mathcal{H}^q(X, G) = \varprojlim H^q(N_\alpha, G)$,

(39.2) $\mathcal{H}_q(X, G) = \varinjlim \mathcal{H}_q(N_\alpha, G)$.
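To make the construction of nerves and projections concrete, here is a minimal sketch. It assumes finite sets of points standing in for the open sets of a covering, and it picks the first containing set as the projection, which is only one of the possible non-unique choices mentioned above. The function names and the sample coverings are hypothetical.

```python
from itertools import combinations


def nerve(cover):
    """Simplices of the nerve: index tuples whose sets share a point."""
    simplices = []
    for k in range(1, len(cover) + 1):
        for idx in combinations(range(len(cover)), k):
            if set.intersection(*(cover[i] for i in idx)):
                simplices.append(idx)
    return simplices


def projection(fine, coarse):
    """For each set of the refinement pick the first coarse set containing it."""
    return [next(a for a, big in enumerate(coarse) if small <= big)
            for small in fine]


# Two coverings of the point set {0, ..., 5}; `fine` refines `coarse`.
coarse = [{0, 1, 2, 3}, {2, 3, 4, 5}]
fine = [{0, 1, 2}, {2, 3}, {3, 4, 5}]

coarse_nerve = nerve(coarse)
fine_nerve = nerve(fine)
proj = projection(fine, coarse)

# A vertex map is simplicial if the image of every simplex is a simplex.
images = [tuple(sorted({proj[v] for v in simplex})) for simplex in fine_nerve]
is_simplicial = all(image in coarse_nerve for image in images)
```

Here the middle set `{2, 3}` lies in both sets of `coarse`, so a different rule could send it to the other vertex; the text's point is that such choices do not affect the induced homomorphisms of the homology groups.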

In (39.2) the groups are all discrete. In (39.1) $G$ can be any generalized
topological group and $\mathcal{H}^q(X, G)$, as an inverse limit of generalized topological
groups, also is a generalized topological group. If $G$ has the property that each
of its subgroups $mG$ ($m = 2, 3, \cdots$) is closed in $G$, the finiteness of each $N_\alpha$
implies that $H^q(N_\alpha, G)$ and hence $\mathcal{H}^q(X, G)$ is topological. If $G$ does not have
this property, it would still be possible to consider the group

$\varprojlim H^q_t(N_\alpha, G) = \varprojlim [H^q(N_\alpha, G)/\bar{0}]$.

This group is always topological but its relation to the other groups is rather
obscure.

In view of (37.6) the subgroups $Q^q(N_\alpha, G)$ of $H^q(N_\alpha, G)$ form an inverse
system. We define

(39.3) $\mathcal{Q}^q(X, G) = \varprojlim Q^q(N_\alpha, G)$.

Clearly $\mathcal{Q}^q$ is a subgroup of $\mathcal{H}^q(X, G)$.

Similarly, in view of (37.7), the subgroups $\mathcal{P}_q(N_\alpha, G)$ of $\mathcal{H}_q(N_\alpha, G)$ form a
direct system so we define

(39.4) $\mathcal{P}_q(X, G) = \varinjlim \mathcal{P}_q(N_\alpha, G)$.

$\mathcal{P}_q$ is a subgroup of $\mathcal{H}_q(X, G)$.

Lemma 39.1. The Kronecker index establishes a pairing of $\mathcal{H}^q(X, G)$ and
$\mathcal{H}_q(X, I)$ with values in $G$; under this pairing

$\mathcal{Q}^q(X, G) = \mathrm{Annih}\ \mathcal{H}_q(X, I)$.

Lemma 39.2. Let $G$ be discrete and $\hat{G} = \mathrm{Char}\ G$. The Kronecker index
establishes a dual pairing of $\mathcal{H}^q(X, \hat{G})$ and $\mathcal{H}_q(X, G)$ with values in the group $P$ of reals
mod 1; under this pairing

$\mathcal{H}_q(X, G) \cong \mathrm{Char}\ \mathcal{H}^q(X, \hat{G})$,

$\mathcal{P}_q(X, G) = \mathrm{Annih}\ \mathcal{Q}^q(X, \hat{G})$.

Both lemmas have been established for each of the complexes $N_\alpha$. The
passage to the limit is possible in view of formula (37.1).

In $\mathcal{H}_q(X, G)$ we also consider the subgroup $\mathcal{T}_q(X, G)$ of all elements of finite
order. Since each approximating group $\mathcal{H}_q(N_\alpha, G)$ has a finite set of
generators, one can show, by arguments resembling those of §24, that

$\mathcal{T}_q(X, G) = \varinjlim \mathcal{T}_q(N_\alpha, G)$.

28 For more detail see Lefschetz [7]. Although the definition of the homology and
cohomology groups given here is valid for any space $X$, it is well known that its interest
is restricted to compact spaces only. This is due to the fact that only in compact spaces
is the family of finite open coverings cofinal with the family of all open coverings.

40. Formulas for a general space 

Using the formulas for complexes and applying a straightforward passage
to the limit we obtain here some relations for $\mathcal{H}^q(X, G)$ and $\mathcal{H}_q(X, G)$ in terms
of the groups $\mathcal{H}_q(X, I)$ and $\mathcal{T}_{q+1}(X, I)$. The results are not as complete as
in the case of a complex.

Theorem 40.1. For a space $X$ and a generalized topological coefficient group $G$
the subgroup $\mathcal{Q}^q$ of the Čech homology group is expressible, in terms of a co-torsion
group, as

(40.1) $\mathcal{Q}^q(X, G) \cong \mathrm{Ext}^*\{G, \mathcal{T}_{q+1}(X, I)\}$,

while the corresponding factor group $\mathcal{H}^q(X, G)/\mathcal{Q}^q(X, G)$ is isomorphic to a
subgroup of $\mathrm{Hom}\{\mathcal{H}_q(X, I), G\}$.

If $G/mG$ is compact and topological for $m = 2, 3, \cdots$ then

(40.2) $\mathcal{H}^q(X, G)/\mathcal{Q}^q(X, G) \cong \mathrm{Hom}\{\mathcal{H}_q(X, I), G\}$.

Proof. For each nerve $N_\alpha$ we have (Theorem 32.1)

$Q^q(N_\alpha, G) \cong \mathrm{Ext}\{G, \mathcal{T}_{q+1}(N_\alpha, I)\}$.

The groups on either side form inverse systems and it follows from Theorem
38.1 and Lemma 20.2 that the limits of these systems are isomorphic,

$\mathcal{Q}^q(X, G) \cong \varprojlim \mathrm{Ext}\{G, \mathcal{T}_{q+1}(N_\alpha, I)\}$.

However since $\mathcal{T}_{q+1}(X, I) = \varinjlim \mathcal{T}_{q+1}(N_\alpha, I)$ and the groups $\mathcal{T}_{q+1}(N_\alpha, I)$ are
finite it follows from Theorem 24.2 that the limit on the right is $\mathrm{Ext}^*\{G, \mathcal{T}_{q+1}(X, I)\}$.
This proves formula (40.1).

From Theorem 32.1 we also have

$H^q(N_\alpha, G)/Q^q(N_\alpha, G) \cong \mathrm{Hom}\{\mathcal{H}_q(N_\alpha, I), G\}$,

and again the limits of the two inverse systems are isomorphic in view of Theorem
38.1. Consequently from Theorem 21.1 we get

$\varprojlim [H^q(N_\alpha, G)/Q^q(N_\alpha, G)] \cong \mathrm{Hom}\{\mathcal{H}_q(X, I), G\}$.

Now it follows from (20.1) (Chap. IV) that the group

$\mathcal{H}^q(X, G)/\mathcal{Q}^q(X, G) = \varprojlim H^q_\alpha / \varprojlim Q^q_\alpha$

is isomorphic with a subgroup of the group $\varprojlim (H^q_\alpha/Q^q_\alpha)$. This proves the
second assertion of the theorem. The subgroup will turn out to be the whole
group whenever we are able to prove that $Q^q(N_\alpha, G)$ are compact topological
groups.

Suppose now that $G/mG$ is compact and topological for $m = 2, 3, \cdots$.





Given a cyclic group $T$ of order $m \geq 2$ we have $\mathrm{Ext}\{G, T\} \cong G/mG$ (Corollary
11.2) and consequently $\mathrm{Ext}\{G, T\}$ is compact and topological. It follows that
$\mathrm{Ext}\{G, T\}$ is compact and topological for every finite group $T$. In particular
the groups

$Q^q(N_\alpha, G) \cong \mathrm{Ext}\{G, \mathcal{T}_{q+1}(N_\alpha, I)\}$

are all compact and topological.
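The passage from cyclic groups to arbitrary finite groups can be spelled out as follows. This is a sketch that assumes the standard decomposition of a finite abelian group into cyclic summands and the additivity of Ext in its second argument; the symbols $r$ and $m_k$ are introduced here only for illustration.

```latex
T \cong \bigoplus_{k=1}^{r} I/m_k I \quad (m_k \ge 2)
\;\Longrightarrow\;
\mathrm{Ext}\{G, T\} \cong \prod_{k=1}^{r} \mathrm{Ext}\{G, I/m_k I\}
\cong \prod_{k=1}^{r} G/m_k G .
```

A finite product of compact topological groups is again compact and topological, which gives the statement for every finite $T$.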

This completes the proof of the theorem. Notice that if $G/mG$ is compact
and topological for $m = 2, 3, \cdots$ then the group $\mathcal{Q}^q(X, G)$, as a limit of compact
topological groups, is compact and topological.

If $G$ is discrete and has no elements of finite order, or if $\mathcal{T}_{q+1}(X, I)$ is countable, then
by Theorem 24.4 and Corollary 24.1, the group $\mathrm{Ext}^*$ in (40.1) may be replaced
by $\mathrm{Ext}/\mathrm{Ext}_f$. In particular if $G = I$ then by Theorems 17.1 and 40.1,

(40.3) $\mathcal{Q}^q(X, I) \cong \mathrm{Char}\ \mathcal{T}_{q+1}(X, I)$,

(40.4) $\mathcal{H}^q(X, I)/\mathcal{Q}^q(X, I) \cong \mathrm{Hom}\{\mathcal{H}_q(X, I), I\}$.

Theorem 40.2. The Čech homology group $\mathcal{H}^q(X, G)$ of a space $X$ over a compact
topological group $G$ has a subgroup $\mathcal{Q}^q$, with factor group $\mathcal{H}^q/\mathcal{Q}^q$, both expressible
in terms of integral cohomology groups of $X$ as

(40.5) $\mathcal{Q}^q(X, G) \cong \mathrm{Char}\ \mathrm{Hom}\{G, \mathcal{T}_{q+1}(X, I)\}$,

(40.6) $\mathcal{H}^q(X, G)/\mathcal{Q}^q(X, G) \cong \mathrm{Hom}\{\mathcal{H}_q(X, I), G\}$.

Proof. From Theorem 40.1 we have $\mathcal{Q}^q \cong \mathrm{Ext}^*\{G, \mathcal{T}_{q+1}\}$. However since
$G$ is compact topological we have $\mathrm{Ext}^*\{G, \mathcal{T}_{q+1}\} \cong \mathrm{Ext}\{G, \mathcal{T}_{q+1}\}$ (Corollary
24.1) and $\mathrm{Ext}\{G, \mathcal{T}_{q+1}\} \cong \mathrm{Char}\ \mathrm{Hom}\{G, \mathcal{T}_{q+1}\}$ (Theorem 15.1). This proves
formula (40.5). We recall here that only continuous homomorphisms are
considered. Formula (40.6) is a consequence of (40.2).

Theorem 40.3. The Čech cohomology groups $\mathcal{H}_q \supset \mathcal{P}_q$ of a space $X$ over a
discrete coefficient group $G$ can be expressed, in part, in terms of the integral
cohomology groups as

(40.7) $\mathcal{P}_q(X, G) \cong G \circ \mathcal{H}_q(X, I)$,

(40.8) $\mathcal{H}_q(X, G)/\mathcal{P}_q(X, G) \cong \mathrm{Hom}\{\mathrm{Char}\ G, \mathcal{T}_{q+1}(X, I)\}$.

Proof. Let $\hat{G} = \mathrm{Char}\ G$. Since $\mathcal{H}_q(X, G) \cong \mathrm{Char}\ \mathcal{H}^q(X, \hat{G})$ and
$\mathcal{P}_q = \mathrm{Annih}\ \mathcal{Q}^q(X, \hat{G})$ we have

$\mathcal{P}_q(X, G) \cong \mathrm{Char}\ [\mathcal{H}^q(X, \hat{G})/\mathcal{Q}^q(X, \hat{G})]$,

and using Theorems 40.2 and 18.1 we get

$\mathcal{P}_q(X, G) \cong \mathrm{Char}\ \mathrm{Hom}\{\mathcal{H}_q(X, I), \mathrm{Char}\ G\} \cong G \circ \mathcal{H}_q(X, I)$.

This formula could have been proved directly, passing to the limit with
$\mathcal{P}_q(N_\alpha, G) \cong G \circ \mathcal{H}_q(N_\alpha, I)$. Since also $\mathcal{H}_q/\mathcal{P}_q \cong \mathrm{Char}\ \mathcal{Q}^q(X, \mathrm{Char}\ G)$, formula
(40.8) is a consequence of Theorem 40.2.





The theorems and proofs carry over without change to the homology theory of 
X modulo a closed subset. Another generalization can be obtained by replacing 
the space X by a net of complexes, as defined by Lefschetz ([7] Ch. VI). 

We are unable to answer the question whether $\mathcal{Q}^q(X, G)$ and $\mathcal{P}_q(X, G)$ are
direct factors of $\mathcal{H}^q(X, G)$ and $\mathcal{H}_q(X, G)$. This is why we do not obtain
expressions for $\mathcal{H}^q(X, G)$ and $\mathcal{H}_q(X, G)$ in terms of $\mathcal{H}_q(X, I)$ and $\mathcal{T}_{q+1}(X, I)$. The
best we achieve in the case of a general space $X$ is a description of the subgroups
$\mathcal{Q}^q$ and $\mathcal{P}_q$ and of the corresponding factor groups, leaving the direct product
proposition undecided. 29

In the following sections of this chapter we shall discuss the case when X is a 
compact metric space, using the method of the fundamental complex. In this 
case we are able to obtain complete results, including the direct product de¬ 
composition. 

41. The case q = 0 

Before we proceed with the treatment of compact metric spaces we will discuss 
some details connected with the definition of the homology and cohomology 
groups for the dimension zero. 

Let $K$ be a finite simplicial complex. If we assume that there are no cells
of dimension less than zero then every 0-chain will be a 0-cycle and the groups
$H^0(K, G)$ and $\mathcal{H}_0(K, G)$ will be isomorphic to the product of $G$ by itself $n$ times,
$n$ being the number of components of $K$.

An alternate procedure is to consider $K$ "augmented" by a single $(-1)$-cell
$\sigma^{-1}$ such that $[\sigma^0_i : \sigma^{-1}] = 1$ for all $\sigma^0_i$. In this case, given a 0-chain $c^0 = \sum g_i \sigma^0_i$,
we have $\partial c^0 = (\sum g_i)\sigma^{-1}$ and consequently $c^0$ is a cycle if and only if $\sum g_i = 0$.

The cohomology group gets affected also because the cocycle $\sum \sigma^0_i$ that was not
a coboundary in the first approach is a coboundary in the augmented complex,
since $\delta\sigma^{-1} = \sum \sigma^0_i$. It turns out that $H^0(K, G)$ and $\mathcal{H}_0(K, G)$ are isomorphic to
the product of $G$ by itself $n - 1$ times.

In defining the groups $\mathcal{H}^0(X, G)$ and $\mathcal{H}_0(X, G)$ for a space we again have two
alternatives according as the nerves $N_\alpha$ are augmented or not.

Both the augmented and unaugmented complexes are abstract complexes
in the sense of Ch. V and therefore all our previous results hold for either
definition of $\mathcal{H}^0$ and $\mathcal{H}_0$. However in the discussion of compact metric spaces that
follows there is an advantage in considering the nerves as augmented complexes,
so as to have $\mathcal{H}^0(X, G) = \mathcal{H}_0(X, G) = 0$ if $X$ is a connected space.
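The drop from $n$ to $n - 1$ factors can be illustrated numerically. The sketch below assumes rational coefficients, so that "product of $G$ by itself $n$ times" becomes "vector space of dimension $n$", and uses a made-up complex with two components. The helper `rank` and all variable names are hypothetical.

```python
from fractions import Fraction


def rank(matrix):
    """Rank of an integer matrix over the rationals (Gauss-Jordan elimination)."""
    rows = [[Fraction(x) for x in row] for row in matrix]
    r = 0
    for col in range(len(rows[0]) if rows else 0):
        pivot = next((i for i in range(r, len(rows)) if rows[i][col] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(len(rows)):
            if i != r and rows[i][col] != 0:
                factor = rows[i][col] / rows[r][col]
                rows[i] = [a - factor * b for a, b in zip(rows[i], rows[r])]
        r += 1
    return r


# Two components: a path 0-1-2 and a single edge 3-4.
# Rows index edges, columns index vertices (boundary of edge (a, b) is b - a).
d1 = [[-1, 1, 0, 0, 0],
      [0, -1, 1, 0, 0],
      [0, 0, 0, -1, 1]]
num_vertices = 5

# Augmentation: every vertex has incidence number 1 with the single (-1)-cell.
augmentation = [[1] for _ in range(num_vertices)]

# Unaugmented: every 0-chain is a cycle.
dim_h0_plain = num_vertices - rank(d1)
# Augmented: 0-cycles are the chains whose coefficients sum to zero.
dim_h0_augmented = (num_vertices - rank(augmentation)) - rank(d1)
```

With one component the augmented value would be zero, which is the convenience the text points to for connected spaces.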

42. Fundamental complexes 

Let $X$ be a compact metric space. There is then a sequence $U_n$ ($n = 0, 1, \cdots$)
of finite open coverings of $X$ such that $U_n$ is a refinement of $U_{n-1}$ and every finite
open covering of $X$ has some $U_n$ as a refinement. This last property asserts
that in the directed family of all the finite open coverings of $X$ the sequence
$\{U_n\}$ constitutes a cofinal subfamily and therefore the Čech homology and
cohomology groups can be equivalently defined using only the sequence of coverings
$U_n$. We shall assume that $U_0$ is a covering consisting of only one set, namely $X$
itself, so that the nerve $N_0$ of $U_0$ is a vertex. For each $n$ we select a projection
$T_n: N_n \to N_{n-1}$ of the nerve of $U_n$ into the nerve of $U_{n-1}$. The projections
$N_n \to N_{n-k}$ we define by transitivity.

29 Steenrod [9] §10 brings an argument, which if correct would settle the question
positively. Unfortunately an error occurs on p. 681, line 6. The error was noticed by C.
Chevalley, who has also constructed an example showing that the argument could not be
corrected in the general case. If $X$ is metric compact, Steenrod's argument can be
corrected to give the desired direct product decomposition (see §44 below).

We now define the fundamental complex $K$ of $X$ as follows. The complexes
$N_n$ for $n = 0, 1, \cdots$ shall be disjoint subcomplexes of $K$. For each $n = 1, 2, \cdots$
and each simplex $\sigma^q$ of $N_n$ we introduce a new $(q + 1)$-cell $\mathcal{D}\sigma^q$ whose boundary
is $T_n\sigma^q - \sigma^q - \mathcal{D}\partial\sigma^q$. This formula gives a recursive definition of the incidence
numbers.

In order to give a more intuitive picture of $K$ we may consider each of the
nerves $N_n$ as a geometric simplicial complex; the projection $T_n$ can then be
regarded as a continuous simplicial transformation; that is, as linear on every
simplex $\sigma^q$ of $N_n$, while $\mathcal{D}\sigma^q$ can be visualized as a deformation prism consisting
of intervals joining each point of $\sigma^q$ with its image under $T_n$. With this
interpretation $K$ becomes a geometric complex and the cells $\mathcal{D}\sigma^q$ can be subdivided
so as to furnish a simplicial subdivision of $K$. It is clear from this picture that
$K$ can be contracted to a point, namely by moving every point up its projection
lines towards the vertex $N_0$.

The complex $K$ is countable and is locally finite; i.e., both closure and star
finite. Viewing $K$ as a closure finite complex, we can define finite cycles and
infinite cocycles. However, since $K$ is contractible all the homology groups with
finite cycles will vanish. Using the results of Ch. V we conclude that the
cohomology groups with infinite cocycles also will vanish. Consequently,
regarded as a closure finite complex, the structure of $K$ is trivial. If we approach
$K$ as a star finite complex we obtain cohomology groups with finite cocycles and
homology groups with infinite cycles. Regarded this way the complex $K$
furnishes a true picture of the combinatorial structure of the space $X$.

43. Relations between a space and its fundamental complex 

Theorem 43.1. The compact metric space $X$ and its fundamental complex $K$
are linked by isomorphisms

(43.1) $\mathcal{H}^q(X, G) \cong H^{q+1}_w(K, G)$,

(43.2) $\mathcal{H}_q(X, G) \cong \mathcal{H}_{q+1}(K, G)$.

We shall restrict ourselves here to indicate the definitions of the isomorphisms
without going into the complete proof, which involves lengthy but
straightforward calculations. 30

30 This proof is closely related to one given by Steenrod; see [10], §4.

Let $\mathbf{z}^q$ be an element of $\mathcal{H}^q(X, G)$. Then $\mathbf{z}^q$ can be represented by a sequence
of cycles $z^q_n \in Z^q(N_n, G)$ such that $z^q_{n-1} - T_n z^q_n \in B^q(N_{n-1}, G)$. For each
$n = 1, 2, \cdots$ select a chain $c^{q+1}_{n-1}$ in $N_{n-1}$ such that

$\partial c^{q+1}_{n-1} = z^q_{n-1} - T_n z^q_n$,

and consider the chain

$z^{q+1} = \sum_{n=1}^{\infty} c^{q+1}_{n-1} + \sum_{n=1}^{\infty} \mathcal{D} z^q_n$.

We verify that

$\partial z^{q+1} = \sum_{n=1}^{\infty} (z^q_{n-1} - T_n z^q_n) + \sum_{n=1}^{\infty} (T_n z^q_n - z^q_n - \mathcal{D}\partial z^q_n) = z^q_0 - \sum_{n=1}^{\infty} \mathcal{D}\partial z^q_n = 0$,

since $\partial z^q_n = 0$, while $z^q_0 = 0$ for $q \neq 0$, $z^0_0 = 0$ by §41. Consequently $z^{q+1}$ is a
cycle of $K$. If instead of $\{c^{q+1}_n\}$ we use a sequence $\{c'^{q+1}_n\}$ to define a cycle
$z'^{q+1}$, then

$z^{q+1} - z'^{q+1} = \sum_{n=1}^{\infty} (c^{q+1}_{n-1} - c'^{q+1}_{n-1})$.

Each term $c^{q+1}_{n-1} - c'^{q+1}_{n-1}$ is a finite cycle and therefore bounds in $K$; therefore
$z^{q+1} - z'^{q+1}$ is a weakly bounding cycle and $z^{q+1}$ determines uniquely an element
$\mathbf{z}^{q+1} \in H^{q+1}_w(K, G)$. We define

$\phi(\mathbf{z}^q) = \mathbf{z}^{q+1}$.

Now let $\mathbf{w}^q \in \mathcal{H}_q(X, G)$. The element $\mathbf{w}^q$ can be represented for suitable $n$
by a single cocycle $w^q \in \mathcal{Z}_q(N_n, G)$. We verify that $\mathcal{D}w^q$ is then a $(q + 1)$-cocycle
of $K$. Using the formula

$\delta w^q = \mathcal{D}T^*w^q - \mathcal{D}w^q$ in $K$,

and the fact that $\mathcal{D}$ and $\delta$ commute we show that $\mathcal{D}w^q$ determines uniquely an
element $\mathbf{w}^{q+1}$ of $\mathcal{H}_{q+1}(K, G)$. We define

$\psi(\mathbf{w}^q) = \mathbf{w}^{q+1}$.

We also notice that the pair of isomorphisms $\phi$, $\psi$ preserves the Kronecker
index

(43.3) $\phi(\mathbf{z}^q) \cdot \psi(\mathbf{w}^q) = \mathbf{z}^q \cdot \mathbf{w}^q$.

If $X_0$ is a closed subset of $X$ then every covering $U_n$ of $X$ determines a covering
of $X_0$ whose nerve $L_n$ is a subcomplex of the nerve $N_n$ of $U_n$. The subcomplex

$L = \sum_n L_n + \sum_n \mathcal{D}L_n$

of $K$ is then a fundamental complex of $X_0$. The isomorphisms (43.1) and
(43.2) of Theorem 43.1 can be generalized as follows

(43.1') $\mathcal{H}^q(X \bmod X_0, G) \cong H^{q+1}_w(K \bmod L, G)$,

(43.2') $\mathcal{H}_q(X - X_0, G) \cong \mathcal{H}_{q+1}(K - L, G)$.

44. Formulas for a compact metric space 

Using the fundamental complex and the results of Ch. V we shall now establish 
theorems for a compact metric space quite analogous to the ones proved for a 
complex in Ch. V. 

Theorem 44.1. The Čech homology groups of a compact metric space $X$ over
a generalized topological coefficient group $G$ can be expressed in terms of the integral
cohomology groups $\mathcal{H}_q = \mathcal{H}_q(X, I)$, $\mathcal{T}_{q+1} = \mathcal{T}_{q+1}(X, I)$ as

$\mathcal{H}^q(X, G) \cong \mathrm{Hom}\{\mathcal{H}_q, G\} \times (\mathrm{Ext}\{G, \mathcal{T}_{q+1}\}/\mathrm{Ext}_f\{G, \mathcal{T}_{q+1}\})$.

More precisely, in terms of the subhomology group $\mathcal{Q}^q$ of (39.3) we have

(44.1) $\mathcal{Q}^q(X, G)$ is a direct factor of $\mathcal{H}^q(X, G)$,

(44.2) $\mathcal{Q}^q(X, G) \cong \mathrm{Ext}\{G, \mathcal{T}_{q+1}\}/\mathrm{Ext}_f\{G, \mathcal{T}_{q+1}\}$,

(44.3) $\mathcal{H}^q(X, G)/\mathcal{Q}^q(X, G) \cong \mathrm{Hom}\{\mathcal{H}_q, G\}$.

To prove the theorem we use the fact that the Kronecker intersection is
preserved under the pair of isomorphisms $\phi$, $\psi$ of the previous section.
Consequently, since

$\mathcal{Q}^q(X, G) = \mathrm{Annih}\ \mathcal{H}_q(X, I)$ in $\mathcal{H}^q(X, G)$,

$Q^{q+1}_w(K, G) = \mathrm{Annih}\ \mathcal{H}_{q+1}(K, I)$ in $H^{q+1}_w(K, G)$,

we have

$\phi[\mathcal{Q}^q(X, G)] = Q^{q+1}_w(K, G)$,

and the theorem becomes a consequence of Theorems 43.1 and 32.2.

Theorem 44.2. The Čech cohomology groups of a compact metric space $X$
with coefficients in a discrete group $G$ can be expressed in terms of the integral
cohomology groups $\mathcal{H}_q = \mathcal{H}_q(X, I)$, $\mathcal{T}_{q+1} = \mathcal{T}_{q+1}(X, I)$ as

$\mathcal{H}_q(X, G) \cong (G \circ \mathcal{H}_q) \times \mathrm{Hom}\{\mathrm{Char}\ G, \mathcal{T}_{q+1}\}$.

More precisely, in terms of the subgroup $\mathcal{P}_q$ of (39.4), we have

(44.4) $\mathcal{P}_q(X, G)$ is a direct factor of $\mathcal{H}_q(X, G)$,

(44.5) $\mathcal{P}_q(X, G) \cong G \circ \mathcal{H}_q$,

(44.6) $\mathcal{H}_q(X, G)/\mathcal{P}_q(X, G) \cong \mathrm{Hom}\{\mathrm{Char}\ G, \mathcal{T}_{q+1}\}$.





To prove the theorem we notice that

$\mathcal{P}_q(X, G) = \mathrm{Annih}\ \mathcal{Q}^q(X, \mathrm{Char}\ G)$ in $\mathcal{H}_q(X, G)$,

$\mathcal{P}_{q+1}(K, G) = \mathrm{Annih}\ Q^{q+1}_w(K, \mathrm{Char}\ G)$ in $\mathcal{H}_{q+1}(K, G)$,

and therefore

$\psi[\mathcal{P}_q(X, G)] = \mathcal{P}_{q+1}(K, G)$

and the theorem becomes a consequence of Theorems 43.1 and 33.1.

All these results remain valid for the homologies of $X$ modulo a closed subset.
We now proceed to compare the results obtained here for the metric compact
case with the results of §40 concerning general spaces.

Statements (44.1) and (44.4) contain a positive solution for the direct product
problem which is still unsolved for the general space. Formula (44.3) was
proved in (40.2) for general spaces only under the additional condition that
$G/mG$ be compact and topological for $m = 2, 3, \cdots$. Formula (44.2) was
proved for general spaces under the form

$\mathcal{Q}^q(X, G) \cong \mathrm{Ext}^*\{G, \mathcal{T}_{q+1}(X, I)\}$

which is equivalent to (44.2) because

$\mathrm{Ext}^*\{G, T\} \cong \mathrm{Ext}\{G, T\}/\mathrm{Ext}_f\{G, T\}$

for countable groups $T$ with only elements of finite order (Theorem 24.4) and
the group $\mathcal{T}_{q+1}(X, I) = \mathcal{T}_{q+2}(K, I)$ is countable for a compact metric $X$, since $K$
is countable.

Formulas (44.5) and (44.6) coincide with the ones proved in Theorem 40.3
for a general space.

45. Regular cycles

Using the concept of a "regular cycle" Steenrod ([10]) has defined a new
homology group $H^q(X, G)$ of "regular" cycles, for a compact metric space $X$. This
group is useful especially in the case when $X$ is a subset of the $n$-sphere $S^n$,
because it provides information about the structure of the open set $S^n - X$.

Steenrod ([10], Theorem 7) has proved that if $K$ denotes a fundamental
complex of $X$ then

(45.1) $H^q(X, G) \cong H^q(K, G)$.

From this, using Theorems 43.1 and 32.1 we derive the formula

(45.2) $H^q(X, G) \cong \mathrm{Hom}\{\mathcal{H}_{q-1}(X, I), G\} \times \mathrm{Ext}\{G, \mathcal{H}_q(X, I)\}$,

for $q > 0$. This formula expresses $H^q(X, G)$ in terms of $\mathcal{H}_{q-1}(X, I)$ and $\mathcal{H}_q(X, I)$
and hence shows that, essentially, $H^q(X, G)$ is no new invariant.

Let us specialize formula (45.2), assuming that $q = 1$, and that $X$ is connected.
We have then $\mathcal{H}_0(X, I) = 0$ and therefore

(45.3) $H^1(X, G) \cong \mathrm{Ext}\{G, \mathcal{H}_1(X, I)\}$.





Let us further assume $G = I$ and that $X$ is one of the solenoids $\Sigma$. Since $\Sigma$
is a connected, compact abelian group we have $\mathcal{H}^1(\Sigma, P) \cong \Sigma$ (Steenrod
[9], Theorem 15) where $P$ (Steenrod's I) is the group of reals mod 1. Further,
since $\mathrm{Char}\ I \cong P$ we have $\mathcal{H}_1(\Sigma, I) \cong \mathrm{Char}\ \mathcal{H}^1(\Sigma, P) \cong \mathrm{Char}\ \Sigma$. Hence
finally

(45.4) $H^1(\Sigma, I) \cong \mathrm{Ext}\{I, \mathrm{Char}\ \Sigma\}$.

This group will be explicitly computed in Appendix B; it was the starting
point of this investigation (see introduction).

Steenrod has defined a subgroup $\tilde{H}^q(X, G)$ of $H^q(X, G)$ by considering regular
cycles that are sums of finite cycles. He has also proved that under the
isomorphism (45.1) this group is mapped onto the subgroup $B^q_w(K, G)/B^q(K, G)$
of $H^q(K, G)$.

We shall now show that, for $q > 1$,

(45.5) $\tilde{H}^q(X, G) \cong \mathrm{Ext}_f\{G, \mathcal{H}_q(X, I)\}$,

(45.6) $H^q(X, G)/\tilde{H}^q(X, G) \cong \mathcal{H}^{q-1}(X, G)$.

In fact, from Theorems 31.3 and 43.1 we deduce that
$B^q_w(K, G)/B^q(K, G) \cong \mathrm{Ext}_f\{G, \mathcal{H}_{q+1}(K, I)\} \cong \mathrm{Ext}_f\{G, \mathcal{H}_q(X, I)\}$. This proves (45.5). In order
to prove (45.6) notice that
$H^q(X, G)/\tilde{H}^q(X, G) \cong H^q(K, G)/[B^q_w(K, G)/B^q(K, G)] \cong H^q_w(K, G) \cong \mathcal{H}^{q-1}(X, G)$.

Formulas (45.5) and (45.6) provide a splitting of $H^q(X, G)$ different from the
one used in (45.2). The isomorphism (45.6) was established by Steenrod [10],
who has also shown that $\tilde{H}^q$ can be computed using $G$ and $\mathcal{H}_q(X, I)$, without
however getting the explicit formula (45.5).

From (45.5) we immediately deduce the theorem of Steenrod that
$\tilde{H}^q(X, G) = 0$ and $H^q(X, G) \cong \mathcal{H}^{q-1}(X, G)$ whenever $\mathcal{H}_q(X, I)$ has a finite
number of generators.

Appendix A. Coefficient Groups with Operators 

In many topological investigations it is convenient to construct homology
groups $H^q(K, G)$ in cases when $G$ is not just a group, but a ring or even a field.
More generally, G can be allowed to be a group with operators. We show here 
that our results extend unchanged to such cases, and in particular, that the 
resulting homology groups are still completely determined by the integral 
cohomology groups. 

$G$ is called a group with operators $\Omega$ if $G$ is a generalized topological group,
$\Omega$ a space, and if to each element $\omega \in \Omega$ and each $g \in G$ there is assigned an
element $\omega g \in G$ (the result of operating on $g$ with $\omega$), in such wise that

(i) $\omega g$ is a continuous function of the pair $(\omega, g)$,

(ii) $\omega(g_1 + g_2) = \omega g_1 + \omega g_2$ ($g_1, g_2 \in G$).

It then follows that each element $\omega$ determines a (continuous) homomorphism
$g \to \omega g$ of $G$ into $G$; however, distinct elements of $\Omega$ need not determine distinct
homomorphisms. The set $\Omega$ may have a discrete topology, or may even consist
of just one operator $\omega$.

If both $G_1$ and $G_2$ have operators $\Omega$, a homomorphism (or isomorphism) $\phi$ of
$G_1$ into $G_2$ is said to be $\Omega$-allowable if $\phi[\omega g_1] = \omega[\phi g_1]$ for all $g_1 \in G_1$, $\omega \in \Omega$.

If $G$ has operators $\Omega$, a subgroup $S \subset G$ is said to be allowable if $\omega(S) \subset S$
for all $\omega \in \Omega$. The operators $\Omega$ may then be applied in natural fashion to the
factor group $G/S$, by setting $\omega(g + S) = \omega g + S$. Then $G/S$ is a group with
operators $\Omega$, and the natural homomorphism of $G$ on $G/S$ is allowable. 31

If $G$ is a group with operators $\Omega$, the various groups introduced as functions
of $G$ in Chapters I-IV are also groups with operators. Specifically, let $H$ be a
discrete group, and for each $\theta \in \operatorname{Hom}\{H, G\}$ define $\omega\theta$ as $[\omega\theta](h) = \omega[\theta(h)]$.
Then $\omega\theta \in \operatorname{Hom}\{H, G\}$, and

(A.1) $\operatorname{Hom}\{H, G\}$ has operators $\Omega$.

Furthermore, if $H = F/R$, where $F$ is free, the groups $\operatorname{Hom}\{F \mid R, G\}$ and
$\operatorname{Hom}_f\{R, G; F\}$ are allowable subgroups of $\operatorname{Hom}\{R, G\}$, so

(A.2) $\operatorname{Hom}\{R, G\}/\operatorname{Hom}\{F \mid R, G\}$ has operators $\Omega$.

Again, let $f$ be a factor set of $H$ in $G$, and define another factor set $\omega f$ by taking
$[\omega f](h, k)$ as $\omega[f(h, k)]$. Then $\Omega$ becomes a space of operators for the group
$\operatorname{Fact}\{G, H\}$. Furthermore $\operatorname{Trans}\{G, H\}$ is an allowable subgroup; therefore

(A.3) $\operatorname{Ext}\{G, H\}$ has operators $\Omega$.

In similar fashion one concludes that $\operatorname{Ext}_f\{G, H\}$ and $\operatorname{Ext}/\operatorname{Ext}_f$ have operators $\Omega$.

As another case, take $\phi \in \operatorname{Hom}\{G, H\}$ and define a homomorphism
$\omega\phi \in \operatorname{Hom}\{G, H\}$ by setting $[\omega\phi](g) = \phi[\omega(g)]$ for each $g \in G$. If $G$ is compact
or discrete, one may show that $\omega\phi$ is a continuous function of $\omega$ and $\phi$. In this
case, and for any generalized topological group $H$,

(A.4) $\operatorname{Hom}\{G, H\}$ has operators $\Omega$.

In particular, if $G$ is discrete or compact,

(A.5) $\operatorname{Char} G$ has operators $\Omega$.

Given these interpretations of all our basic groups as groups with operators, 
we next demonstrate that the various isomorphisms between these groups, as 
established in Chapters II-IV, are allowable. In particular, an inspection of 
the construction used to establish the fundamental Theorem 10.1 of Chapter II 
proves 

(A.6) The isomorphism

$\operatorname{Ext}\{G, H\} \cong \operatorname{Hom}\{R, G\}/\operatorname{Hom}\{F \mid R, G\}$,
where $H = F/R$, $F$ free, is allowable.

31 Practically all the elementary formal facts about groups and homomorphisms apply
to operator groups and allowable homomorphisms.





The same conclusion holds for the other isomorphisms stated in that theorem. 
Also, the isomorphism $\operatorname{Ext}\{G, H\} \cong \operatorname{Char} \operatorname{Hom}\{G, H\}$ established in Theorem
15.1 for compact topological $G$ and discrete $H$ is allowable. The proof of this
fact depends essentially on showing that the "trace" used in that theorem has
the commutation property,

$t(\omega\theta, \phi) = t(\theta, \omega\phi)$, for any $\theta \in \operatorname{Hom}\{R, G\}$, and $\phi \in \operatorname{Hom}\{G, H\}$.

The allowability of the other isomorphisms in Chapters II-IV is similarly established.
The proofs are closely analogous to the "naturality" proofs of §12,
except that here the operators apply to $G$, while in §12 the operator $T$ applied
to $H$.

Now turn to the homology groups. Let $c^q$ be a chain in the star finite
complex $K$, with coefficients chosen in the group $G$ with operators $\Omega$. For each
$\omega \in \Omega$, define

$\omega(c^q) = \omega\left(\sum_i g_i \sigma_i^q\right) = \sum_i (\omega g_i)\, \sigma_i^q$;

since the result is a chain, and since the requisite continuity holds, the group
$C^q(K, G)$ of $q$-chains has operators $\Omega$. Moreover, $\partial(\omega c) = \omega(\partial c)$, so that both
$Z^q(K, G)$ and $B^q(K, G)$ are allowable subgroups of $C^q$. Therefore

(A.7) $H^q(K, G)$ has operators $\Omega$.

The essential tool in establishing the isomorphisms of Chap. V is the Kronecker
index $c^q \cdot d_q$ for $d_q \in C_q(K, I)$, $c^q \in C^q(K, G)$. We verify at once that

(A.8) $\omega(c^q \cdot d_q) = (\omega c^q) \cdot d_q$ (all $\omega \in \Omega$).

Since the subgroup $A^q$ of $Z^q$ was defined as a certain annihilator under this
Kronecker index (see (29.9)), it follows at once that $A^q$ is an allowable subgroup
of $Z^q$. Furthermore, the proof that $A^q$ is a direct factor of $C^q$ depended on a
decomposition of $\mathcal{C}_q$ as a direct product $\mathcal{C}_q = \mathcal{Z}_q \times \mathcal{D}_q$, for a suitably chosen
group $\mathcal{D}_q$. In the notation of Lemma 16.2, we then had, by means of the
Kronecker index (see the proof of Theorem 30.3)

$\operatorname{Hom}\{\mathcal{C}_q, G\} = \operatorname{Hom}\{\mathcal{C}_q, G; \mathcal{D}_q, 0\} \times \operatorname{Hom}\{\mathcal{C}_q, G; \mathcal{Z}_q, 0\}$.

On the right both factors are allowable subgroups, and the isomorphism to the
direct product is allowable; 32 furthermore, the second factor is the one which
corresponds to the subgroup $A^q$ of $C^q$. Therefore $C^q$ has a representation of the
form $C^q = A^q \times D^q$, where $D^q$ is an allowable subgroup, complementary to $A^q$.
A similar decomposition holds for $Z^q$ and thus for its factor group $H^q = Z^q/B^q$.
In terms of the homology subgroup $Q^q = A^q/B^q$ determined by $A^q$, this proves

(A.9) The isomorphism $H^q \cong (H^q/Q^q) \times Q^q$ is allowable.

32 If $A$ and $B$ are two groups with operators $\Omega$ the direct product $A \times B$ has operators $\Omega$
defined by $\omega(a, b) = (\omega(a), \omega(b))$ for $\omega \in \Omega$.

The further analysis of these two factors, as carried out in Chapter V, all
depended on the Kronecker index. In view of the property (A.8) of this index,
and the property (A.6) of the basic group-extension theorem, we have

(A.10) The isomorphisms

$H^q(K, G)/Q^q(K, G) \cong \operatorname{Hom}\{\mathcal{H}_q, G\}$,

$Q^q(K, G) \cong \operatorname{Ext}\{G, \mathcal{H}_{q+1}\}$

are allowable, as is the isomorphism $H^q \cong \operatorname{Hom} \times \operatorname{Ext}$, obtained by combining
(A.9) and (A.10).

Similar remarks apply to the representation of the "weak" homology group
$H_w^q$ (Theorem 32.2), which is a factor group of $H^q$ by an allowable subgroup.
The same holds for the topologized homology group $H_t^q$ (i.e., the isomorphisms
of Theorem 34.2 are allowable), for in any topological group $G$ with operators $\Omega$,
the continuity of the operators insures that the subgroup $\bar{0} \subset G$ is allowable
(recall that $H_t^q = H^q/\bar{0}$).

Turn next to the analysis of the cohomology groups. The groups $C_q(K, G)$,
with $G$ discrete, again have operators in $\Omega$, under the natural definition. As
in the case of the homology groups, we have

(A.11) $\mathcal{H}_q(K, G)$ has operators $\Omega$.

The representation of these groups depended on duality; i.e., on the Kronecker
index $c^q \cdot d_q$, for $c^q \in Z^q(K, \operatorname{Char} G)$, $d_q \in \mathcal{Z}_q(K, G)$. Given the various definitions
of the effect of an operator $\omega$, one shows easily that

$(\omega c^q) \cdot d_q = c^q \cdot (\omega d_q)$ (all $\omega \in \Omega$).

From this formula one may deduce that the well known isomorphism $\mathcal{H}_q(K, G) \cong
\operatorname{Char} H^q(K, \operatorname{Char} G)$ is allowable. Thence it follows that the isomorphisms of
Theorem 33.1 representing $\mathcal{H}_q$ are allowable.

These considerations yield the following 

Addendum to the Universal Coefficient Theorem. If $K$ is any star finite
complex, $G$ a group with operators $\Omega$, then the homology groups of $K$ (and, if $G$ is
discrete, the finite cohomology groups) with coefficients in $G$ all have operators $\Omega$.
All these groups with their operators are determined by the group $G$ (with its operators)
and the cohomology groups of the finite integral cocycles of $K$.

A similar discussion applies to the results of Chap. VI. 

In many important cases the operators form a ring (or even a field). Let us
assume then that $\Omega$ is a generalized topological ring; that is, a ring which is a
generalized topological group under addition and in which the multiplication is
continuous. Then $G$ is called an $\Omega$-modulus if $G$ is a generalized topological
group with operators $\Omega$ (i.e., conditions (i) and (ii) above hold) such that

(iii) $(\omega_1 \omega_2) g = \omega_1(\omega_2 g)$ (for $\omega_i \in \Omega$, $g \in G$),

(iv) $(\omega_1 + \omega_2) g = \omega_1 g + \omega_2 g$ (for $\omega_i \in \Omega$, $g \in G$).

In other words, addition and multiplication of operators are determined in the
natural fashion from $G$.
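
To see that conditions (iii) and (iv) are genuinely additional requirements, the following sketch compares two actions of the ring of integers on the discrete group $Z/12$. Both the group and the two actions (`scalar` and the deliberately artificial `squared`) are assumptions made for this example only. The check runs over a finite sample of operators, so it is a sanity test rather than a proof.

```python
N = 12                  # assumed example group G = Z/12
G = range(N)
SAMPLE = range(-4, 5)   # finite sample of integer operators (assumption)

def scalar(w, g):
    """Usual action: w * g mod N."""
    return (w * g) % N

def squared(w, g):
    """Hypothetical action: w acts as multiplication by w^2."""
    return (w * w * g) % N

def satisfies_iii(act):
    """(w1 w2) g == w1 (w2 g) on the sample."""
    return all(act(w1 * w2, g) == act(w1, act(w2, g))
               for w1 in SAMPLE for w2 in SAMPLE for g in G)

def satisfies_iv(act):
    """(w1 + w2) g == w1 g + w2 g on the sample."""
    return all(act(w1 + w2, g) == (act(w1, g) + act(w2, g)) % N
               for w1 in SAMPLE for w2 in SAMPLE for g in G)

scalar_is_modulus = satisfies_iii(scalar) and satisfies_iv(scalar)
squared_iii = satisfies_iii(squared)   # multiplicative condition holds
squared_iv = satisfies_iv(squared)     # additive condition fails
```

With the `squared` action, $Z/12$ is still a group with operators in the sense of (i) and (ii), and (iii) holds, but (iv) fails (take $\omega_1 = \omega_2 = 1$, $g = 1$), so it is not a modulus over the integers.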

If the standard coefficient group $G$ is now assumed to be an $\Omega$-modulus, simple
arguments will show that all the groups with operators $\Omega$ as described above
are in fact $\Omega$-moduli. Since the basic isomorphisms are still $\Omega$-allowable, we
conclude that the addendum to the universal coefficient group theorem still
holds in these circumstances.

It is sometimes convenient to use a set $\Omega$ of operators in which only the addition
or only the multiplication of operators is defined. More generally, we may
consider a space $\Omega$ in which only certain sums $\omega_1 + \omega_2$ and products $\omega_1 \omega_2$ are
defined (and continuous); we then require that conditions (iii) and (iv) above
hold only when the terms $\omega_1 \omega_2$ or $\omega_1 + \omega_2$ are defined. The derived groups
satisfy similar assumptions, and the universal coefficient theorem still holds.

If the coefficient group $G$ is locally compact, one can always take the operators
to form a ring, for any such group $G$ has its endomorphism ring $\Omega_G$ as a natural
ring of operators. Specifically, $\Omega_G$ is the additive group $\operatorname{Hom}\{G, G\}$ of
endomorphisms of $G$, with its usual topology (§3), and the multiplication $\omega_1 \omega_2$ of
two endomorphisms is defined by (iii) above. The requisite continuity properties
of $\omega_1 \omega_2$ and $\omega g$ are readily established, in virtue of the local compactness
of $G$. Furthermore, if $\Omega$ is any other space of operators on $G$, each $\omega \in \Omega$ determines
uniquely an endomorphism $\bar{\omega} \in \Omega_G$ with $\bar{\omega} g = \omega g$ for each $g$. The
correspondence $\omega \to \bar{\omega}$ is a continuous mapping of $\Omega$ into $\Omega_G$ which preserves whatever
sums and products may be present in $\Omega$ (assumed to satisfy (iii) and (iv)).
Thus, any group, derived from $G$, which is an $\Omega_G$-modulus is also a group with
operators $\Omega$, and any isomorphism between groups which is $\Omega_G$-allowable is
$\Omega$-allowable. This indicates that, for locally compact groups, one may restrict
attention to operators of the ring $\Omega_G$.

The most useful case is that in which the coefficient group is a field F, which 
is its own ring of operators. In this case all the homology groups, groups of 
homomorphisms, etc., become F-moduli; that is, vector spaces over F. 

All these remarks suggest the following rather negative conclusion: although
in many applications it is convenient to consider a homology theory over coefficients
which form more than merely a group, no new topological invariants can be so
obtained.


Appendix B. Solenoids 

Here we compute the one-dimensional homology group $H^1(\Sigma, I)$ of regular
cycles for the solenoid 33 $\Sigma$, or the isomorphic group $\operatorname{Ext}\{I, \operatorname{Char} \Sigma\}$ (see (45.4)).

A solenoid is uniquely determined by a Steinitz G-number; that is, by a
formal (infinite) product $G = \prod p_i^{e_i}$ of distinct primes with exponents $e_i$ which
are non-negative integers or $\infty$. Any such number $G$ can be represented (in
many ways) as a formal product $G = a_1 a_2 \cdots a_n \cdots$ of ordinary integers $a_i$;
if $G$ is not an ordinary integer, we can take each $a_i \geq 2$. Given such a representation
of $G$, take replicas $P_n$ of the additive group $P$ of real numbers modulo 1,
and let $\phi_n$ be the homomorphism which wraps $P_n$ $a_{n-1}$ times around $P_{n-1}$. The
$P_n$ then form an inverse system of groups, relative to the homomorphisms
$\phi_{n+m,n} = \phi_{n+1} \cdots \phi_{n+m}$, and the solenoid $\Sigma_G$ is defined as the limit $\Sigma_G = \operatorname{Lim} P_n$.
Therefore $\operatorname{Char} \Sigma_G = \operatorname{Lim} \operatorname{Char} P_n$, where the groups form a direct system
under the dual correspondences $\phi_n^*$. Here $\operatorname{Char} P_n$ is an isomorphic replica $I_n$
of the additive group of integers, and $\phi_{n+1}^*$ maps $I_n$ into $I_{n+1}$ by multiplying each
$x \in I_n$ by $a_n$. Therefore $\operatorname{Char} \Sigma_G = \operatorname{Lim} I_n$ is a subgroup $N_G$ of the additive
group of rational numbers, consisting of all rationals of the form $a/d_n$, with $a$
an integer and $d_n = a_1 \cdots a_{n-1}$. Alternatively, $N_G$ consists of all rationals
$r/s$ with $s$ a "divisor" of $G$; hence $N_G$ and $\Sigma_G$ are uniquely determined by $G$,
and are independent of the representation $G = a_1 a_2 \cdots a_n \cdots$.

33 Solenoids were studied by L. Vietoris, Math. Annalen 97 (1927), p. 459, and more in
detail by D. van Dantzig, Fundam. Math. 15 (1930), pp. 102-135. See also L. Pontrjagin
[8], p. 171.
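
The description of $N_G$ can be tried out on a finite prefix of a representation $G = a_1 a_2 \cdots$. The sketch below is only an illustration under assumptions: the prefix `a = [2, 3, 2, 3]` and the helper names `d` and `in_N_G_up_to` are not from the paper, and membership is tested only against the finitely many $d_n$ available from the prefix.

```python
from fractions import Fraction
from math import prod

def d(a, n):
    """d_n = a_1 * ... * a_{n-1} (so d_1 = 1) for a finite prefix a = [a_1, a_2, ...]."""
    return prod(a[:n - 1])

def in_N_G_up_to(a, q):
    """True if q = b / d_n for some integer b and some n <= len(a) + 1."""
    return any((q * d(a, n)).denominator == 1 for n in range(1, len(a) + 2))

a = [2, 3, 2, 3]  # assumed prefix of G = 2 * 3 * 2 * 3 * ...
# Generators r_n = 1/d_n, stored 0-based: r[k] is r_{k+1}.
r = [Fraction(1, d(a, n)) for n in range(1, len(a) + 2)]
# Relations a_n r_{n+1} = r_n, again 0-based: a[k] * r[k + 1] == r[k].
relations_hold = all(a[k] * r[k + 1] == r[k] for k in range(len(a)))
```

For this prefix the available denominators are 1, 2, 6, 12, 36, so for instance 5/12 is accepted while 1/5 is not.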

A Steinitz G-number which is not an ordinary integer also determines a certain
topological ring. Set $G = a_1 a_2 \cdots a_n \cdots$, $d_n = a_1 \cdots a_{n-1}$. In the ring $I$ of
integers, introduce as neighborhoods of zero the sets $(d_n)$ of all multiples of $d_n$.
Since the intersection of all these $(d_n)$ is the zero element of $I$, these neighborhoods
make $I$ a topological ring. It can be embedded in a unique fashion in a
minimal complete topological ring $I_G \supset I$, so that every element of $I_G$ is a
limit of a sequence of integers, under the given topology. This is one of the
$b_\nu$-adic rings introduced by D. van Dantzig. 34 The additive group of $I_G$ can be
alternatively described as a limit of an inverse sequence; specifically, the factor
group $I/(d_{n+1})$ has a natural homomorphism into $I/(d_n)$, and the limit group is
$I_G \cong \operatorname{Lim} I/(d_n)$. In the special case when $G = p^\infty$ is an infinite power of a
prime $p$, $I_G$ is the ordinary ring of $p$-adic integers.
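
The inverse-limit description can be made concrete by recording an element of $I_G$ through its residues modulo $d_1, d_2, \ldots$ up to some finite stage. The following is a hedged sketch: the function names and the truncation to a finite prefix `a` are assumptions for illustration. It checks that the residues of an integer are compatible, and that in the 2-adic case the series $1 + 2 + 4 + \cdots$ has the same residues as $-1$.

```python
from math import prod

def d(a, n):
    """d_n = a_1 * ... * a_{n-1}, with d_1 = 1."""
    return prod(a[:n - 1])

def residues(x, a):
    """Images of the integer x in I/(d_1), ..., I/(d_{k+1}), where k = len(a)."""
    return [x % d(a, n) for n in range(1, len(a) + 2)]

def is_compatible(seq, a):
    """seq[k] is a residue mod d_{k+1}; each must reduce to the previous one."""
    return all(seq[k + 1] % d(a, k + 1) == seq[k] for k in range(len(seq) - 1))

def limit_residues(b, a):
    """Residues mod d_2, ..., d_{m+1} of Lim [b_1 + d_2 b_2 + ... + d_n b_n], m = len(b)."""
    out, s = [], 0
    for n in range(1, len(b) + 1):
        s += d(a, n) * b[n - 1]           # partial sum up to the n-th term
        out.append(s % d(a, n + 1))       # later terms are multiples of d_{n+1}
    return out

a = [2] * 6                               # assumed prefix of G = 2^oo (2-adic case)
minus_one = residues(-1, a)
series = limit_residues([1] * 6, a)       # 1 + 2 + 4 + ... taken modulo 2, 4, 8, ...
```

The design choice here is to store only finitely many residues, which is enough to compare elements up to the neighborhood $(d_{k+1})$ but does not represent an element of $I_G$ exactly.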

Theorem. If $G$ is any Steinitz G-number which is not an ordinary integer,
$\Sigma_G$ the corresponding solenoid, and $I_G$ the corresponding complete ring containing
the ring $I$ of integers, then

(B.1) $\operatorname{Ext}\{I, \operatorname{Char} \Sigma_G\} \cong I_G/I$.

Proof. As above, $\operatorname{Char} \Sigma_G$ is a group $N_G$ of rationals, generated by the
numbers $r_n = 1/d_n$ with relations $a_n r_{n+1} = r_n$. Therefore $N_G$ can be represented
as $F/R$, where $F$ is a free group with generators $z_1, z_2, \cdots$, and $R$ the
subgroup with generators $y_n = a_n z_{n+1} - z_n$, $n = 1, 2, \cdots$. By the fundamental
theorem on group extensions

(B.2) $\operatorname{Ext}\{I, \operatorname{Char} \Sigma_G\} \cong \operatorname{Hom}\{R, I\}/\operatorname{Hom}\{F \mid R, I\}$.

Let $\theta \in \operatorname{Hom}\{R, I\}$ and set

$x(\theta) = \operatorname{Lim}_{n \to \infty} [\theta y_1 + d_2 \theta y_2 + \cdots + d_n \theta y_n]$.

Then $x(\theta)$ is a well-defined element of $I_G$, and $\theta \to x(\theta)$ is a homomorphic
mapping of $\operatorname{Hom}\{R, I\}$ into $I_G$ and thus, derivatively, into $I_G/I$. We assert
that the kernel of the latter mapping is $\operatorname{Hom}\{F \mid R, I\}$.

Assume first that $\theta \in \operatorname{Hom}\{F \mid R, I\}$, and let $\theta^*$ be an extension of $\theta$ to $F$.
Then

$\theta(y_1 + d_2 y_2 + \cdots + d_n y_n) = -\theta^* z_1 + d_{n+1} \theta^* z_{n+1}$,

so that the limit $x(\theta)$ is $-\theta^* z_1$, which is an integer in $I$. Conversely, suppose
that $x(\theta) \in I$, and set $x(\theta) = -c_1$. We then have

$\theta y_1 + d_2 \theta y_2 + \cdots + d_n \theta y_n \equiv -c_1 \pmod{d_{n+1}}$.

By successive applications of this condition we find integers $c_n$ with $\theta y_n =
a_n c_{n+1} - c_n$. The homomorphism $\theta^* z_n = c_n$ then provides an extension of $\theta$
to $F$, so that $\theta \in \operatorname{Hom}\{F \mid R, I\}$.

34 Math. Annalen 107 (1932), pp. 587-626; Compositio Math. 2 (1935), pp. 201-223.
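
Both directions of this kernel argument reduce to integer arithmetic that can be checked numerically on a finite prefix. In the sketch below the lists `a` (the factors $a_n$) and `c` (candidate values $c_n = \theta^* z_n$) are made-up sample data, and the helper names are hypothetical. It verifies the telescoping identity and then recovers the $c_n$ from the values $\theta y_n$ by the successive divisions described above.

```python
from math import prod

def d(a, n):
    """d_n = a_1 * ... * a_{n-1}, with d_1 = 1."""
    return prod(a[:n - 1])

def theta_on_y(c, a):
    """Given c[k] = theta*(z_{k+1}), return the values theta(y_n) = a_n c_{n+1} - c_n."""
    return [a[k] * c[k + 1] - c[k] for k in range(len(c) - 1)]

def partial_sum(b, a, n):
    """theta y_1 + d_2 theta y_2 + ... + d_n theta y_n, where b[i-1] = theta(y_i)."""
    return sum(d(a, i) * b[i - 1] for i in range(1, n + 1))

def recover_c(b, a, c1):
    """Solve theta y_n = a_n c_{n+1} - c_n step by step; None if a division is inexact."""
    c = [c1]
    for k, bk in enumerate(b):
        numerator = bk + c[-1]
        if numerator % a[k] != 0:
            return None                 # c1 is not compatible with an extension
        c.append(numerator // a[k])
    return c

a = [2, 3, 5, 2]                        # sample factors a_1..a_4 (assumption)
c = [4, -1, 7, 0, 3]                    # sample values theta*(z_1..z_5) (assumption)
b = theta_on_y(c, a)
telescopes = all(
    partial_sum(b, a, n) == -c[0] + d(a, n + 1) * c[n]
    for n in range(1, len(b) + 1)
)
```

Calling `recover_c(b, a, 4)` returns the original list `c`, while a wrong starting value such as `recover_c(b, a, 5)` returns `None` because the first division is inexact.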

Every element in $I_G$ is a limit of integers, hence has the form $\operatorname{Lim}
[b_1 + d_2 b_2 + \cdots + d_n b_n]$; therefore $\theta \to x(\theta)$ is a mapping onto $I_G$. We thus
have

(B.3) $\operatorname{Hom}\{R, I\}/\operatorname{Hom}\{F \mid R, I\} \cong I_G/I$.

The correspondence is topological, as one may readily verify that both (generalized
topological) groups carry the trivial topology in which the only open
sets are zero and the whole space. Thus (B.2) and (B.3) prove the isomorphism
(B.1).

By cardinal number considerations, one shows that the group $I_G/I$ is uncountable,
hence not void. The formula (B.1) gives at once all the special
properties of the homology group of the solenoid, as found by Steenrod [10] in
his partial determination of this group.

The University of Michigan 
Harvard University 

Bibliography 

[1] Alexandroff, P. and Hopf, H. Topologie, vol. I, Berlin, 1935.

[2] Baer, R. Erweiterungen von Gruppen und ihren Isomorphismen, Math. Zeit. 38 (1934), pp. 375-416.

[3] Čech, E. Les groupes de Betti d'un complexe infini, Fund. Math. 25 (1935), pp. 33-44.

[4] Eilenberg, S. Cohomology and continuous mappings, Ann. of Math. 41 (1940), pp. 231-251.

[5] Eilenberg, S. and Mac Lane, S. Infinite Cycles and Homologies, Proc. Nat. Acad. Sci. U. S. A. 27 (1941), pp. 535-539.

[6] Hall, M. Group Rings and Extensions, I, Ann. of Math. 39 (1938), pp. 220-234.

[7] Lefschetz, S. Algebraic Topology, Amer. Math. Soc. Colloquium Series, vol. 27, New York, 1942.

[8] Pontrjagin, L. Topological Groups, Princeton, 1939.

[9] Steenrod, N. E. Universal Homology Groups, Amer. Journ. of Math. 58 (1936), pp. 661-701.

[10] Steenrod, N. E. Regular Cycles of Compact Metric Spaces, Ann. of Math. 41 (1940), pp. 833-851.

[11] Turing, A. M. The Extensions of a Group, Compositio Math. 5 (1938), pp. 357-367.

[12] Weil, A. L'intégration dans les groupes topologiques et ses applications, Actualités Sci. et Ind. No. 869, Paris, 1940.

[13] Whitney, H. Tensor Products of Abelian Groups, Duke Math. Journ. 4 (1938), pp. 495-528.

[14] Whitney, H. On matrices of integers and combinatorial topology, Duke Math. Journ. 3 (1937), pp. 35-45.

[15] Zassenhaus, H. Lehrbuch der Gruppentheorie, Hamburg. Math. Einzelschriften, 21, Leipzig, 1937.




INDEX 

Albert, A. A. Quadratic forms permitting composition. 161 

Albert, A. A. Non-associative algebras. I. Fundamental concepts 

and isotopy. 685 

Albert, A. A. Non-associative algebras. II. New simple algebras. 708 

Aronszajn, N. Le correspondant topologique de l'unicité dans la théorie

des équations différentielles. 730

Birkhoff, G. Lattice-ordered groups. 298 

Bochner, S. On a theorem of Tannaka and Krein. 56 

Bochner, S., and Phillips, R. S. Absolutely convergent Fourier expan¬ 
sions for non-commutative normed rings. 409 

Blackwell, D. Idempotent Markoff chains. 560 

Chern, Shiing-shen. On integral geometry in Klein spaces. 178 

Chern, Shiing-shen. The geometry of isotropic surfaces. 545

Doob, J. L. The Brownian movement and stochastic equations. 351 

Eilenberg, S. Banach space methods in topology. 568 

Eilenberg, S., and MacLane, S. Group extensions and homology. 757 

Erdos, P. On the uniform distribution of the roots of certain polynomials. 59 

Erdos, P. On the asymptotic density of the sum of two sequences. 65 

Erdos, P. On the law of the iterated logarithm. 419 

Erdos, P. On an elementary proof of some asymptotic formulas in the 

theory of partitions. 437 

Erdos, P., and Szego, G. On a problem of I. Schur. 451

Freudenthal, H. Neuaufbau der Endentheorie. 261

Freudenthal, H. Simplizialzerlegungen von beschränkter Flachheit. 580

Fubini, G. On Abel’s converse theorem. 471 

Garabedian, H. L. Hausdorff integral transformations. 501 

Grove, V. G. The transformation T of congruences. 623 

Hadamard, J. The problem of diffusion of waves. 510 

Halmos, P., and von Neumann, J. Operator methods in classical me¬ 
chanics II. 332 

Heins, M. H. On the continuation of a Riemann surface. 280 

Hooke, R. Linear p-adic groups and their Lie algebras. 641 

Kakutani, S. A proof that there exists a circumscribing cube around any

bounded closed convex set in $R^3$. 739

Kakutani, S. An extremum problem in product measure. 742 

Kolchin, E. R. Extensions of differential fields. 724 

Mackey, G. W. Isomorphisms of normed linear spaces. 244 

MacLane, S., and Eilenberg, S. Group extensions and homology. 757 

Mann, H. B. A proof of the fundamental theorem on the density of sums 

of sets of positive integers. 523 

Manning, Rhoda. On the derivatives of the sections of bounded power 
series. 617 



Mayer, W. A new homology theory. 370 

Mayer, W. A new homology theory II. 594 

Nesbitt, C. J., and Thrall, R. M. On the modular representation of the 

symmetric group. 656 

Neumann, J. v., and Halmos, P. Operator methods in classical me¬ 
chanics II. 332 

Newman, M. H. A. On theories with a combinatorial definition of “equiv¬ 
alence” . 223 

Phillips, R. S. and Bochner, S. Absolutely convergent Fourier expan¬ 
sions for non-commutative normed rings. 409 

Scott, W. M. On matrix algebras over an algebraically closed field. 147

Shiffman, M. Unstable minimal surfaces with several boundaries. 197 

Siegel, C. L. Iteration of analytic functions. 607 

Siegel, C. L. Note on automorphic functions of several variables. 613 

Smiley, M. F. A remark on S. Kakutani’s characterization of (L) spaces. 528 
Steenrod, N. E. Topological methods for the construction of tensor 

functions. 116 

Szász, O. Some new summability methods with applications. 69

Szego, G., and Erdos, P. On a problem of I. Schur. 451 

Thrall, R. M. On the decomposition of modular tensors (I). 671 

Thrall, R. M., and Nesbitt, C. J. On the modular representation of 

the symmetric group. 656 

Trjitzinsky, W. J. Analytic theory of singular elliptic partial differential 

equation. 1 

Ward, M. The closure operators of a lattice. 191 

Weyl, H. On the differential equations of the simplest boundary-layer 

problems. 381 

Whitehead, G. W. Homotopy properties of the real orthogonal groups. 132 
Whitehead, G. W. On the homotopy groups of spheres and rotation 

groups. 634 

Whitman, P. M. Free lattices II. 104 

Young, L. C. Generalized surfaces in the calculus of variations. I. 

Generalized Lipschitzian surfaces. 84 

Young, L. C. Generalized surfaces in the calculus of variations. II. 530 

Zariski, O. A simplified proof for the resolution of singularities of an 

algebraic surface. 583 
































